The DeepSeek You've Been Waiting For Has Long Changed

marsbitОпубликовано 2026-04-15Обновлено 2026-04-15

Введение

The article discusses the delayed release of DeepSeek V4, a highly anticipated AI model in China, and explores the reasons behind its slowed development. Initially a leader in the global AI race, DeepSeek has fallen behind competitors like OpenAI, Anthropic, and Google, which release major updates every few months. A key factor is DeepSeek's shift in focus due to national strategic priorities. In early 2025, the Chinese government encouraged the company to use Huawei’s Ascend processors instead of NVIDIA’s GPUs, aligning with broader efforts to achieve technological self-reliance. DeepSeek attempted to train its models on Huawei’s Ascend 910C chips but faced technical challenges, including instability and communication issues during distributed training. As a result, the company continued using NVIDIA hardware for training while only using Ascend chips for inference. In 2026, DeepSeek prioritized adapting V4 to Huawei’s new Ascend 950PR and Cambricon chips, aiming for a full migration from NVIDIA’s CUDA to Huawei’s CANN framework. This adaptation process, particularly ensuring precision alignment across hardware, consumed significant time and resources, slowing down model iteration. The delay also reflects DeepSeek’s evolving role from a purely market-driven entity to a "national mission-oriented" company. This shift has come at a cost: the model now lags behind competitors in areas like code generation and multimodal capabilities, and the company has faced talent drain, ...

By | World Model Workshop

There is now only one question mark left in China's internet: Will DeepSeek V4 be released or not?

Over the past year, the global model competition has already entered a fierce, head-to-head battle mode.

Giants like OpenAI, Anthropic, and Google have maintained a major iteration cycle of 2-3 months, or even just 1 month, rapidly pushing new versions to the market for validation, iteration, and re-validation.

However, DeepSeek has significantly slowed down its major version updates over the past 15 months. V4 has been delayed multiple times, and its pace has clearly fallen behind, transforming from an industry leader to one that is a step slow.

The anxiety of waiting has evolved into a vortex of suspicion.

In early April, some media suddenly hinted: V4 might be released in the coming weeks, but don't set your expectations too high for DeepSeek V4.

Why would such a long-awaited release come with a preemptive warning?

What exactly happened to DeepSeek this year?

The Burden of Localization

The truth might be hidden in a piece of news that most people overlooked.

In January 2025, a Reuters report revealed a detail: After the release of DeepSeek R1, relevant authorities encouraged DeepSeek to use Huawei's Ascend processors instead of continuing to rely on NVIDIA.

In that context, the word "encouraged" carried far more weight than its literal meaning.

DeepSeek is no ordinary startup; it is the first successful example of China's AI breaking through U.S. technological封锁 (blockade).

This symbolic significance quickly turned DeepSeek from a technology company into a key piece on the chessboard of the national strategy for independent and controllable technology.

Shortly after, in February, Liang Wenfeng attended that highly anticipated symposium for private enterprises.

He was seated in the front row, alongside tech giants like Ma Huateng, Ren Zhengfei, and Lei Jun, standing shoulder-to-shoulder as representatives of the national team for new quality productive forces.

The gears of policy orientation began to turn from there.

According to foreign media reports, DeepSeek did indeed attempt to use Huawei's Ascend 910C chips to train its next-generation model in early 2025.

However, the process encountered multiple technical obstacles: insufficient training stability, frequent crashes in large-scale distributed scenarios, and inter-chip communication speeds falling short of expectations.

Huawei dispatched a team of engineers to DeepSeek's office to provide on-site support, but ultimately failed to resolve the adaptation issues during the training phase.

The result was a compromise: DeepSeek continued to use NVIDIA GPUs for the training phase, while Ascend chips were only used for the inference环节 (segment).

This means that, at least in the core环节 of training, DeepSeek spent nearly a year on trial and error.

But the adaptation work did not stop. In 2026, new developments emerged regarding the V4 version.

According to leaks, DeepSeek did not grant NVIDIA early testing access. Instead, it prioritized giving the pre-release version to Huawei's new-generation Ascend 950PR chips for adaptation.

Simultaneously, to分散风险 (spread risks), it also adapted for Cambricon chips.

However, technical challenges remained significant.

According to media reports, the goal this time was to complete the migration at the underlying code level, moving entirely from NVIDIA's CUDA ecosystem to Huawei's CANN framework, achieving full-chain localization replacement for both training and inference.

According to sources close to the project, the core difficulty of the adaptation work lies in precision alignment—ensuring the model outputs consistent results across different hardware ecosystems, which involves extensive adjustments to the underlying code.

This incurred a time cost.

While global mainstream manufacturers maintained a 2-3 month model iteration rhythm, DeepSeek became increasingly slower. During this period, a significant portion of its technical resources were likely invested in domestic chip adaptation.

After all, there is indeed a generational performance gap, along with differences in ecosystem maturity and toolchain completeness, between domestic chips and NVIDIA. The model adaptation process is incredibly time-consuming.

This marks a clear departure from DeepSeek's initial path, which was purely focused on pursuing model performance improvements.

Linkages at the industry chain level were also happening simultaneously.

In early 2026, the market传出 (circulated) news that Alibaba, ByteDance, and Tencent had placed orders with Huawei for hundreds of thousands of Ascend 950PR chips.

A reasonable speculation is: leading cloud vendors are waiting for the validation results of DeepSeek V4 to assess the practical usability of domestic chips in large-scale AI training.

If V4 succeeds, Huawei's 950PR will transform from a technical sample into a commercially viable product. If it fails to meet expectations, it will essentially map out the current capability boundaries of domestic chips for the industry.

Considering Liang Wenfeng's一贯极高 (consistently very high) standards for model releases—never releasing until expectations are met—the imminent launch of V4 likely means it has passed effectiveness tests on the inference side.

If successfully validated, this would be a critical step for DeepSeek, and indeed for China's entire AI sector, towards independent and controllable technology.

The Cost of Identity Transformation

By proactively undertaking the重任 (heavy responsibility) of validating the domestic computing power ecosystem, DeepSeek's choice makes it resemble more of a national mission-oriented company rather than a purely profit-driven market player.

But the costs of this transformation are obvious: a short-term slowdown in pace, increased pressure on talent retention, and a temporary pause in competitiveness.

According to independent evaluations and community data from March-April 2026, DeepSeek's code generation capability in third-party benchmark tests has been significantly surpassed by the Claude 4 series (Opus 4.6 / Sonnet 4.6).

DeepSeek's multimodal processing capabilities are also primarily limited to text + images, lagging far behind Claude and GPT's performance in image analysis, computer use, and video understanding.

Entering 2026, DeepSeek shifted its product focus towards the more challenging field of Agent system engineering.

Based on current community feedback, DeepSeek is接近 (close to) the first tier in code Agents and Chinese search Agents. However, there remains a clear systems engineering gap compared to top international models like OpenAI and Google in areas such as multi-tool coordination, long-chain task execution, and robustness in real-world environments.

This gap may not necessarily indicate a decline in technical ability, but rather seems like the result of trade-offs made between market competition and national strategy.

The cost at the organizational level is equally apparent.

Starting in the second half of 2025, key members of the DeepSeek core team began to leave.

According to confirmation by LatePost, Wang Bingxuan (core author of the first-generation large model), Guo Daya (core author of R1), Wei Haoran (OCR lead), and Ruan Chong (multimodal lead)相继离职 (left one after another).

Behind these names lies the technical积淀 (accumulation) of DeepSeek from V1 to R1.

The reasons for the departures are complex, but the relative disadvantage of the compensation system is a visible factor.

Headhunters revealed that competitors offered packages that were "two to three times" those of DeepSeek, with some major companies directly offering total compensation in the eight-figure range.

As a startup without external funding (its parent company is幻方量化 (Huanfang Quant)), DeepSeek's salaries, while绝对值不低) (not low in absolute terms), cannot match the equity incentives and valuation premiums offered by market-oriented giants like ByteDance, Alibaba, and Tencent.

Liang Wenfeng has begun promoting company valuation work, clarifying option pricing to give the team more certainty.

But against the backdrop of peers like智谱 (Zhipu) and MiniMax going public and their stock prices soaring, the pressure to retain top talent remains significant.

The DeepSeek of today is陷入 (caught in) a kind of identity模糊 (ambiguity).

It still needs commercialization, it still needs to retain talent, but simultaneously it is burdened with the expectations of domestic adaptation.

The conflict arising from this dual identity is perhaps the deep-seated logic behind DeepSeek's increasing slowness over this past year.

Consequently, market expectations for V4's capabilities are also being adjusted downward.

It might not become the blockbuster, record-breaking model that sweeps across screens once again, but it could be a milestone in industrial significance, proving that China's cutting-edge models can achieve usability within a domestic hardware ecosystem.

The report card for V4 might be more important for the long-term direction of China's AI industry.

Связанные с этим вопросы

QWhat is the main reason for DeepSeek V4's delayed release according to the article?

AThe main reason is DeepSeek's significant technical resources being diverted to adapt its models to domestic hardware ecosystems, particularly Huawei's Ascend processors, as part of China's national strategy for technological self-reliance, rather than focusing solely on performance improvements.

QWhich domestic chip manufacturers is DeepSeek collaborating with for hardware adaptation?

ADeepSeek is primarily collaborating with Huawei (using its Ascend 950PR chips) and also synchronously adapting to Cambricon chips to diversify risks and support domestic hardware ecosystems.

QWhat challenges did DeepSeek face when adapting to Huawei's Ascend chips?

ADeepSeek encountered multiple technical obstacles, including insufficient training stability, frequent crashes in large-scale distributed scenarios, and suboptimal inter-chip communication speeds. Precision alignment across different hardware ecosystems also required extensive low-level code adjustments.

QHow has DeepSeek's role evolved beyond being a purely market-driven AI company?

ADeepSeek has transitioned into a 'national mission-oriented company,' actively承担 (undertaking) the responsibility of validating domestic computing ecosystems and supporting China's autonomous and controllable technology strategy, which has impacted its release节奏 (rhythm) and competitive pace.

QWhat are the competitive gaps mentioned between DeepSeek and international models like Claude and GPT?

ADeepSeek lags behind in code generation capabilities compared to Claude 4 series, and its multimodal processing is limited to text and images, falling short of Claude and GPT's advancements in image analysis, computer use, and video understanding. It also trails in Agent system engineering, particularly in multi-tool coordination, long-chain task execution, and real-world robustness.

Похожее

South Korean Exchanges 'Battle' Regulators, Challenging the Boundaries of Enforcement and Legislation

South Korea's cryptocurrency industry is engaged in a rare, direct confrontation with regulators. The Financial Intelligence Unit (FIU), the primary anti-money laundering (AML) watchdog, has recently imposed heavy penalties on major exchanges like Upbit and Bithumb for alleged violations involving unregistered overseas VASPs and AML procedures. However, exchanges are now actively challenging these actions in court and through industry associations. In a significant shift, the Seoul Administrative Court ruled in favor of Upbit's operator, Dunamu, overturning part of an FIU-ordered business suspension. The court found the FIU's penalty criteria and justification insufficiently clear. Similarly, the court suspended the enforcement of a six-month business suspension against Bithumb pending a final ruling, citing potential irreversible harm to the exchange. Beyond legal battles, the industry is contesting proposed legislative amendments. The Digital Asset eXchange Alliance (DAXA) strongly opposes a draft rule that would mandate Suspicious Transaction Reports (STRs) for all crypto transfers over 10 million KRW (~$6,800). DAXA argues this "poison pill" clause violates legal principles and would overwhelm the STR system, increasing reports from 63,000 to an estimated 5.45 million annually for major exchanges, thereby crippling effective AML monitoring. This conflict highlights a structural tension in South Korea's crypto governance: comprehensive digital asset laws are still developing, while regulators rely heavily on AML enforcement. The industry's move from passive compliance to active legal and legislative challenges signifies a new phase, pressing for clearer rules and more proportionate enforcement. While short-term disputes may intensify, this clash could ultimately lead to a more mature and sustainable regulatory framework for South Korea's vibrant crypto market.

marsbit31 мин. назад

South Korean Exchanges 'Battle' Regulators, Challenging the Boundaries of Enforcement and Legislation

marsbit31 мин. назад

Earnings Report, CLARITY Bill, and Warsh's Arrival: CRCL Faces Three Major Tests This Week

Circle (CRCL) faces three major tests this week that will significantly impact its stock price and valuation. First, on May 11, it will release its Q1 2026 earnings. Key metrics to watch are overall revenue and EPS, the proportion of revenue paid to distributors like Coinbase, and growth in non-interest income. The market also awaits Circle's stance on renegotiating its revenue-sharing contract with Coinbase, which expires in August. Second, on May 14, the U.S. Senate Banking Committee will vote on the CLARITY Act. This bill aims to establish a clear federal regulatory framework for digital assets. A recent compromise proposal would ban yield on static stablecoin reserves but allow rewards for active ones. Its passage, currently seen as likely by prediction markets, would be a major positive for the industry and Circle. Finally, on May 15, Kevin Warsh will succeed Jerome Powell as Federal Reserve Chair. Warsh's proposed policy of quantitative tightening combined with interest rate cuts could pose a short-term headwind for CRCL, as lower rates reduce Circle's primary revenue from USDC reserves. However, long-term prospects may improve as Warsh, a known crypto investor who opposes a Fed CBDC, is seen as potentially favorable to regulated private stablecoins like USDC.

marsbit36 мин. назад

Earnings Report, CLARITY Bill, and Warsh's Arrival: CRCL Faces Three Major Tests This Week

marsbit36 мин. назад

After 50x Storage Surge, Justin Sun Always Looks to the Next Decade

Sun Yuchen, known for his controversial stunts like a $30 million lunch with Warren Buffett (canceled due to a kidney stone) and eating a $6.2 million duct-taped banana, is often overshadowed by a significant fact: his decade-long track record of spotting major investment trends. In 2016, he famously advised young people to invest in Bitcoin, Nvidia, Tesla, and Tencent instead of buying property. A hypothetical $20,000 investment in Nvidia and Tesla from that list would now be worth over 50 million RMB. His latest major call was on November 6, 2025, predicting a "50x storage opportunity" tied to the AI boom, which materialized with Sandisk's stock surging nearly 50-fold by 2026. Looking ahead, Sun now focuses on the next frontier: Physical AI. He identifies four key areas: 1. **Embodied AI/Robotics**: He sees this reaching its "iPhone moment," with companies like UBTech and Galaxy General leading in commercialization. 2. **Drones**: Viewed as the first commercially viable form of Physical AI, revolutionizing sectors from warfare (e.g., AeroVironment's Switchblade) to logistics. 3. **Spatial Computing**: Beyond VR, it's about AI understanding physical space, a foundational technology for robotics and autonomous systems, exemplified by Apple's Vision Pro. 4. **Space Exploration**: After a 2025 suborbital flight with Blue Origin, Sun advocates for space as the ultimate frontier, discussing blockchain's potential role in space asset management and data transactions. His investment philosophy involves betting on entire, inevitable trends rather than single companies. For robotics, he sees Tesla (the body/manufacturer) and Nvidia (the brain/AI platform) as complementary plays. In defense drones, he highlights companies making tanks obsolete (AeroVironment) and those augmenting fighter jets (Kratos). For space, he participated in Blue Origin's flight and anticipates SpaceX's potential IPO to redefine the sector's valuation. Sun Yuchen's vision frames the next two decades not as a revolution in information flow (like the internet), but in the fundamental operation of the physical world through AI-powered robots, autonomous systems, and spatial intelligence, ultimately extending human and AI activity into space. While many still focus on conventional assets, he continues to look toward the next technological horizon.

marsbit1 ч. назад

After 50x Storage Surge, Justin Sun Always Looks to the Next Decade

marsbit1 ч. назад

The Billionaires Behind the Most Expensive Midterm Election in History

"The Most Expensive Midterm Elections and Their Billionaire Backers" This analysis details the unprecedented scale of spending in the 2026 midterm elections, highlighting the key billionaire donors shaping the political landscape. Jeff Yass, founder of Susquehanna International Group, has contributed over $81 million, ranking third among individual donors behind George Soros ($102.6M) and Elon Musk ($84.8M). Yass is a major donor to Trump's MAGA Inc. and supports school choice and various candidates. Overall, federal committees have raised over $4.7 billion this cycle, with political ad spending projected to reach $10.8 billion. Republican-aligned groups are significantly out-raising their Democratic counterparts. "Dark money" from undisclosed sources continues to grow. The core stakes involve control of Congress and policy direction for Trump's final term. Donors are also motivated by specific issues: Sergey Brin and Chris Larsen are funding opposition to a proposed California wealth tax and supporting crypto-friendly policies. Other top donors include OpenAI's Greg Brockman and his wife Anna ($50M total to MAGA Inc. and an AI-focused PAC), Richard Uihlein ($45.3M to conservative causes), venture capitalists Marc Andreessen and Ben Horowitz (each over $44M to crypto/AI PACs and MAGA Inc.), Miriam Adelson ($42.6M to GOP leadership PACs), Paul Singer ($33.9M), and Diane Hendricks ($25.8M to MAGA Inc.). The article notes that the peak fundraising period is still ahead, with major primaries approaching.

marsbit1 ч. назад

The Billionaires Behind the Most Expensive Midterm Election in History

marsbit1 ч. назад

The Largest IPO in History Is Approaching, Surpassing SpaceX, 28 Years of AI Self-Iteration, Countdown to Intelligence Explosion

"Anthropic Nears Trillion-Dollar IPO, Fueled by Explosive Growth and 2028 'Intelligence Explosion' Warning Anthropic is considering a deal valuing the AI company near $1 trillion, potentially leading to one of the largest IPOs ever and surpassing SpaceX. Its revenue has skyrocketed, with Annual Recurring Revenue (ARR) reaching $45 billion in May 2026—a 500% increase in just five months. This vertical growth curve is attributed to its key products, Claude Code and Cowork, dominating AI coding and enterprise collaboration. Beyond commercial success, co-founder Jack Clark issued a pivotal warning in an interview: there is a greater than 50% chance that by the end of 2028, AI systems will achieve recursive self-improvement—the ability to autonomously build a 'better version' of themselves, initiating an 'intelligence explosion.' This prophecy underpins the company's astronomical valuation, as the market prices in the potential for transformative and disruptive AI. Further signaling its ambition, Anthropic formed a $1.5 billion joint venture with Goldman Sachs and Blackstone, aiming to disrupt traditional consulting firms like McKinsey by deploying Claude AI for complex strategic work. This move tests AI's capacity to replace high-level cognitive labor, a precursor to its predicted autonomous evolution. The narrative presents a dual future: unprecedented economic opportunity alongside significant risks like economic restructuring and security threats. Anthropic's meteoric rise and Clark's 2028 prediction frame the coming years as a countdown to a potential technological singularity."

marsbit1 ч. назад

The Largest IPO in History Is Approaching, Surpassing SpaceX, 28 Years of AI Self-Iteration, Countdown to Intelligence Explosion

marsbit1 ч. назад

Торговля

Спот

Фьючерсы

Обсуждения

Добро пожаловать в Сообщество HTX. Здесь вы сможете быть в курсе последних новостей о развитии платформы и получить доступ к профессиональной аналитической информации о рынке. Мнения пользователей о цене на S (S) представлены ниже.

The DeepSeek You've Been Waiting For Has Long Changed

Введение

The Burden of Localization

The Cost of Identity Transformation

Связанные с этим вопросы

Похожее

South Korean Exchanges 'Battle' Regulators, Challenging the Boundaries of Enforcement and Legislation

Earnings Report, CLARITY Bill, and Warsh's Arrival: CRCL Faces Three Major Tests This Week

After 50x Storage Surge, Justin Sun Always Looks to the Next Decade

The Billionaires Behind the Most Expensive Midterm Election in History

The Largest IPO in History Is Approaching, Surpassing SpaceX, 28 Years of AI Self-Iteration, Countdown to Intelligence Explosion

Торговля

Популярные статьи

Как купить S

Sonic: Обновления под руководством Андре Кронье – новая звезда Layer-1 на фоне спада рынка

HTX Learn: Пройдите обучение по "Sonic" и разделите 1000 USDT

Обсуждения

Топ вопросы

Популярные категории

Популярные теги