DeepSeek V4 Finally Released, Breaking the Strongest Closed-Source Monopoly, Explicitly Partnering with Huawei Chips

marsbit · Published on 2026-04-24 · Last updated on 2026-04-24

Abstract

DeepSeek AI has officially released DeepSeek-V4, available in two versions: the high-performance **DeepSeek-V4-Pro** (49B activated parameters, 1.6T total) and the more efficient **DeepSeek-V4-Flash** (13B activated parameters, 284B total). Both support a 1M context length, making long-context capability a baseline feature rather than a premium offering. The Pro version rivals top closed-source models in agent capabilities, world knowledge, and reasoning performance. It outperforms Claude Sonnet 4.5 in agentic coding and approaches Claude Opus 4.6 (non-thinking mode) in quality. The Flash version offers competitive performance at a lower cost, though it lags in highly complex tasks. A key technical innovation is a new attention mechanism that reduces computational and memory demands for long contexts. The models are optimized for agent frameworks like Claude Code and OpenClaw. API services are available with support for both OpenAI and Anthropic-style interfaces. DeepSeek also announced upcoming support for Huawei’s computing hardware in the second half of the year. The models are open-sourced on Hugging Face and ModelScope.

Just now, DeepSeek-V4 is here!

The preview version is officially launched and simultaneously open-sourced.

There are two versions in total:

DeepSeek-V4-Pro: Comparable to top closed-source models, 1.6T, 49B activated, 1M context length;

DeepSeek-V4-Flash: A smaller and faster economical version, 284B, 13B activated, 1M context length.

The official claim: it leads the domestic and open-source fields in Agent capabilities, world knowledge, and reasoning performance.

And:

DeepSeek-V4 has already become the Agentic Coding model used internally by company employees. According to evaluation feedback, the user experience is better than Sonnet 4.5, and the delivery quality is close to Opus 4.6 in non-thinking mode. However, a gap remains compared with Opus 4.6 in thinking mode.

Currently, both the official website and the app have been updated, and the API service has also been synchronized.

On the much-watched question of domestic computing power, the key point: support for Huawei compute arrives in the second half of the year.

Top-Tier and Cost-Effective Choices, Two Versions Launched Together

This time, V4 releases two versions at once.

V4-Pro, performance comparable to top closed-source models.

The official judgment has three points:

Significantly improved Agent capabilities: in the Agentic Coding evaluation, V4-Pro reaches the best level among current open-source models and also performs strongly in other Agent-related evaluations. In internal evaluations of Agent Coding mode, the V4 experience beats Sonnet 4.5 and the delivery quality is close to Opus 4.6 non-thinking mode, though a gap remains compared with the Opus 4.6 thinking mode.

Rich world knowledge: In world knowledge evaluations, DeepSeek-V4-Pro significantly leads other open-source models, only slightly inferior to the top closed-source model Gemini-Pro-3.1.

World-class reasoning performance: in evaluations of mathematics, STEM, and competitive programming, DeepSeek-V4-Pro surpasses all currently publicly evaluated open-source models and achieves results comparable to the world's top closed-source models.

V4-Flash, a smaller and faster economical version. Its reasoning ability is close to Pro and its world knowledge slightly weaker, but it has fewer total and activated parameters and a cheaper API.

In Agent tasks, DeepSeek-V4-Flash is on par with DeepSeek-V4-Pro in simple tasks, but there is still a gap in high-difficulty tasks.

In the car wash test, V4 also passed quickly.

In the classic biology scenario "Desperate Father," DeepSeek-V4 did not immediately grasp the key point about red-green color blindness in a single round (by the rules of inheritance, if a female is red-green color blind, her biological father must be color blind as well).
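The parenthetical genetic rule can be checked mechanically. As an illustration only (the function names and allele encoding below are ours, not from the article), a minimal model of X-linked recessive inheritance:

```python
# Red-green color blindness is X-linked recessive: a female expresses it
# only with the affected allele 'x' on both X chromosomes. A daughter
# inherits her father's only X plus one of her mother's two X alleles.
def possible_daughters(father_x, mother_xs):
    """All X-genotypes a daughter can have (alleles sorted for comparison)."""
    return {tuple(sorted((father_x, m))) for m in mother_xs}

def color_blind_daughter_possible(father_x, mother_xs):
    # A color-blind daughter is ('x', 'x'): she needs 'x' from BOTH parents,
    # so her father must carry it, and with only one X he expresses it too.
    return ("x", "x") in possible_daughters(father_x, mother_xs)
```

Running the check confirms the puzzle's key point: a color-blind daughter is impossible unless the father is himself color blind.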

Million Context Length Becomes Standard

It is worth mentioning that from today, 1M context length is standard for all DeepSeek official services.

A year ago, a 1M context length was Gemini's exclusive trump card; other closed-source models offered either 128K or 200K, and on the open-source side almost no one could deliver this level.

DeepSeek directly moved the million context length from a "high-end feature" to "basic infrastructure."

And it's open source. How did they do it? The release directly gave the answer—

V4 introduces a new attention mechanism that compresses along the token dimension, used in combination with DSA sparse attention. Compared with traditional attention, its compute and memory demands are significantly reduced.
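DeepSeek has not published the mechanism's details beyond this one-line description, so the following is purely an illustrative sketch of the general idea of token-dimension compression; mean-pooling stands in for whatever learned compression V4 actually uses, and real systems would combine it with sparse selection such as DSA:

```python
import numpy as np

def toy_compressed_attention(q, k, v, block=4):
    """Attention over token-compressed keys/values (illustrative only).

    Pooling every `block` tokens shrinks the score matrix from
    (Tq, T) to (Tq, T // block), cutting compute and memory.
    T must be divisible by block.
    """
    T, d = k.shape
    kc = k.reshape(T // block, block, d).mean(axis=1)  # compressed keys
    vc = v.reshape(T // block, block, d).mean(axis=1)  # compressed values
    scores = q @ kc.T / np.sqrt(d)                     # (Tq, T // block)
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)                 # row-wise softmax
    return w @ vc
```

With T = 16 and block = 4, the attention map has 4 columns instead of 16; at 1M tokens, that kind of reduction is what makes long contexts affordable.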

DSA is not a new term. It was first introduced in the V3.2-Exp update half a year ago. It drew little outside attention at the time because its benchmark scores were nearly identical to V3.1-Terminus, making it look like an insignificant intermediate version.

Looking back now, that was the foundation of V4.

Special Optimization for Agent Capabilities

On the Agent side, V4 has been adapted and optimized for mainstream Agent products such as Claude Code, OpenClaw, OpenCode, CodeBuddy, etc., with improvements in code tasks and document generation tasks.

The release also included a sample presentation slide generated by V4-Pro under one of these agent frameworks.

API Pricing

On the API side, V4-Pro and V4-Flash are simultaneously launched, supporting both OpenAI ChatCompletions interface and Anthropic interface.

The base_url remains unchanged; just set the model parameter to deepseek-v4-pro or deepseek-v4-flash.

Both versions have a maximum context length of 1M and support both non-thinking mode and thinking mode. In thinking mode, the intensity can be adjusted through the reasoning_effort parameter, with two levels: high and max. The official recommendation is to directly use max for complex Agent scenarios.
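Putting the above together, a request body might look like this. This is a sketch against the OpenAI-style ChatCompletions interface described in the article; the prompt text is a placeholder, and the exact placement of reasoning_effort in the body is our assumption:

```python
import json

# Sketch of a ChatCompletions-style request to the (unchanged) base_url.
# Only the model name changes; the prompt is a made-up placeholder.
payload = {
    "model": "deepseek-v4-pro",  # or "deepseek-v4-flash"
    "messages": [{"role": "user", "content": "Refactor this module."}],
    "reasoning_effort": "max",   # thinking mode: "high" or "max"
}
body = json.dumps(payload)
```

Per the release, "max" is the recommended setting for complex Agent scenarios.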

Here is a key point—Support for Huawei computing power in the second half of the year.

In addition, old model names will be discontinued.

deepseek-chat and deepseek-reasoner will be discontinued in three months (on July 24, 2026). Until then, the two names point to the non-thinking and thinking modes of V4-Flash, respectively.

The impact on individual developers is small: change one model parameter. Teams running production workloads need to migrate within the three-month window.
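The migration described above is essentially a name swap. A minimal sketch (the mapping helper is hypothetical tooling, not an official SDK feature):

```python
# Deprecated aliases (valid until 2026-07-24) and their V4 replacements,
# per the release notes. This helper is hypothetical, not official tooling.
LEGACY_MODEL_MAP = {
    "deepseek-chat": "deepseek-v4-flash",      # was: non-thinking mode
    "deepseek-reasoner": "deepseek-v4-flash",  # was: thinking mode
}

def migrate_model_name(name):
    """Return the V4 name for a legacy alias; pass others through unchanged."""
    return LEGACY_MODEL_MAP.get(name, name)
```

Callers that previously used deepseek-reasoner would also set the thinking mode (e.g. via reasoning_effort), since both aliases map to the same V4-Flash model.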

One more thing

At the end of the release, DeepSeek quoted a sentence.

"Not tempted by praise, not frightened by slander; follow the path and act, and keep oneself upright."

This is a line from Xunzi's "Fei Shi Er Zi" ("Against the Twelve Masters"). The literal meaning: do not be swayed by praise or frightened by slander; move forward along the path you believe in and keep yourself upright.

In today's context, it's somewhat interesting.

Over the past six months, rumors about V4 circulated back and forth in both the Chinese and English AI communities: when it would be released, whether it was delayed, whether it had been overtaken by rivals, whether it leaned on distilled Claude data. At the start of the year, some even confidently claimed V4 would land before the Spring Festival, but it did not arrive until late April.

They never responded once.

Then, on a Friday afternoon, they released V4, simultaneously open-sourced it, simultaneously launched it on the official website and app, simultaneously updated the API, and even wrote into the release that internal employees have already abandoned Claude.

No roadmap, no live stream, no interviews.

The four characters "率道而行" (follow the path and act) sound like a slogan. But look at the path of the past six months: the "unremarkable" V3.2-Exp release, the DSA sparse attention that quietly paved the way for V4 for half a year, and the move that turned a 1M context length from a trump card into a standard feature.

DeepSeek has already done it.

DeepSeek-V4 model open-source links:

[1]https://huggingface.co/collections/deepseek-ai/deepseek-v4

[2]https://modelscope.cn/collections/deepseek-ai/DeepSeek-V4

DeepSeek-V4 Technical Report: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main/DeepSeek_V4.pdf

This article is from the WeChat public account "QbitAI", author: QbitAI

Related Questions

Q: What are the two versions of DeepSeek-V4 that were released, and what are their key specifications?

A: DeepSeek-V4 was released in two versions: DeepSeek-V4-Pro and DeepSeek-V4-Flash. The Pro version has 1.6T total parameters with 49B activated and a 1M context length. The Flash version is a smaller, faster, and more economical model with 284B total parameters, 13B activated, and also a 1M context length.

Q: According to the article, how does DeepSeek-V4-Pro's performance compare to top closed-source models like Anthropic's Opus 4.6?

A: According to internal evaluations, DeepSeek-V4-Pro's performance in Agent Coding mode is better than Sonnet 4.5, and its delivery quality is close to Opus 4.6 in non-thinking mode, but a gap remains versus Opus 4.6 in thinking mode.

Q: What major technical achievement is highlighted for the DeepSeek-V4 models regarding context length?

A: A 1M context length is now standard across all DeepSeek official services. This was achieved through a novel attention mechanism that compresses along the token dimension, combined with DSA sparse attention, significantly reducing compute and memory requirements.

Q: What significant partnership or hardware support is announced for the future of DeepSeek's models?

A: DeepSeek will support Huawei computing hardware in the second half of the year.

Q: Where can users find the open-source models and the technical report for DeepSeek-V4?

A: The open-source models are in the 'deepseek-ai/DeepSeek-V4' collections on Hugging Face and ModelScope. The technical report is available at: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main/DeepSeek_V4.pdf

