DeepSeek V4 Finally Released, Breaking the Strongest Closed-Source Monopoly, Explicitly Partnering with Huawei Chips

marsbitPublished on 2026-04-24Last updated on 2026-04-24

Abstract

DeepSeek AI has officially released DeepSeek-V4, available in two versions: the high-performance **DeepSeek-V4-Pro** (49B activated parameters, 1.6T total) and the more efficient **DeepSeek-V4-Flash** (13B activated parameters, 284B total). Both support a 1M context length, making long-context capability a baseline feature rather than a premium offering. The Pro version rivals top closed-source models in agent capabilities, world knowledge, and reasoning performance. It outperforms Claude Sonnet 4.5 in agentic coding and approaches Claude Opus 4.6 (non-thinking mode) in quality. The Flash version offers competitive performance at a lower cost, though it lags in highly complex tasks. A key technical innovation is a new attention mechanism that reduces computational and memory demands for long contexts. The models are optimized for agent frameworks like Claude Code and OpenClaw. API services are available with support for both OpenAI and Anthropic-style interfaces. DeepSeek also announced upcoming support for Huawei’s computing hardware in the second half of the year. The models are open-sourced on Hugging Face and ModelScope.

Just now, DeepSeek-V4 is here!

The preview version is officially launched and simultaneously open-sourced.

There are two versions in total:

DeepSeek-V4-Pro: Comparable to top closed-source models, 1.6T, 49B activated, 1M context length;

DeepSeek-V4-Flash: A smaller and faster economical version, 284B, 13B activated, 1M context length.

The official statement is: It leads domestically and in the open-source field in Agent capabilities, world knowledge, and reasoning performance.

And:

Currently, DeepSeek-V4 has become the Agentic Coding model used by company employees. According to evaluation feedback, the user experience is better than Sonnet 4.5, and the delivery quality is close to Opus 4.6 non-thinking mode. However, there is still a certain gap compared to the Opus 4.6 thinking model.

Currently, both the official website and the app have been updated, and the API service has also been synchronized.

Regarding the much-concerned domestic computing power, the key point is: Support for Huawei computing power in the second half of the year.

Top-Tier and Cost-Effective Choices, Two Versions Launched Together

This time, V4 releases two versions at once.

V4-Pro, performance comparable to top closed-source models.

The official judgment has three points:

Significantly improved Agent capabilities: In the Agentic Coding evaluation, V4-Pro has reached the best level among current open-source models and also performed excellently in other Agent-related evaluations. In internal evaluations, in Agent Coding mode, the V4 experience is better than Sonnet 4.5, and the delivery quality is close to Opus 4.6 non-thinking mode, but there is still a certain gap compared to the Opus 4.6 thinking mode.

Rich world knowledge: In world knowledge evaluations, DeepSeek-V4-Pro significantly leads other open-source models, only slightly inferior to the top closed-source model Gemini-Pro-3.1.

World-class reasoning performance: In evaluations of mathematics, STEM, and competitive code, DeepSeek-V4-Pro surpasses all currently publicly evaluated open-source models and achieves excellent results comparable to the world's top closed-source models.

V4-Flash, a smaller and faster economical version. Reasoning ability is close to Pro, world knowledge reserve is slightly inferior, but with smaller parameters and activation, and cheaper API.

In Agent tasks, DeepSeek-V4-Flash is on par with DeepSeek-V4-Pro in simple tasks, but there is still a gap in high-difficulty tasks.

In the car wash test, V4 also passed quickly.

In the classic biological scenario "Desperate Father," DeepSeek-V4 did not immediately grasp the key point of red-green color blindness in one round (according to genetic rules, if a female is red-green color blind, her biological father must be as well).

Million Context Length Becomes Standard

It is worth mentioning that from today, 1M context length is standard for all DeepSeek official services.

A year ago, 1M context length was Gemini's exclusive trump card; all other closed-source models were either 128K or 200K; on the open-source side, almost no one could afford this level.

DeepSeek directly moved the million context length from a "high-end feature" to "basic infrastructure."

And it's open source. How did they do it? The release directly gave the answer—

V4 has created a new attention mechanism that compresses at the token dimension and is used in combination with DSA sparse attention. Compared to traditional methods, the demand for computation and memory is significantly reduced.

DSA is not a new term. It was first introduced in the V3.2-Exp update half a year ago. At that time, external attention was low because the benchmark scores were almost the same as V3.1-Terminus, making it seem like an insignificant intermediate version.

Looking back now, that was the foundation of V4.

Special Optimization for Agent Capabilities

On the Agent side, V4 has been adapted and optimized for mainstream Agent products such as Claude Code, OpenClaw, OpenCode, CodeBuddy, etc., with improvements in code tasks and document generation tasks.

The release also included an example of a PPT inner page generated by V4-Pro under a certain Agent framework.

API Pricing

On the API side, V4-Pro and V4-Flash are simultaneously launched, supporting both OpenAI ChatCompletions interface and Anthropic interface.

The base_url remains unchanged, just change the model parameter to deepseek-v4-pro or deepseek-v4-flash to call.

Both versions have a maximum context length of 1M and support both non-thinking mode and thinking mode. In thinking mode, the intensity can be adjusted through the reasoning_effort parameter, with two levels: high and max. The official recommendation is to directly use max for complex Agent scenarios.

Here is a key point—Support for Huawei computing power in the second half of the year.

In addition, old model names will be discontinued.

deepseek-chat and deepseek-reasoner will be discontinued three months later (July 24, 2026). During the current phase, these two names point to the non-thinking and thinking modes of V4-Flash, respectively.

It has little impact on individual developers; just change one model parameter. Companies with production environments need to migrate during these three months.

One more thing

At the end of the release, DeepSeek quoted a sentence.

"Not tempted by praise, not frightened by slander, follow the path and act,端正自己端正自己 (correct oneself)."

This is a sentence from Xunzi's "Non-Twelve Masters." The literal meaning is: not tempted by praise, not frightened by slander, move forward according to the path one believes in, and correct oneself.

In today's context, it's somewhat interesting.

Over the past six months, rumors about when V4 would be released, whether it was delayed, whether it had been surpassed by others, whether it had been compromised by Claude's distilled data, etc., have circulated back and forth in both Chinese and English AI circles. At the beginning of the year, some even confidently said that V4 would be released before the Spring Festival, but it wasn't until the end of April.

They never responded once.

Then, on a Friday afternoon, they released V4, simultaneously open-sourced it, simultaneously launched it on the official website and app, simultaneously updated the API, and even wrote into the release that internal employees have already abandoned Claude.

No roadmap, no live stream, no interviews.

The four words "率道而行" (follow the path and act) sound like a slogan. But if you look at the path over the past six months: the V3.2 "unremarkable" Exp version, the DSA sparse attention that paved the way for V4 for half a year, and the path of making 1M context length from a trump card to a standard feature.

DeepSeek has already done it.

DeepSeek-V4 model open-source links:

[1]https://huggingface.co/collections/deepseek-ai/deepseek-v4

[2]https://modelscope.cn/collections/deepseek-ai/DeepSeek-V4

DeepSeek-V4 Technical Report: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main/DeepSeek_V4.pdf

This article is from the WeChat public account "QbitAI", author: QbitAI

A 'Overlooked' Market Event: Joint US-Japan-South Korea Intervention, Rare US Treasury Involvement, and Bessent's Quiet 'Market Rescue'?

Summary: The United States, Japan, and South Korea executed their largest coordinated foreign exchange intervention in nearly 30 years. The action targeted depreciation pressure on the Japanese yen and South Korean won. This move is seen as a significant effort by the US to stabilize the financial markets of its key allies and prevent the spillover of risks. Key details: * Japan reportedly intervened on July 30 using approximately 8.45 trillion yen (about $52.8 billion). South Korean authorities also intervened that day, selling dollars to support the won. * Notably, the US Treasury Department intervened directly in yen markets for the first time in roughly 30 years. The New York Fed, reportedly acting on behalf of the Treasury, sold euros to buy yen via Goldman Sachs and Morgan Stanley on July 31. Analysts view the use of the euro-yen pair as a way to alleviate yen pressure without adding selling pressure to the US dollar. * Prior to the action, the New York Fed conducted "rate checks" on both USD/JPY and EUR/JPY, a newer signaling tool that falls between verbal and physical intervention. The intervention is interpreted as going beyond traditional currency stabilization. Analysts, such as Michael Hartnett of Bank of America, suggest it resembles a "Price Keeping Operation" for the AI era. The core US objectives are perceived to be: 1. Preventing rapid yen depreciation from triggering a sharp rise in Japanese government bond yields. 2. Containing financial stress from spreading across Asian markets like South Korea and Japan. 3. Reducing the risk of disorderly capital flows impacting the US bond market. This coordinated action underscores the importance of Japan and South Korea as critical partners in the US semiconductor and AI supply chain. Stabilizing their financial markets is seen as vital to mitigating risks to the broader tech industry and the US market itself. The intervention coincides with market pressures, including the KOSDAQ index hitting a low since October 2022. While seen as a move to control volatility, some analysts caution it may not fundamentally reverse existing market trends.

marsbit4m ago

A 'Overlooked' Market Event: Joint US-Japan-South Korea Intervention, Rare US Treasury Involvement, and Bessent's Quiet 'Market Rescue'?

marsbit4m ago

Will the Federal Reserve Definitely Raise Interest Rates in September? How Will Cryptocurrencies and US Stocks Bear the Pressure?

In early August 2024, market expectations for a September Federal Reserve rate hike surged dramatically, from below 50% to over 80%, driven by renewed inflation concerns. This shift followed a contentious July FOMC meeting where a 9-3 vote to hold rates revealed a growing hawkish faction advocating for an immediate hike, citing prolonged above-target inflation. The key catalyst is escalating conflict near the Strait of Hormuz, which has pushed oil prices up approximately 20% in July, threatening to reignite inflation. The next critical data point is the July CPI report on August 12th; a hot reading could solidify hike expectations. For crypto assets, particularly Bitcoin, this represents near-term pressure. Bitcoin continues to exhibit high-beta, risk-on characteristics, making it sensitive to tightening liquidity and higher opportunity costs. However, historical precedent suggests that if a hike is perceived as the cycle's end rather than its start, the negative impact may be brief, with markets quickly pivoting to anticipate future rate cuts. U.S. stocks, especially crypto-linked equities like Coinbase and high-valuation tech stocks, face amplified volatility. Higher rates increase discount rates in valuation models, pressuring growth stocks. This coincides with a pivotal tech earnings season where investor focus has shifted from massive AI capital expenditures to demonstrable revenue and cash flow generation. Companies with negative cash flows and weak growth narratives could see severe pressure if a September hike materializes, as financing costs would rise. Key indicators to watch include oil prices, upcoming inflation data, and Fed commentary at events like the Jackson Hole symposium.

Odaily星球日报4m ago

Will the Federal Reserve Definitely Raise Interest Rates in September? How Will Cryptocurrencies and US Stocks Bear the Pressure?

Odaily星球日报4m ago

How Likely is it for TradeXYZ to Go Solo from Hyperliquid?

The article analyzes the possibility of TradeXYZ, a dominant project on Hyperliquid's HIP-3 real-world asset (RWA) perpetual contracts market, splitting off to build its own independent trading platform. With TradeXYZ commanding over 90% of HIP-3 volume and over 35% of Hyperliquid's total open interest, its leverage and influence are significant. The primary motive for a split is seen as capturing the underlying protocol fees, of which TradeXYZ currently receives only half. However, several factors strongly discourage such a move. First, replicating Hyperliquid's high-performance infrastructure would be challenging and could degrade TradeXYZ's product quality. Second, Hyperliquid serves as a crucial distribution channel, with most users accessing TradeXYZ's liquidity through its frontend. Third, the close, trusting relationship between the founders of both projects makes a hostile "betrayal" unlikely. The conclusion is that a split would likely result in a lose-lose scenario. Hyperliquid would lose its main growth narrative and see its valuation impacted, while TradeXYZ would face immense technical hurdles, lose its primary user channel, and damage its reputation. Instead of a full separation, the more probable path is for TradeXYZ to negotiate better terms while deepening integration, focusing on enhancing its own brand and user ownership.

marsbit55m ago

How Likely is it for TradeXYZ to Go Solo from Hyperliquid?

marsbit55m ago

"Teletubbies" Robot Cleaning Service, $30/Hour, Pure·Manual·Intelligence

Anthropomorphic "Teletubby" robot offers cleaning services in San Francisco at $30/hour, but it's entirely remote-controlled. The robot, created by startup Tau Robotics, can perform household tasks like washing hands, mopping floors, and taking out trash. While the initial demo videos appear impressive and are notably shown at normal speed (unlike many sped-up robot demos), the company reveals the actions are performed via human teleoperation, not autonomous AI. Tau Robotics, founded in 2024, argues this "cheat" is a strategic way to bridge the current capability gap, ensure task completion, and collect real-world home data to eventually train autonomous systems. Their service features three robot models: Chelsea for kitchens/bathrooms, Elon for regular tidying with memory, and Tony for deep cleaning. Priced at $30 per hour, it's cheaper than average human cleaners in the US. The article discusses the broader challenge of deploying humanoid robots in homes, comparing Tau's approach to others like China's Ziliang and the US's 1X Neo, which also use teleoperation. A key reason for choosing a humanoid form is to make remote control more intuitive for human operators. The piece also notes the potential "emotional value" of having a humanoid servant. The service is currently invite-only in San Francisco.

marsbit1h ago

"Teletubbies" Robot Cleaning Service, $30/Hour, Pure·Manual·Intelligence

marsbit1h ago

From South Korea to the United States: Blue-Collar Jobs Are Becoming Increasingly Popular, Thanks to AI

AI is reshaping the labor market's value proposition. The traditional four-year college degree is losing its appeal as a guaranteed career path, while skilled blue-collar trades like electricians, welders, and plumbers are experiencing historic demand and wage premiums. This shift is driven by dual pressures: AI's displacement of certain white-collar roles and a booming need for physical infrastructure and data center construction. Data confirms the trend. In the U.S., vocational school revenue surged, and a significant portion of recent layoffs are AI-related. Surveys show a majority of Gen Z adults plan to pursue blue-collar work, citing better job security against AI automation. Vocational education interest has exploded recently. Experts cite a psychological shift as younger generations seek tangible, AI-resistant careers and avoid high student debt. In many cases, salaries for skilled trades now match or exceed those requiring a bachelor's degree. In South Korea, semiconductor vocational high schools boast near-total employment, with graduates securing high-paying roles at companies like Samsung. The shortage is structural, exacerbated by a retiring baby boomer workforce and massive infrastructure projects. Companies like JPMorgan Chase, Meta, and Lowe's are investing heavily in training programs. However, overcoming historical stigma and a "perception gap" around trade careers remains a key challenge to closing the talent gap.

marsbit1h ago

From South Korea to the United States: Blue-Collar Jobs Are Becoming Increasingly Popular, Thanks to AI

marsbit1h ago

Trading

Spot

DeepSeek V4 Finally Released, Breaking the Strongest Closed-Source Monopoly, Explicitly Partnering with Huawei Chips

Abstract

Top-Tier and Cost-Effective Choices, Two Versions Launched Together

Million Context Length Becomes Standard

Special Optimization for Agent Capabilities

API Pricing

One more thing

Related Questions

Related Reads

A 'Overlooked' Market Event: Joint US-Japan-South Korea Intervention, Rare US Treasury Involvement, and Bessent's Quiet 'Market Rescue'?

Will the Federal Reserve Definitely Raise Interest Rates in September? How Will Cryptocurrencies and US Stocks Bear the Pressure?

How Likely is it for TradeXYZ to Go Solo from Hyperliquid?

"Teletubbies" Robot Cleaning Service, $30/Hour, Pure·Manual·Intelligence

From South Korea to the United States: Blue-Collar Jobs Are Becoming Increasingly Popular, Thanks to AI

Trading