DeepSeek Paralyzed for 12 Hours: Is the Computing Power of Domestic Large Models Failing to Keep Up with Ambitions?

marsbit | Published on 2026-04-03 | Last updated on 2026-04-03

Abstract

On the evening of March 29, DeepSeek, a leading Chinese large language model, experienced a 12-hour service outage affecting both its web and app platforms. The disruption, marked by repeated server failures and instability, raised significant concerns about the platform's reliability. Initial explanations pointed to overwhelming user traffic, but data did not support a sudden surge in users. Instead, the incident highlighted deeper structural challenges, particularly in computational supply struggling to keep pace with growing AI demands. As models become more complex—supporting longer context, advanced reasoning, and multimodal tasks—the computational load increases substantially. The outage also underscored emerging usage patterns like “lobster farming,” where automated, high-frequency API calls amplify server load. Meanwhile, anticipation for DeepSeek’s upcoming V4 model—featuring major upgrades like million-token context and stronger multimodal capabilities—suggests even greater future computational pressure. This incident signals a shift in the AI industry: competition is no longer just about model capability but also infrastructure stability, cost efficiency, and engineering scalability. The outage serves as an early warning of the systemic challenges facing AI platforms as adoption expands.

If you used AI on the evening of March 29, you likely experienced a sudden "disconnection."

The center of this storm was DeepSeek, a leading domestic large model provider. Starting at 9:35 PM that night, its web and app services simultaneously encountered anomalies—login failures, interrupted conversations, lost content, and screens flooded with "server busy" prompts. For ordinary users, this was merely a temporary inconvenience, but for students rushing to finish papers and workers racing against deadlines, it felt more like an unannounced "disaster."

What made it even more frustrating was that this outage was not a single crash but a case of "repeated fluctuations": service was briefly restored at 11 PM, crashed again at midnight, underwent emergency repairs in the early morning, and only gradually stabilized the next morning.

A full 12 hours of instability not only set a new record for DeepSeek’s downtime but also led users to question the stability of large models like never before.

01 An Unexpected Crash: Was the Problem Really Just "Too Many Users"?

When the outage occurred, the first explanation offered was that "too many users overloaded the servers."

While this sounded plausible, it was quickly contradicted by real data. According to recent AI application rankings, DeepSeek’s monthly active user base is around 150 million—a substantial number, but not indicative of explosive growth. In other words, this wasn’t a typical "breakout traffic surge."

That makes the situation even more intriguing: If there was no sudden spike in user numbers, why did the system completely fail in such a short time?

The answer likely lies in deeper structural issues.

02 Computing Power vs. Demand: The Hidden Crisis in the AI Industry

Over the past year, large models have evolved at an almost visible pace. From longer context windows to stronger reasoning capabilities and the continuous expansion of multimodal abilities, the "upper limits" of model capabilities have been constantly raised.

But at the same time, a more fundamental issue is becoming increasingly apparent: computing power supply is gradually approaching its limits.

Every response from a large model is essentially a consumption of computing power. The larger the model, the longer the context, and the more complex the reasoning, the higher the computational resources required. When user scale, call frequency, and model complexity rise simultaneously, system strain is almost inevitable.
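As a rough illustration of that relationship, the sketch below estimates the arithmetic needed to generate one reply. It relies on the common rule of thumb of about 2 floating-point operations per active parameter per generated token, plus an attention term that grows with context length. The function name and every number in the example are assumptions chosen for illustration, not DeepSeek figures, and the cost of processing the prompt itself is ignored.

```python
def decode_flops(active_params: float, n_layers: int, d_model: int,
                 context_tokens: int, output_tokens: int) -> float:
    """Very rough FLOPs to generate `output_tokens` tokens of a reply.

    Assumptions (illustrative only):
    - ~2 FLOPs per active parameter per generated token (dense part);
    - attention reads the whole cached context for each new token, costing
      ~4 * d_model * context_tokens FLOPs per layer (scores plus weighted sum);
    - context length is treated as fixed while the reply is generated.
    """
    dense = 2.0 * active_params * output_tokens
    attention = 4.0 * n_layers * d_model * context_tokens * output_tokens
    return dense + attention


# Hypothetical model: 30B active parameters, 60 layers, hidden size 7,168.
short_ctx = decode_flops(30e9, 60, 7168, context_tokens=8_000, output_tokens=500)
long_ctx = decode_flops(30e9, 60, 7168, context_tokens=1_000_000, output_tokens=500)
print(f"8K context:  {short_ctx:.3e} FLOPs")
print(f"1M context:  {long_ctx:.3e} FLOPs")
print(f"ratio:       {long_ctx / short_ctx:.1f}x")
```

Under these assumptions the dense term depends only on model size and reply length, while the attention term scales with context, which is why a longer context raises the cost of every reply even when the number of users stays the same.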

It is against this backdrop that DeepSeek’s outage appears less like an isolated incident and more like a "system-wide stress test."

According to information from Tianyancha, DeepSeek’s affiliated entities have been increasing investments in AI algorithm R&D and computing infrastructure, continuously strengthening related technological inputs and industrial collaboration.

In fact, DeepSeek isn’t the only one under pressure. Recently, other vendors, including MiniMax, have begun limiting call frequencies during peak hours, while computing service providers like Alibaba Cloud have also adjusted their pricing strategies to varying degrees.

On the surface, these are business decisions, but they reflect the same reality—AI infrastructure supply is struggling to keep up with the pace of demand growth.
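Peak-hour call limits of the kind mentioned above are commonly built on a token bucket. The sketch below is a generic, minimal version of that technique and is not a description of how MiniMax or any other vendor actually enforces its limits; the class name, parameters, and numbers are all hypothetical.

```python
class TokenBucket:
    """Generic token-bucket rate limiter (illustrative sketch).

    Each caller holds up to `capacity` tokens, refilled at `rate_per_s`.
    A request is admitted only if enough tokens are available.
    Time is passed in explicitly so the behaviour is easy to reason about.
    """

    def __init__(self, rate_per_s: float, capacity: float, now: float = 0.0):
        self.rate_per_s = rate_per_s
        self.capacity = capacity
        self.tokens = capacity  # start with a full bucket
        self.last = now

    def allow(self, now: float, cost: float = 1.0) -> bool:
        # Refill according to elapsed time, never exceeding capacity.
        elapsed = max(0.0, now - self.last)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate_per_s)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


# Hypothetical policy: 1 request per second sustained, bursts of up to 2.
bucket = TokenBucket(rate_per_s=1.0, capacity=2.0)
decisions = [bucket.allow(now=t) for t in (0.0, 0.0, 0.0, 1.0, 1.0)]
print(decisions)  # [True, True, False, True, False]
```

A provider could lower `rate_per_s` during peak hours and raise it off-peak. That tightens limits on high-frequency automated callers while leaving occasional human use largely unaffected.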

03 The "Lobster Farming" Trend: An Overlooked Traffic Amplifier

Another easily overlooked yet highly influential factor in this incident is the so-called "lobster farming" trend.

This approach essentially involves continuously calling the model via API to automate tasks, representing an early form of Agent applications. Compared to ordinary conversations, these calls occur at an extremely high frequency, sometimes triggering every minute or even every second.

When only a small number of users employ this method, it’s just an interesting experiment; but once it scales, it quickly becomes an "amplifier" of computing power consumption. This also explains why, even without a significant change in total user numbers, the system can still experience a "snowball" effect.
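The amplification effect can be made concrete with simple arithmetic. The sketch below compares the daily request volume of ordinary chat users with that of a small group running automated loops. All figures are hypothetical and chosen only to show the mechanism; they are not measurements of DeepSeek traffic.

```python
SECONDS_PER_DAY = 86_400


def daily_requests(chat_users: int, chat_calls_per_day: int,
                   agent_users: int, agent_interval_s: int) -> tuple[int, int]:
    """Return (chat_requests, agent_requests) per day.

    Assumes every automated user runs one loop around the clock,
    calling the API once every `agent_interval_s` seconds.
    """
    chat = chat_users * chat_calls_per_day
    agent = agent_users * (SECONDS_PER_DAY // agent_interval_s)
    return chat, agent


# Hypothetical population: 1,000,000 users, of whom 1% run an agent
# that calls the API once a minute; the rest send 20 messages a day.
chat, agent = daily_requests(chat_users=990_000, chat_calls_per_day=20,
                             agent_users=10_000, agent_interval_s=60)
total = chat + agent
print(f"chat requests:  {chat:,}")    # 19,800,000
print(f"agent requests: {agent:,}")   # 14,400,000
print(f"agent share:    {agent / total:.0%}")  # 42%
```

In this made-up scenario, 1% of users generate about 42% of all requests. Shortening the interval to one call per second would multiply the automated volume by another factor of 60, with no change at all in the user count.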

In a way, this outage is a classic case of "new application forms impacting old infrastructure."

04 V4 Is Approaching: Greater Expectations, Greater Pressure

Interestingly, this 12-hour outage did not significantly dampen market expectations for DeepSeek; instead, it amplified attention.

The reason is simple—the next-generation model, V4, is coming soon.

Industry rumors suggest that DeepSeek V4 will make leaps in several key capabilities: context length is expected to grow from the previous 128K tokens to the million-token range, while multimodal and Agent execution capabilities will also be enhanced. More importantly, the model may be further adapted to run on domestic chips, which would be significant for China’s AI ecosystem.

But the problem is equally clear: as model capabilities improve, the demand for computing power will also increase. If the underlying infrastructure isn’t upgraded simultaneously, similar stability issues are likely to recur.

05 From "Model Competition" to "Infrastructure Competition"

Looking back at this incident, its significance may extend beyond a single product level.

Over the past two years, competition in the large model industry has centered on "capabilities"—who is smarter, who is more powerful, who leads in benchmarks. But as applications scale, a new dimension is emerging: stability and cost.

Users are starting to care not just about "whether it works" but "whether it keeps working"; enterprises are focusing not only on performance metrics but also on overall operational costs and sustainability.

In other words, AI competition is shifting from the "model layer" to the "infrastructure layer."

DeepSeek’s 12-hour outage serves as an early warning: when AI truly enters the phase of large-scale application, the deciding factor may not be the model itself but the underlying computing power, architecture, and engineering capabilities.

06 Conclusion: An Accident or a Signal?

So, what do these 12 hours really mean?

It can be seen as an accident in the development process, or it can be interpreted as a "structural warning." The former concerns a single company; the latter concerns the entire industry.

One thing is certain: as AI applications deepen, similar stress tests will continue to occur. And each fluctuation will push the entire industry one step closer to maturity.

In a sense, DeepSeek’s crash is not an end but a beginning.

Finally, we’d like to ask: What were you using AI for during those 12 hours?

This article is from WeChat public account "Laotech," author: Laotech

Related Questions

Q: What was the main cause of the 12-hour DeepSeek service outage according to the article?

A: The outage was not primarily due to a sudden surge in users, but rather a systemic issue where computing power (算力) could not keep up with growing model complexity and usage patterns, including high-frequency API calls from 'lobster farming' applications.

Q: How did the 'lobster farming' phenomenon contribute to the server instability?

A: 'Lobster farming' refers to automated, high-frequency API calls that act as a traffic amplifier, significantly increasing compute resource consumption even without a large growth in total user numbers.

Q: What broader industry trend does the DeepSeek incident reflect?

A: It reflects a shift in AI industry competition from a focus solely on model capabilities to a critical need for robust infrastructure, including computing power, system architecture, and engineering capabilities to ensure stability and manage costs.

Q: What upcoming DeepSeek model version is mentioned, and what challenges might it bring?

A: DeepSeek V4 is mentioned, which is expected to have significantly enhanced capabilities like million-token context windows. This could put even greater pressure on computing infrastructure if not matched with corresponding upgrades.

Q: What was the significance of the 12-hour outage beyond being a single service failure?

A: The outage served as a structural early warning for the entire AI industry, highlighting that as AI applications scale, the stability and scalability of the underlying infrastructure become paramount to success.
