The Night Before the AI Model Shakeout

marsbit发布于2026-05-10更新于2026-05-10

文章摘要

China's large language model (LLM) industry is entering a critical consolidation phase. In a concentrated wave of funding in May 2026, leading players Kimi, StepFun, and DeepSeek reportedly secured over $70 billion combined, signaling a dramatic capital rush towards the few remaining independent contenders. This frenzy masks an impending shakeout. The core dynamic has shifted from a pure technology race to a battle for survival and strategic positioning. LLM capabilities are rapidly commoditized; gaps between top models are narrowing. Consequently, investment logic has pivoted from betting on future potential to prioritizing cash flow, user access, and ecosystem integration. The economic model poses a fundamental challenge: while user growth previously meant profits, in the AI era, it drives soaring inference costs. Startups, lacking the cross-subsidy ability of tech giants like ByteDance or Tencent, face immense pressure to achieve financial sustainability. DeepSeek's open-source, high-performance, low-cost strategy has further compressed industry profit margins. Facing this reality, the top players are scrambling to lock in their status before the window closes. StepFun is accelerating its港股 IPO, embedding itself in hardware supply chains. Kimi is aggressively showcasing revenue growth (ARR doubling to $2 billion in a month) to prove viability. DeepSeek, with new state-backed investment, is solidifying its role as a strategic national asset. The parallel to China's prev...

By: Huahua, Banjun

Over the past week, China's large model industry suddenly entered a state of nearly frenzied fundraising.

Kimi secured $2 billion in funding, with its valuation surpassing $20 billion.

StepFun was reported to be close to completing nearly $2.5 billion in funding, while accelerating the dismantling of its VIE structure, with its Hong Kong IPO entering the final sprint.

DeepSeek was rumored for the first time to be accepting external investment, with national-level funds stepping in, pushing its valuation range to $45–$50 billion.

Three companies, three days, over $7 billion flooded in simultaneously. This is no longer just supplementary investment post-funding; it's more like a collective scramble for future survival quotas.

On the surface, this seems like the hottest moment for the large model industry.

But truly dangerous industries are often the hottest.

When capital no longer spreads evenly but pushes all its chips toward the final few leading players, the industry appears exceptionally prosperous, but in reality, it has already entered the night before the shakeout.

The soaring valuations of Zhipu and MiniMax post-IPO have gradually made everyone realize:

The time left for independent large model companies may be running out.

I. Models Are Becoming Commoditized

Over the past two years, the biggest consensus in the large model industry was the existence of massive generational gaps in model capabilities.

GPT-4 was once seemingly unreachable. As long as a startup could get slightly closer in some dimension—long context, reasoning, multimodality, Agent—capital was willing to offer a high premium.

Everyone believed capability gaps would form long-term moats.

But the situation changed in 2026.

Long context is no longer scarce. Reasoning capability is no longer scarce. Multimodality is no longer scarce.

After DeepSeek V4 brought open-source capabilities close to the level of GPT-4 or even newer versions, the industry truly realized for the first time that model capabilities themselves might be easier to catch up with than everyone imagined.

Gaps still exist between Qwen, DeepSeek, Gemini, Claude, and GPT-5.5, but it's already difficult to form generational碾压.

Models are becoming commoditized.

Once commoditization occurs, the capital market will ask a question anew: What's left besides the model?

Thus, the industry's narrative suddenly switched scripts.

In 2023, all companies talked about stronger models, more parameters, better reasoning, longer context. Today, they talk about holding the end-user touchpoint, binding to the industrial chain, possessing user entry points, and having national strategic value.

This shift marks the large model industry's official move from a technology race into a phase of position consolidation.

Capital market data already reflects this.

During the 2023 "Hundred Models Battle," the number of domestic large model companies proliferated exponentially. Data from Yibang Power showed that year, the "Six Tigers" collectively raised over 6 billion RMB, accounting for more than half of the early-stage funding for domestic large models. 2024 was even crazier, with over 168 global large model industry chain funding rounds exceeding 100 million RMB, totaling over 400 billion RMB.

The Six Tigers collectively raised at least 20 billion RMB for the year, with single-round records constantly being broken.

Then came 2025. A sharp turn downward.

According to reports from Touzijie, AI model-layer companies completed only 22 investments for the full year, with a total disclosed amount of 9.4 billion RMB, a 52.9% drop from 2024. The proportion of large model funding in total AI investment plummeted from 51% in 2024 to 14%. Companies with single-round funding exceeding 2 billion RMB were only Zhipu, MiniMax, and Moonshot.

One hundred companies became less than ten that could get funding. Two years, an elimination rate over 90%.

So when we see these three funding rounds totaling $7 billion concentrated in one week in May 2026, its meaning is clear: money isn't flowing to the industry; it's flowing to the last few players.

The larger this funding, the higher the concentration. The higher the concentration, the smaller the space left for those behind.

II. The Music Hasn't Stopped, But the Seats Are Already Insufficient

The crazy surge of Zhipu and MiniMax post-IPO did something with profound impact on the entire industry: it established a reference point in the secondary market for how much domestic large models are worth.

Once this reference standard is set, all unlisted companies face a life-or-death sprint. If they don't lock in their valuation during the current window, and the market experiences aesthetic fatigue leading to a correction, their private market valuations could be instantly shattered.

The window wasn't opened by you; it was propped open by the first movers. If you don't jump in, it closes.

StepFun plans to submit its Hong Kong listing application by the end of June and complete its IPO by year-end. Its VIE structure has been dismantled. The shareholding system reform landed in April. All preparatory steps compressed within months.

Kimi's ARR rose from $100 million to $200 million within a month. Investors proactively disclosed this number to the media, which is extremely rare in the private market. Usually, only when preparing for the next funding round or sprinting towards an IPO would a company allow core financial metrics to leak.

This eagerness to "prove innocence" indicates the private market no longer believes in pure imagination; they want to see revenue, see the certainty of exit. (Extended reading: Kimi isn't short of money; it's short of DeepSeek)

DeepSeek had never previously accepted external funding. Now, state-level funds have entered.

The three companies seem to be doing different things, but the underlying logic is identical: lock in identity, lock in valuation, lock in exit channels. While the window is still open.

III. More Expensive, Yet Less Valuable

Why now? Why not wait?

The reason lies in the economic model of the large model industry, which is exposing an increasingly fatal contradiction.

On the cost side: GPU clusters, inference compute, long context, multimodality, Agent. Each new capability devours cash.

But what's truly terrifying isn't training. It's inference.

Training is a one-time investment. Inference costs grow in sync with user scale. Every token, every API call, every Agent task corresponds to real GPU consumption.

In the mobile internet era, more users meant more profit for the platform. In the AI era, more users might first make the model company poorer.

One more user for WeChat, Tencent's marginal cost hardly changes. One more user for Douyin, ByteDance gets one more ad slot. One more high-frequency user for Doubao, behind it lies continuously increasing inference expenses. (Extended reading: More Users, The Poorer ByteDance Gets)

Large model companies inherently require continuous fundraising ability. And private market money cannot be supplied infinitely.

The reason why going public suddenly becomes so crucial isn't just about exit; more critically, it's about obtaining a public capital channel for long-term blood transfusion.

This is what truly makes all independent model companies anxious today.

The revenue side is even more brutal.

DeepSeek truly brought a price war into the large model industry. High capability, open-source, extremely low price—these three things hold true simultaneously for the first time.

This is a devastating shock to the entire industry. The profit margin in the API market is directly compressed.

The entire industry suddenly realizes model capability might not be the scarcest thing. What's truly scarce is the ability to keep burning money, endure long-term losses, and withstand price wars.

And these abilities, startups inherently can't match giants.

The scary thing about giants isn't their models. It's that they possess cross-subsidization capabilities startups will never have. ByteDance can run Doubao for free long-term because its ad business continuously provides blood transfusions. Recent plans for Doubao to start charging also show it can't sustain the burn, indicating how crazy the spending is.

Tencent can push Yuanbao at low prices because its gaming and social businesses are still profitable. A startup's model must learn to support itself.

Giants compete on ecosystem. Startups compete on survival.

There's another change many haven't realized yet.

Back in 2023, when capital invested in large models, it was essentially buying "possibility."

Because everyone believed that as long as you built the next GPT-4, you could redefine the internet. So at that stage, funding looked at founder background, tech team, model capability, and imagination.

But today, capital is starting to look at another set of things.

It starts asking: Do you have cash flow? Do you have end-user entry points? Do you have ecosystem bindings? Can you survive the next price war?

This means the fundraising logic for the large model industry has shifted from venture capital towards infrastructure investment.

Venture capital believes in the future. Infrastructure investment only believes in survival rates.

Once an industry enters the infrastructure stage, capital naturally concentrates towards the top. Because infrastructure industries never need many players.

IV. The Sense of Deja Vu from the "Four AI Dragons"

This script isn't playing out for the first time.

Around 2018, the "Four AI Dragons" in the computer vision赛道—SenseTime, Megvii, Cloudwalk, Yitu—experienced almost identical plotlines: frenzied fundraising, soaring valuations, record-breaking rounds. Everyone believed the AI era had arrived.

What happened later?

Tencent, Alibaba, Huawei entered the field comprehensively. Computer vision was turned into a standard feature within cloud services. The technology premium of independent companies evaporated instantly, commercialization couldn't achieve scale, and finally, they could only experience prolonged post-IPO破发 and silence.

Today's large model赛道 is entering the same phase. The difference is the stakes are higher this round, the burn rate is faster, and the giants'碾压 is more direct. ByteDance's annual spending on AI might exceed the total funding of the entire "Six Tigers."

Global money is telling the same story. In Q3 2025, the overall funding scale for global AI startups reached $97 billion, of which nearly 46%, about $44.6 billion, concentrated in流向不超过 five leading foundational model companies like Anthropic and xAI.

Entering 2026, funding for leading model companies further accelerated, reaching new levels:

OpenAI completed a $122 billion funding round in March, with a post-money valuation of $852 billion; Anthropic then completed a $30 billion Series G in February, valued at $380 billion,紧接着又 launched a new pre-IPO round of about $50 billion, targeting a valuation直奔 $900 billion.

Capital is concentrating towards the very top with unprecedented intensity. Middle-layer companies are experiencing the longest liquidity winter.

This trend holds true in China as well. In the full year of 2025, large model funding's share of total AI investment dropped from 51% to 14%, but the top three took the vast majority of that. Money didn't disappear; it just stopped being evenly distributed.

And the淘汰速度 is far faster than the last generation. The mobile internet took nearly a decade to go from the "Hundred Groupons War" to AT monopoly. The large model industry might take only three years to go from the "Hundred Models Battle" to shakeout.

A year ago, Baichuan AI was still one of the companies most resembling a Chinese OpenAI. Wang Xiaochuan appeared in almost every large model discussion. Today, it rarely appears at the center of funding news. 01.AI was once a star startup team, with Kai-Fu Lee高调 announcing "All in AI." But the industry increasingly seldom discusses whether it can enter the next round.

The large model industry淘汰 companies doesn't require their technology to fall behind. It only requires the capital window to close first.

V. Three Paths, Three Bets

Today's large model startups have already diverged into three completely different paths.

DeepSeek chose to become a national-level technical asset.

Its $45 billion valuation doesn't come entirely from commercialization, but from the strategic significance of its technical moat and领先 in algorithmic efficiency, making it a kind of national reserve. The entry of national funds indicates its positioning has transcended commercial competition. Its risk lies elsewhere—fragile organizational structure, with several core researchers having departed.

StepFun chose to bind itself to the hardware industry chain. Huaqin, Longcheer, Omnivision, ZTE—core players in the consumer electronics supply chain collectively invested.

The logic of StepFun's Chairman Yin Qi is clear: foundational model capabilities will eventually level out. The real moat lies in who can embed the model into the end-device supply chain, making it impossible for competitors to replace you without replacing the entire chain. By the end of 2025, numbers like 42 million预装 phones, covering 60% of top brands—their importance lies not in scale, but in depth of嵌入.

Kimi chose user scale and speed. ARR growing from $100 million to $200 million in a month, paid subscriptions and API growing simultaneously. But its problems are also the sharpest: monthly active users dropping from a peak of 36 million to 8.33 million, ByteDance's Doubao with 350 million MAUs forming absolute压制, and its B2B API pricing又被 DeepSeek打穿.

Kimi's product is still excellent. But having an excellent product is no longer enough.

The three paths are completely different, but share one commonality: None of them are still talking about building the best model in China. Everyone has started talking about what position they have卡住了.

VI. The End Goal of Fundraising Isn't Expansion

Why did $7 billion flow in simultaneously within three days?

On the surface, it's industry heat. But when an industry is truly hottest, fundraising should be从容; companies would slowly挑选 investors,延长 cycles, waiting for higher valuations.

Now the keyword is only one: scramble.

StepFun scrambles to list. Kimi scrambles to prove revenue. DeepSeek scrambles to complete身份确认.

They aren't scrambling for money. Money is just the tool. What they are truly scrambling for is the last window for independent survival.

The large model industry might not end up leaving many independent players. Infrastructure industries have always been like this: cloud computing eventually归属 to a few giants; communication networks ultimately只剩 three operators; power systems are高度集中.

When model capabilities become commoditized, API prices approach zero, and giants harvest users with免费 strategies, independent model companies either go public to obtain continuous fundraising ability, get integrated into some ecosystem, or disappear.

Going public is obtaining an ID card. State-level endorsement is another form of ID card. An ID card doesn't guarantee you'll win. But without an ID card, you can't even enter the next round.

And for those names not appearing in this week's news, the silence itself is already the answer.

Words 【Off the Page】:

In 2023, the most frequently asked question about the domestic large model赛道 was: Who can build it?

In 2026, the question has become: Who can survive?

From building it to surviving it, only three years passed in between. But these three years are enough for an industry to jump directly from spring to autumn.

This article is from the WeChat public account "Off the Page," author: Huahua

你可能也喜欢

Kalshi、MTS 与 a16z 的野望

本文探讨了预测市场在2025年成为投资热点的现象，并着重分析了其精神内核与风险投资机构a16z的新媒体战略之间的关联。文章梳理了预测市场理念的演变：从哈耶克关于市场作为信息协调机制的理论，到Robin Hanson设计的经济激励机制，再到“Futarchy”治理乌托邦的设想。然而，作者指出，这些传统讨论在a16z关注该领域后才被赋予新的意义。 a16z于2025年投资了预测市场平台Kalshi，并将其估值推高至220亿美元。其核心理念在于，预测市场为用户提供了对抗后现代疏离感的“在场感”。通过真金白银的下注，用户从被动观察者转变为能介入和影响事件的“超级观察者”，从而获得对事件真实性与重要性的解释权。这使其成为a16z构建新媒体帝国的关键拼图。文章以媒体公司MTS为例，说明a16z所倡导的“新媒体”是一种全频段、高强度的信息发布模式，旨在“接管时间线”。而Kalshi的独特价值在于，其市场交易数据凭借真实资金流动，具备了看似客观的权威性和强大的现实扭曲力场，能够影响公众认知与判断。这种能力正是其获得高估值的深层原因。

marsbit1小时前

marsbit1小时前

突发：OpenAI芯片元老加入Anthropic

OpenAI自研芯片团队早期核心成员Clive Chan宣布离职，并已正式加入竞争对手Anthropic。Clive Chan是OpenAI硬件团队的“002号员工”，全程参与了公司自研芯片项目从组建到推进的过程。他在声明中高度评价了OpenAI芯片团队的人才实力，但表示自己渴望“重新攀登一座新山”，因此选择加入Anthropic，并对Anthropic团队的人才、价值观和野心印象深刻。关于OpenAI的自研芯片进展，Clive Chan未透露更多细节，但提及了OpenAI与博通在2025年10月公布的合作计划。根据该计划，双方将共同建设总规模达10GW的AI加速器系统，首批机架预计在2026年下半年开始交付。 Clive Chan毕业于滑铁卢大学，曾先后在谷歌、SpaceX、特斯拉等公司从事AI基础设施相关工作，于2024年1月加入OpenAI。此次跳槽后，Anthropic内部员工表示了欢迎，而网友则调侃这像“离开皇马加盟巴萨”。近期，OpenAI与Anthropic之间人才流动频繁，此前OpenAI联合创始成员Andrej Karpathy也已加盟Anthropic。随着Anthropic近期完成巨额融资，估值逼近万亿美元，其与OpenAI在人才和资源上的竞争将持续受到关注。

marsbit1小时前

marsbit1小时前

a16z 全球化转向：VC 正在成为美国科技联盟的「推手」

a16z（Andreessen Horowitz）发布公告，宣布其全球化战略发生重要转向：不再局限于海外寻找项目和投资，而是将自身定位融入更大的技术竞争与国际盟友合作框架中。面对AI、机器人、国防科技等成为国家竞争焦点的领域，创业公司面临复杂的国际监管、产业政策和地缘关系。a16z通过设立东京办公室、任命Anne Neuberger负责全球事务、将投资者关系团队升级为全球合作伙伴团队等举措，主动应对这一变化。公告明确将a16z的全球网络与“美国及其盟友”的技术领导力绑定，标志着技术创新已进入国家安全和国际竞争语境。未来，风投的角色不仅是提供资本和增长建议，更要帮助创始人对接关键市场、政府机构和战略资源，理解多国政策环境。a16z旨在成为连接创业公司、国家能力、产业资源和全球资本的组织者，支持盟友国家在关键创新领域的合作，并助力投资组合公司进行全球扩张。这一布局体现了硅谷资本对全球科技竞争新格局的主动站位。

marsbit1小时前

marsbit1小时前

解读Agent商业、支付与基础设施的真相

作者基于一年来为Agent经济构建基础设施的经验，指出当前Agent商业尚未形成真实、规模化的市场需求，初创公司面临结构性挑战。文章分析了四个关键场景： 1. **Agent对商户**：目前电商体验中，聊天界面在视觉比价购物上逊于传统界面，商户接入多出于防御性“优化”心态。对话式商业在如外卖等高頻、低决策场景有潜力，但受限于平台开放性和成本。 2. **Agent对API**：开发者现有支付方式（如预付）已能处理低频、小额的API调用成本问题。真正的机会在于服务长尾、小众的供应商市场，但规模有限。 3. **Agent对Agent**：这是长期的愿景，涉及机器间的自动交易与结算，需求真实但当前市场几乎为零，需要专用的基础设施。 4. **Agent对金融**：这是唯一存在现成需求和付费客户的领域。将AI嵌入金融工作流是自然演进，但竞争激烈，老牌机构优势明显。文章认为，行业巨头因资金充足和战略防御而持续投入，但对初创公司而言，真正的机会并非单纯构建支付层。支付只是更宏大问题——**Agent与人类的协同工作、验证与结算**——的一部分。未来，解决协同问题的公司将主导市场，而非支付服务商。作者团队已转向一个存在真实需求、快速增长且未被充分服务的领域。

marsbit1小时前

marsbit1小时前

Kalshi、MTS 与 a16z 的野望

本文探讨了预测市场在2025年成为投资、加密和媒体领域共同关注焦点的现象，并着重分析了其精神内核的演变及其与风投机构a16z所倡导的“新媒体”愿景的契合。文章首先回顾了预测市场的思想渊源：从哈耶克关于市场作为分散知识协调机制的观点，到罗宾·汉森设计对数市场评分规则（LMSR）以激励信息真实披露，乃至衍生出的“未来统治”（Futarchy）治理乌托邦构想。然而，作者指出，a16z在2024-2025年投资估值飙升的预测市场平台Kalshi，为此领域注入了新的精神内涵——“在场感”。在人们与现实世界日益疏离的后现代语境下，预测市场提供了一种通过真金白银下注来介入和“预测”未来的方式，使用户从被动观察者转变为主动的“超级观察者”，从而对抗不确定性与无力感。当足够多人使用并依赖这种媒介时，市场本身将对事件的真实性与重要性获得解释权，这正是a16z构建新媒体帝国的关键拼图。最后，文章以媒体公司MTS为例，说明a16z的“新媒体”是一种全频段、高烈度的信息工程，旨在“接管时间线”。而Kalshi的核心价值在于，它通过真实的交易数据构建了一种强大的“现实扭曲力场”，其显示的市场概率能深刻影响公众认知与判断，这种赋予私营公司的社会影响力是其获得高估值的根本原因。

链捕手1小时前

交易

现货

合约

The Night Before the AI Model Shakeout

文章摘要

I. Models Are Becoming Commoditized

II. The Music Hasn't Stopped, But the Seats Are Already Insufficient

III. More Expensive, Yet Less Valuable

IV. The Sense of Deja Vu from the "Four AI Dragons"

V. Three Paths, Three Bets

VI. The End Goal of Fundraising Isn't Expansion

相关问答

你可能也喜欢

Kalshi、MTS 与 a16z 的野望

突发：OpenAI芯片元老加入Anthropic

a16z 全球化转向：VC 正在成为美国科技联盟的「推手」

解读Agent商业、支付与基础设施的真相

Kalshi、MTS 与 a16z 的野望

交易

热门文章

加密市场宏观研报：原油飓风、AI巨浪与比特币的十字路口

自主AI经济的基石：Talus如何重塑链上智能代理

火币成长学院：AI与Crypto深度研报：算法与账本的共生时代

相关讨论

热门问答

热门分类

热门标签