DeepSeek Announces Permanent Price Cut, But Liang Wenfeng Is Not Trying to Be a "Cyber Bodhisattva"

marsbitPublicado em 2026-05-24Última atualização em 2026-05-24

Resumo

DeepSeek has announced a permanent 75% discount on its V4-Pro API, significantly reducing its token prices. This move stands out as a major industry-wide price cut while competitors like Anthropic, OpenAI, and Google have been quietly raising theirs. The article contrasts this strategy with the broader trend of AI becoming more expensive, citing examples of companies like Microsoft and Uber struggling with high token costs as usage soars. While CEO Liang Wenfeng is hailed by some as a "Cyber Bodhisattva" for this普惠 approach, the article argues this is a strategic business choice, not mere altruism. DeepSeek's ability to maintain low prices is attributed to several structural advantages: lower-cost AI talent in China, the impending use of domestic昇腾 hardware for further cost reductions, and, most critically, access to China's cheaper and more abundant energy infrastructure, which drastically reduces the electricity costs dominating AI operations. The analysis suggests that for many commercial applications, a "good enough" model that is radically cheaper (e.g., 1% to 11% of GPT-5.5's cost) is more valuable than the absolute top-tier model. This allows for vastly more experimentation and iteration within a budget. Therefore, as AI generally becomes more expensive, DeepSeek's cost-competitiveness—rooted in China's energy and talent advantages—becomes its core strategic value and differentiator in the global market.

By | Luo Chao Channel

DeepSeek has announced it will make the 75% discount on its V4-Pro API "permanent," effective globally.

The final pricing structure: the base input price has been reduced from $1.74 / million tokens to $0.435 / million tokens, and the output price from $3.48 / million tokens to $0.87 / million tokens. For input cache hits across the entire API product line, DeepSeek has implemented even greater concessions: $0.003625 / million tokens, all following a Pinduoduo-style rock-bottom pricing model.

A wave of voices immediately emerged on social media, including X, hailing Liang Wenfeng as the Cyber Bodhisattva, the "Feng God," or "Saint Liang" of the AI circle. The sentiment doesn't just come from the cheapness itself — DeepSeek has long been called the "Pinduoduo of AI," free for C-end users and affordable for B-end, and the world is used to its low prices. The difficulty of this round of price cuts lies in the fact that AI prices are rising globally.

Reports indicate that in DeepSeek's ongoing record-breaking Series A financing, Liang Wenfeng will personally contribute up to 20 billion RMB, accounting for 40% of the total funding. For most companies, the first thing after financing is to strengthen cash flow and make performance look better, but Liang Wenfeng does not plan to use commercialization promises to attract investors, instead insisting on open-source and pursuing AGI. This price cut truly follows through on his word. The last time someone bravely expressed a lack of desire to make money was Pinduoduo, whose co-founder explicitly told investors in a 2024 earnings call: "Starting from Q3, our profits will gradually decline and won't rebound in the short term. In the long run, the decline in profitability is inevitable." The stock price plummeted.

Sam Altman talks about democratizing AI, but OpenAI is rapidly moving towards the opposite of its name: CloseAI. Liang Wenfeng, however, is personally ensuring that everyone, every enterprise, can use AI as affordably as possible. But is Liang Wenfeng truly a living bodhisattva? Not at all. He is an entrepreneur. Open-source and affordability are merely choices of business model, which is exceptionally commendable now and will become increasingly rare in the future.

Because: AI is getting more expensive.

This week, Microsoft canceled internal Claude Code licenses because the token-based billing was too high to bear. Microsoft heavily invested in OpenAI and also provides Azure cloud services to Anthropic, possessing computing resources envied by all enterprises, yet the token costs still hurt. Coincidentally, Uber's CTO reported an embarrassing situation to management in April: the company's AI budget prepared for the entire year of 2026 was exhausted within four months, with 95% of engineers using AI programming tools monthly and 70% of submitted code generated by AI. The exact quote was: "I'm back to the drawing board because the budget I thought I would need is blown away already."

The fact that big companies' token budgets are burning through much faster than expected certainly has to do with employees "not treating beans as rations" and burning tokens recklessly, but the root cause of token budget tension is that AI is getting more expensive. US AI software prices have risen by 20% to 37% over the past year. The "Big Three" — Anthropic, OpenAI, and Google — have all quietly increased the effective price for the same AI output over the past six months.

(Source: X)

The prevailing narrative used to be: "The larger the scale of AI application, the higher the level of industrialization, the lower the cost, and the happier enterprises will be." How naive.

And this trend will not reverse. Price is determined by supply and demand, not cost, but the supply-demand relationship for AI has completely reversed in 2026. Before, big companies begged people to use AI, needing to educate the market and promote the technology. AI was always subsidized — how many free "Thousand Questions" cups of milk tea have you had? Now? People are increasingly actively using it, "can't live without it after the first puff." AI programming, AI documentation, AIGC, even AI search are becoming more widespread. The era of AI subsidies is completely over.

The more people use it, the greater the demand, the tighter token resources become, so the computing power shortage has spilled over from GPUs to CPUs, storage, and even bandwidth. Intel, Micron, SK Hynix, Samsung Electronics, SanDisk, and domestic companies like Jiangbolong and the two "Longs" are feasting alongside Nvidia. Where does the doubling revenue growth of semiconductor giants in 2026 come from? It certainly isn't from the OpenAI-Oracle-Microsoft triangular closed-loop investment, right? Enterprises' pain is just beginning. And AI products like ChatGPT, Claude, Gemini, and Doubao emphasizing strict hierarchies between free and paid tiers will also make individual users increasingly conflicted.

It's like ride-hailing: during the craze, you could take premium cars to and from work for free, with capital footing the bill. After user habits were established, subsidies ended, prices returned to normal levels, and those who needed to take the subway still did. AI is the same. Therefore, against the backdrop of rising token prices across the industry, DeepSeek's insistence on cutting prices is no longer just a display of personal courage as a "Cyber Bodhisattva," but demonstrates a kind of reverse pricing power: we can be this cheap, still operate normally, and quality doesn't drop.

If Liang Wenfeng wanted, DeepSeek wouldn't need to be this cheap at all. So people are starting to worry: Will DeepSeek become the Linux of the AI era? Huge influence, but not making big money. Linux's contribution to the IT industry is far greater than Windows or Android (which itself is based on the Linux kernel), but it's open-source and commercially hasn't spawned giants like Microsoft or Google. DeepSeek currently has huge influence, but its commercial capabilities are far inferior to the Silicon Valley Big Three, and even cannot compete with the three domestic players: Kimi, MiniMax, and Zhipu. 2025 revenue ranking of the "Four Little Dragons": Zhipu (2025 revenue 724 million RMB) > MiniMax (2025 revenue approx. 560 million RMB) > Moonshot AI (approx. 200 million) > DeepSeek (unknown but lower).

Liang Wenfeng made money through AI quantitative trading and can personally invest 20 billion in DeepSeek, but the story of "running on love" cannot last forever.

Furthermore, under the open-source model, others can also distill, deploy, and fine-tune, so DeepSeek's technological moat will become thinner. So you always see news like this: "After Zhipu GLM-5.1 was open-sourced, it refreshed global scores on the SWE-bench Pro benchmark," "Xiaomi's MiMo-V2.5-Pro tops the global open-source large model leaderboard..." A joint report from MIT and Hugging Face shows that in the past year, open-source models developed in China accounted for 17.1% of global downloads, surpassing the US's 15.8%, ranking first globally.

No wonder there are increasing voices in Silicon Valley saying: There must be a US version of DeepSeek; we cannot sit back and watch the AI industry replay the stories of Shein, Temu, or TikTok. "If an open-source champion does not rise in the United States, the world will run on open-source models and software from whichever country can produce the strongest, most stable, cheapest, customizable, scalable models that meet personal and commercial needs." Topics involving great power competition often sound grand, but the competition behind them is real.

Behind DeepSeek's rise lies the narrative of indigenous substitution. V4's support for Ascend brought joy, and driven by domestic computing power, the price competitiveness DeepSeek currently displays is just the appetizer. In the technical report, DeepSeek indicated that after the batch release of Ascend 950 super nodes in the second half of the year, the price of V4-Pro will be significantly reduced further. The best is yet to come.

There's also the advantage of high-level AI talent. AI talent is expensive to a "luxury" level, but China's is relatively cheaper. Lei Jun making headlines for poaching Luo Fuli from DeepSeek with a ten-million-yuan salary, while Zuckerberg was offering $1 billion to poach people, including via acqui-hires. But the difference between what a $1 billion person and a ten-million-yuan person produces is clearly not 700 times. The price gap in AI talent will actually translate into a systemic price gap in the token production system.

The bigger competitive advantage lies in the energy system, which is the first layer of Jensen Huang's AI five-layer cake.

The end of AI is computing power, and the end of computing power is electricity. In April 2026, DeepSeek posted job listings for Senior Data Center Operations Engineers and Senior Delivery Managers in Ulanqab, Inner Mongolia, indicating its move to build token factories in the west, pushing cost advantages from the software layer down to the physical layer. The last time I wrote about Ulanqab in an article was when Kuaishou was building a data center there: close to power plants, suitable climate for cooling. Moreover, green electricity prices in western China are about 0.2-0.3 RMB/kWh, only 1/5 to 1/4 of those in Europe and the US.

It's not just that western green electricity is competitive. According to 2025 data from the International Energy Agency, China's total installed power generation capacity has exceeded 2,300 GW, accounting for about 22% of the global total, ranking first; the US has about 1,300 GW. More crucially, China has the world's most complete power structure: thermal, hydro, wind, nuclear, and photovoltaic all included. Data shows that China's industrial electricity prices have long been maintained at $0.06 to $0.08/kWh, while California's industrial electricity prices are close to $0.18/kWh, and even exceed $0.25/kWh in parts of Germany. This means that training a ten-thousand-card cluster in China is naturally dozens of percentage points cheaper than in Europe and the US.

In the operating costs of AI large models, electricity costs account for 60%-70% of total operating costs. It's not just running the model that uses electricity; cooling is a major part. The "infrastructure maniac" even builds data centers directly on the seabed, with offshore wind power input nearby and seawater circulation for free cooling. Then there are large-scale projects like "West-East Electricity Transmission" and "East Data West Computing." The ability to regionally dispatch power and computing is extremely strong. Guizhou, Inner Mongolia, Ningxia were already core nodes of "East Data West Computing." The pathway for moving AI computing power centers west was prepared long ago.

Using Chinese AI essentially means using AI trained by a more competitive energy system — more economical, more affordable AI. This is one reason why the overseas revenue of Kimi, MiniMax, etc., exploded after the Spring Festival. It's not just that the algorithms are stronger; it's that they have an electricity price cheat.

Nvidia can define the price of high-end computing power, but DeepSeek and others are grasping the pricing power of tokens. You might say cheap AI is no good. AI is indeed you get what you pay for. DeepSeek V4 only narrowed the gap between open-source and closed-source to a historical minimum. The company candidly acknowledges the objective gap with top models like GPT, and it's not multimodal — it can recognize images but not generate them.

But this hasn't stopped the community from flocking to DeepSeek. The reason: most real business scenarios don't require calling the world's strongest model every time. Consulting, customer service, summarization, translation, code completion, enterprise knowledge bases, automated workflows — these don't require the highest intelligence, but "barely usable + cheap enough + stable enough." When DeepSeek V4's inference cost is only about 1% (Flash) to 11% (Pro) of GPT-5.5's, an enterprise with the same budget can call tens of times more tokens, try more prompt chains, iterate more agent workflows, and the final output might even be better. After all, AI itself is a "probability" game. If it's cheap enough, getting results by making do is perfectly acceptable.

Therefore, the more expensive AI gets, the more valuable DeepSeek's cheapness becomes, and the more valuable the company DeepSeek becomes. Liang Wenfeng and his investors understand this better than anyone.

Perguntas relacionadas

QWhat is the main reason for the recent wave of social media calling Liang Wenfeng a 'Cyber Bodhisattva'?

AThe main reason is DeepSeek's announcement to make the 75% discount on its V4-Pro API pricing permanent, lowering input and output token costs significantly. This action stands in stark contrast to the global trend of major AI companies raising their prices, earning Liang praise for making AI more accessible and affordable.

QAccording to the article, what is the fundamental factor making AI increasingly expensive?

AThe fundamental factor is the reversal of the supply-demand relationship for AI in 2026. The era of AI subsidies has ended as usage has become widespread and indispensable. Increased demand leads to greater strain on token resources, causing shortages across the compute stack (GPU, CPU, storage, bandwidth), which in turn drives up costs.

QWhat competitive advantages does China possess, according to the article, that contribute to the lower cost of AI models like DeepSeek?

AChina possesses several competitive advantages: 1) A more affordable and abundant talent pool for AI engineers. 2) A highly competitive energy system with significantly lower industrial electricity prices (as low as 1/5 to 1/4 of Western rates), which is crucial as power can constitute 60-70% of AI operational costs. 3) Strong regional power and compute调度 capabilities through initiatives like 'West-East Power Transmission' and 'East Data West Computing'.

QWhy might DeepSeek's low-cost strategy be commercially viable even as it forgoes higher profits in the short term?

ADeepSeek's strategy is viable because its extreme cost efficiency grants it反向定价权 (reverse pricing power). In a market where AI is getting universally more expensive, DeepSeek's ability to operate profitably at drastically lower prices demonstrates superior operational efficiency. Furthermore, for many real-world business applications (customer service, summarization, code completion), 'good enough' performance at a fraction of the cost allows businesses to experiment and scale more, potentially delivering better overall results.

QWhat concern does the article raise regarding the long-term sustainability of DeepSeek's open-source and low-cost model?

AThe article raises the concern that DeepSeek could become the 'Linux of the AI era'—immensely influential and foundational but not a major commercial powerhouse itself. The open-source model allows others to distill, deploy, and fine-tune its models, potentially eroding its technical moat. While impactful, its current commercial revenue lags behind competitors like智谱, MiniMax, and月之暗面, raising questions about the long-term sustainability of a model heavily reliant on founder investment and '发电' (generating power with love, i.e., idealistic drive).

Leituras Relacionadas

SpaceX, OpenAI, Anthropic: The Three AI Giants Racing for IPO, Which One Is Worth Betting On?

SpaceX, OpenAI, and Anthropic are poised for historic IPOs within weeks, potentially raising a combined $180 billion—a sum exceeding the entire internet bubble's fundraising. The hosts of the Limitless Podcast argue this isn't just individual company financing but an unprecedented capital concentration for AI infrastructure, driven by an insatiable need for compute, data centers, power, and chips. SpaceX's IPO is notable for reportedly changing market index rules to allow faster inclusion, potentially funneling trillions in passive retirement funds into its stock, despite its unproven space-based data center business model. In contrast, Anthropic demonstrates explosive growth, with ARR reportedly hitting $45 billion and approaching profitability, fueled by strong enterprise adoption of products like Claude Code. Google's separate $80 billion raise highlights the immense capital pressure, even for giants. The discussion acknowledges bubble risks but leans optimistic. The hosts contend the massive spending is building essential physical infrastructure for the next technological era. A key bottleneck isn't capital but the real-world limits of chip manufacturing and construction speed. As long as demand for AI compute outstrips supply, this investment cycle represents a foundational build-out rather than a purely financial bubble. All three companies are seen as foundational bets on the future, with Anthropic often cited as the most immediately compelling due to its proven revenue trajectory.

marsbitMesmo agora

SpaceX, OpenAI, Anthropic: The Three AI Giants Racing for IPO, Which One Is Worth Betting On?

marsbitMesmo agora

From 'Old Guys' to 'New Favorites': How AI Is Revaluing Old Infrastructure from Dell to Nokia?

From "Vintage Tech" to "New AI Darlings": How AI Revalues Old Infrastructure One year ago, tech giants like Dell, Nokia, Cisco, and Western Data were seen as slow-growth, low-valuation stories, far from the AI spotlight dominated by players like Nvidia. Now, these legacy tech stocks are gaining market attention, sparking debate on whether this is genuine industry revaluation or a temporary narrative. As AI moves from model parameters to real-world data centers, the market is recognizing companies with proven delivery and infrastructure capabilities. This shift marks a change in the AI investment thesis: from pure model and GPU focus to the complex systems engineering required for deployment. Companies like Dell, HPE, and Corning are being revalued not for being "sexy" AI innovators, but for their decades of accumulated expertise in supply chains, enterprise delivery, and infrastructure—assets that have become critical in the AI buildout phase. The revaluation is unfolding across three key infrastructure lines: 1. **Servers & System Integration:** Dell and HPE are emerging as crucial system integrators or "general contractors" for AI data centers, translating GPU orders into complete, deployable server racks integrated with power, cooling, and networking. 2. **Networking & Connectivity:** AI's scale demands robust high-speed connections. Corning (fiber optics), Nokia (AI-RAN, 6G), and Cisco (data center switches) are gaining importance for enabling efficient data transfer within and between AI clusters. 3. **Storage:** Beyond high-speed memory (HBM/DRAM), the AI data explosion is driving demand for high-capacity hard drives (HDDs) from companies like Western Digital and Seagate to handle training data, logs, and cold storage cost-effectively. For this revaluation to be substantive and not just a narrative, three criteria are key: 1) Concrete AI-related order and revenue growth (e.g., Dell's AI server sales), 2) Upward revisions to company financial guidance, and 3) Sustainable improvements in profit quality, not just top-line revenue spikes. In essence, AI's transition to a real construction phase is re-pricing "old assets" against "new demand." The opportunity, however, is selective. Only those legacy firms that are demonstrably integrated into the capital expenditure chains of data center and enterprise AI deployment are likely to experience a true "logic re-rating" rather than just a temporary valuation bounce.

marsbitHá 6m

From 'Old Guys' to 'New Favorites': How AI Is Revaluing Old Infrastructure from Dell to Nokia?

marsbitHá 6m

The Merger of Codex and ChatGPT Marks the Beginning of a Major Reshuffle in Programming Tools

OpenAI is shifting its strategic focus from ChatGPT to Codex, merging them along with the browser tool Atlas into a unified desktop super-app. This move signals an internal belief that Codex, originally a programming tool, represents the next evolution of AI more than conversational models like ChatGPT. Over the past year, Codex's weekly active users have surged past 5 million. The key distinction is that while ChatGPT answers questions, Codex executes tasks. Enterprises increasingly value this ability to get work done over simply receiving advice. Consequently, Codex is attracting professionals beyond developers, including analysts, bankers, marketers, and product managers. OpenAI's reorganization and increased investment in Codex stem from recognizing that the future of AI competition lies in execution capabilities, not just conversation. The company is launching role-specific plugins (e.g., for data analysis, sales, design) to transform Codex into a broad knowledge work platform that automates and redefines white-collar workflows. Beyond being a tool, Codex reflects OpenAI's ambition to redefine software. New features like "Sites"—which generates interactive websites from documents—and collaborative "Annotations" aim to create a paradigm where the AI understands the goal and handles the tools and steps, functioning more like a digital colleague than traditional software. The ultimate goal is a unified experience where the user cares only about the completed task.

marsbitHá 16m

The Merger of Codex and ChatGPT Marks the Beginning of a Major Reshuffle in Programming Tools

marsbitHá 16m

Interpreting Investment Opportunities in the Age of Great Navigation, Invesco Great Wall Fund Releases '2026 Report on Chinese Enterprises Going Global'

Invesco Great Wall Fund has released its "2026 China Corporate Globalization Report," titled "The 'Great Navigation Era' of Chinese Enterprises." The report analyzes the new trends and investment opportunities as Chinese companies expand globally, moving from simple product exports to comprehensive overseas operations involving services, branding, and local production. Driven by factors like trade friction, the pursuit of higher profit margins abroad, and policy support, globalization is becoming essential for Chinese companies. The report outlines an evolution: from early product export ("Globalization 1.0") to the current "Globalization 2.0," characterized by overseas capacity, capital goods investment, consumer brand expansion, and service exports. Chinese firms' competitive advantages are highlighted, including a vast engineer talent pool, low-cost and robust infrastructure, and complete industrial clusters. Specific sectors with significant出海 potential are identified: * **Capital Goods** (e.g., engineering machinery, power equipment): Benefiting from global demand, especially in Belt & Road markets and the AI-driven power grid upgrade cycle. * **Consumer Brands**: Transitioning from cost to brand advantage, leveraging供应链 efficiency. * **Technology & Innovation**: Including AI applications, optical modules within global tech supply chains, and new energy vehicles focusing on local production. * **Pharmaceuticals**: Chinese biotech firms are becoming preferred partners for global pharma, with potential for breakthrough drugs in areas like oncology and weight loss. The report concludes that corporate globalization represents a sustained, core theme for China's capital markets, though companies must navigate challenges like geopolitics and localization.

marsbitHá 28m

Interpreting Investment Opportunities in the Age of Great Navigation, Invesco Great Wall Fund Releases '2026 Report on Chinese Enterprises Going Global'

marsbitHá 28m

Bitcoin Bull Michael Saylor Hints At Next Purchase With Cryptic Post

MicroStrategy's stock (MSTR) has declined sharply, dropping over 9% in a day and 25% over the past month, following the company's first Bitcoin sale in years. The firm sold 32 BTC for approximately $2.5 million, raising investor concerns about further sales. Meanwhile, Executive Chairman Michael Saylor sparked speculation of a renewed Bitcoin purchase with an unusual midweek social media post stating "Back to Work," breaking from his typical Sunday routine. The company holds about $900 million in cash and recently raised $128 million through share sales. Despite the recent activity, MicroStrategy remains the world's largest corporate Bitcoin holder with 843,706 BTC. Additional speculation arose from the firm moving Bitcoin to Coinbase Prime.

bitcoinistHá 46m

Bitcoin Bull Michael Saylor Hints At Next Purchase With Cryptic Post

bitcoinistHá 46m

Trading

Spot

Futuros