Giants Collectively Raise Prices: Is the AI Price Hike Wave Coming? Can We Still Afford Lobster Employees?

marsbit | Published on 2026-04-13 | Last updated on 2026-04-13

Abstract

Major AI companies, including Alibaba Cloud, Baidu Intelligent Cloud, Tencent Cloud, and Zhipu, have recently announced significant price increases for AI computing and storage services, with hikes ranging from 5% to over 460% in some models. This trend follows similar moves by global giants like Amazon AWS and Google Cloud earlier this year. The price surge is driven by explosive demand for computing power, fueled by the rapid adoption of AI agents like OpenClaw (referred to as "Lobster" in the article), which consume tokens at rates dozens or even hundreds of times higher than traditional AI applications. This has created a severe supply-demand imbalance. Additionally, shortages in high-end hardware—such as AI chips and high-bandwidth memory (HBM)—have constrained computing capacity and raised operational costs. The industry is shifting away from loss-leading pricing strategies toward value-based models, prioritizing sustainable development over market-share competition. A new "token economy" is emerging, where pricing is increasingly based on token usage, complexity, and speed rather than flat fees. This reflects AI computing's evolution from a generic service to a specialized, high-value resource. Some companies are even considering token allowances as part of employee benefits, highlighting its growing role as both a production tool and a cost factor. The article concludes by questioning whether AI services will remain affordable as compute costs continue to rise.

In recent years, as AI has developed rapidly, China's major internet giants have been actively deploying large AI models. The recent popularity of Lobster in particular has been a windfall for AI providers. As many expected, however, the wave of AI price hikes has now arrived. Overseas giants such as Amazon and Google and domestic giants such as BAT have collectively raised prices, and domestic large model companies such as Zhipu have followed suit. This raises the question: with the price hike wave here, can we still afford Lobster employees?

I. Giants Collectively Raise Prices

According to a report by Haibao News, recently, concept stocks related to optical modules and computing hardware have surged against the trend. Behind this is the OpenClaw (commonly referred to as "Lobster" domestically) craze sweeping from abroad to China this year, causing a surge in Token calls and directly driving up computing power demand. The Token industry chain has become a new core focus.

Alibaba Cloud and Baidu Intelligent Cloud have successively announced price increases for AI computing power-related products. Alibaba Cloud's official announcement stated that due to the global AI demand explosion and supply chain price hikes, Alibaba Cloud's AI computing power, storage, and other products have seen price increases of up to 34%. According to insiders, Alibaba Cloud's MaaS business platform, Bailian, achieved its highest growth rate in history from January to March this year.

Baidu Intelligent Cloud stated that due to the rapid development of global AI applications, computing power demand continues to rise, and the costs of core hardware and related infrastructure have significantly increased. Therefore, it decided to raise the prices of AI computing power-related products and services by 5% to 30%, with parallel file storage increasing by about 30%.

Earlier this year, Amazon AWS and Google Cloud announced price increases for some services. On January 22, AWS announced a 15% price increase for EC2 used for large model training. On January 27, Google Cloud significantly adjusted prices for data transmission services such as CDN Interconnect, Direct Peering, and Carrier Peering, with a 100% increase in North America.

Domestically, Tencent Cloud announced a price increase for large model services on March 11, adjusting the billing strategies for some models. For example, the input price for the Tencent HY2.0 Instruct model increased sharply from 0.0008 yuan per thousand Tokens to 0.004505 yuan per thousand Tokens, a rise of 463.13%.
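The percentage quoted above can be checked with a few lines of arithmetic. The following is a minimal sketch, not anything published by Tencent Cloud: the function names are illustrative, the two prices are the ones cited in the article, and the 1,000,000-token workload is a hypothetical figure chosen only to make the difference tangible.

```python
from decimal import Decimal, ROUND_HALF_UP


def pct_increase(old_price, new_price):
    """Percentage rise from old_price to new_price, rounded half-up to 2 places."""
    old, new = Decimal(old_price), Decimal(new_price)
    rise = (new - old) / old * Decimal(100)
    return rise.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


def cost_for_tokens(tokens, price_per_thousand):
    """Cost in yuan for a given number of tokens at a per-thousand-token price."""
    return Decimal(tokens) / Decimal(1000) * Decimal(price_per_thousand)


# Input prices in yuan per thousand Tokens, as cited in the article.
OLD_PRICE = "0.0008"
NEW_PRICE = "0.004505"

print(pct_increase(OLD_PRICE, NEW_PRICE))      # percentage rise
print(cost_for_tokens(1_000_000, OLD_PRICE))   # hypothetical 1M-token workload, old price
print(cost_for_tokens(1_000_000, NEW_PRICE))   # same workload, new price
```

Decimal arithmetic is used instead of floats so that the rounding matches the two-decimal figure reported in the article. Under these prices, a hypothetical one-million-token input workload moves from 0.8 yuan to 4.505 yuan.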

Moreover, the price hike wave is not over. On April 8, Zhipu announced its third price increase this year, raising prices by 10% with the release of its new flagship model GLM-5.1, just one month after its previous increase of 30% or more. In contrast to the price cuts and free trials of 2025, the signal is now clear: domestic large AI models are entering an era of collective price increases.

II. Is the AI Price Hike Wave Coming? Can We Still Afford Lobster Employees?

Recently, the major large model players collectively announced price increases for some of their cloud services and related products, giving those who worried about being "distilled" a glimmer of hope: compared with expensive Tokens, we humans seem cheaper. So, what is the industrial logic behind this price hike wave?

First, the explosion of AI agents has led to a surge in computing power demand. From the perspective of structural changes on the demand side, the popularity of new-generation AI agents like OpenClaw has completely reshaped the underlying logic of computing power consumption. In the early days of large model applications, user behavior was mostly limited to single-round dialogues or simple text generation, where Token consumption was relatively limited and predictable. However, with the maturity of autonomous agent technology, AI is no longer a passive Q&A machine but a digital employee capable of autonomous planning, tool invocation, and executing complex tasks.

The daily per capita Token consumption of a mature agent is often dozens or even hundreds of times that of traditional chat users. This exponential demand surge is not linear business growth but a dimensional leap. When massive numbers of agents are online simultaneously, engaging in high-frequency logical reasoning and data interactions, the computing power infrastructure originally designed for human interaction suddenly faces enormous throughput pressure.
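To make the "dozens or even hundreds of times" claim concrete, here is a rough sketch of how daily cost scales with the consumption multiplier. The baseline of 20,000 Tokens per day for a chat user and the chosen multipliers (30x and 100x) are hypothetical assumptions, not figures from the article. The per-thousand price is the Tencent HY2.0 Instruct input price cited earlier, and it is applied to all Tokens purely for simplicity.

```python
from decimal import Decimal

# Input price cited in the article, in yuan per thousand Tokens.
PRICE_PER_THOUSAND = Decimal("0.004505")

# Hypothetical baseline: Tokens consumed per day by a traditional chat user.
CHAT_TOKENS_PER_DAY = 20_000


def agent_tokens(chat_tokens, multiplier):
    """Tokens per day for an agent consuming `multiplier` times a chat user."""
    return chat_tokens * multiplier


def daily_cost(tokens_per_day, price_per_thousand=PRICE_PER_THOUSAND):
    """Daily cost in yuan at a flat per-thousand-token price."""
    return Decimal(tokens_per_day) / Decimal(1000) * price_per_thousand


# Multipliers of 30 and 100 stand in for "dozens" and "hundreds" (assumed values).
for multiplier in (1, 30, 100):
    tokens = agent_tokens(CHAT_TOKENS_PER_DAY, multiplier)
    print(f"{multiplier:>3}x: {tokens:>9,} tokens/day -> {daily_cost(tokens)} yuan/day")
```

Because the price per Token is flat in this sketch, cost grows in direct proportion to the multiplier. The pressure described in the article comes from the multiplier itself and from the number of agents running at once.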

This explosive growth in demand directly breaks the old supply-demand balance, rapidly shifting computing power resources from "relatively abundant" to "extremely scarce." When marginal utility rises sharply and supply elasticity is insufficient, price increases are not only an inevitable reflection of market laws but also a necessary means to screen high-value application scenarios and curb inefficient computing power waste.

Second, the supply-demand imbalance of core hardware leads to computing power tension. From the perspective of hard constraints on the supply side, the supply-demand imbalance of core hardware such as high-performance computing chips and high-bandwidth memory (HBM) forms the physical foundation of this price hike wave. Although domestic cloud vendors have invested heavily in building a domestic computing power ecosystem over the past few years, capacity bottlenecks in high-end training and inference chips remain severe globally. In particular, HBM, the "blood" of large models, has high technical barriers and long expansion cycles, and it has become a key bottleneck restricting the release of computing power.

Currently, computing power is no longer merely about stacking servers but a sophisticated system composed of advanced process chips, high-speed interconnection networks, and high-bandwidth storage. The shortage of core hardware has significantly increased the marginal cost of computing power supply, and cloud vendors can no longer dilute costs through simple economies of scale. This rigid constraint on the supply side forces the industry to re-examine the pricing mechanism of computing power. When "computing power is power" becomes a consensus, vendors with stable, high-performance computing power supply capabilities naturally have stronger pricing power. The current price increase is actually a reasonable revaluation of the value of scarce hardware resources and an inevitable result of cost pressure transmission from the upstream industry chain to the downstream.

Recently, many of my friends in tech companies, especially CTOs, have been complaining that the prices of storage chips and servers are now unaffordable. The battle for computing power has instantly become a battle for costs, which is the most noteworthy aspect at present.

Third, the industry's pricing logic of "exchanging price for volume" has fundamentally changed. Looking back at the development of the cloud computing industry over the past decade, it's not hard to see a vicious cycle of malicious price wars. To compete for market share, major vendors have wielded the "price knife," not only squeezing competitors' survival space but also greatly compressing their own profit margins. At times, the price of cloud services even fell below operating costs, resulting in typical "involution." This "exchanging price for volume" model might have worked in the mobile internet era because marginal costs were close to zero and traffic monetization paths were clear.

However, the arrival of the AI era has completely broken this logic. Computing power is no longer a cheap commodity but an expensive specialized means of production. If low-price strategies continue, cloud vendors will be unable to cover the high costs of GPU procurement and power operations, let alone invest huge R&D funds for model iteration. Healthy industry development must be based on reasonable profits. Only when prices return to value can enterprises have the ability to reproduce and innovate.

The collective price increases by tech giants are actually a "collective rational return" for the industry. This marks the Chinese cloud computing market moving away from the "burning money subsidies" era and entering a value competition era centered on technical strength and service quality. This is extremely beneficial for building a healthy ecosystem for the entire industry, shifting the focus of competition from "who is cheaper" to "who is more stable, who is smarter, who can solve problems better," which is undoubtedly a positive signal of industry upgrading.

Fourth, Token economics is emerging, and tiered pricing will become the norm. Once, data centers were seen as "warehouses" for storing data, with their value mainly reflected in space leasing and data custody. In the AI era, data centers have evolved into "factories" producing intelligence, with their core output being high-value Tokens. This role change directly gives rise to a new pricing logic. Future AI services will no longer follow traditional annual/monthly subscriptions or pay-as-you-go models but will adopt tiered pricing based on dimensions such as Token throughput, response speed, and reasoning complexity.

This refined pricing strategy can more accurately match computing power demand in different scenarios, allowing high-real-time, high-complexity tasks to pay higher premiums while offline batch processing tasks enjoy lower costs. This is not only an innovation in business models but also a great improvement in resource allocation efficiency. Through price leverage, the industry will guide computing power resources to areas that create the greatest social value, avoiding resource misallocation and waste.
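One way to picture tiered pricing along the dimensions named above (Token volume, response speed, and reasoning complexity) is the sketch below. It is purely illustrative: the tier names, the multipliers, and the base price are hypothetical assumptions and do not reflect any vendor's actual price list.

```python
from dataclasses import dataclass
from decimal import Decimal

# Hypothetical multipliers, chosen only for illustration.
LATENCY_MULTIPLIER = {
    "realtime": Decimal("2.0"),   # low-latency, interactive tasks pay a premium
    "standard": Decimal("1.0"),
    "batch": Decimal("0.5"),      # offline batch processing gets a discount
}
COMPLEXITY_MULTIPLIER = {
    "simple": Decimal("1.0"),
    "reasoning": Decimal("3.0"),  # multi-step reasoning is priced higher
}


@dataclass(frozen=True)
class Job:
    tokens: int
    latency: str      # key into LATENCY_MULTIPLIER
    complexity: str   # key into COMPLEXITY_MULTIPLIER


def job_price(job, base_per_thousand):
    """Price in yuan: token volume times base price times both tier multipliers."""
    units = Decimal(job.tokens) / Decimal(1000)
    return (
        units
        * base_per_thousand
        * LATENCY_MULTIPLIER[job.latency]
        * COMPLEXITY_MULTIPLIER[job.complexity]
    )


BASE = Decimal("0.004")  # hypothetical base price, yuan per thousand Tokens

urgent = Job(tokens=1_000_000, latency="realtime", complexity="reasoning")
offline = Job(tokens=1_000_000, latency="batch", complexity="simple")

print(job_price(urgent, BASE))   # real-time, complex task
print(job_price(offline, BASE))  # offline batch task
```

With these assumed multipliers, the same Token volume costs twelve times more for the real-time reasoning job than for the offline batch job, which is the kind of price signal the article expects to steer computing power toward higher-value uses.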

In this context, many companies even offer Tokens directly as a new employee benefit. Whether Tokens are a means of production or a form of employee compensation and benefits is still debated, but there is no doubt that in the current era, Tokens have become an important computing power bottleneck restricting the development of AI companies. A friend of mine even complained that at the beginning of the year, the tech giant where they work required all employees to use Lobster and their digital twins, but recently, because of massive consumption, it imposed usage limits.

Finally, faced with increasingly expensive computing power resources, what does the future hold for the AI Lobster? Can we still afford it?

This article is from the WeChat public account "Jianghan Vision Observation," author: Jianghan Vision Observation

Related Questions

Q: Why are major cloud and AI companies like Alibaba Cloud, Baidu Cloud, and Tencent Cloud raising their AI service prices?

A: Global AI demand has surged, driven in particular by the popularity of AI agents like OpenClaw, which has sharply increased computing power requirements. In addition, rising costs of core hardware such as high-performance chips and HBM, coupled with supply constraints, have forced companies to adjust prices to reflect higher costs and resource scarcity.

Q: What is OpenClaw, and how has it impacted the AI industry's compute consumption?

A: OpenClaw (referred to as "Lobster" in the domestic context) is a representative AI agent that has gained significant popularity. By enabling autonomous planning, tool usage, and complex task execution, it drives Token usage that is dozens or even hundreds of times higher than traditional chat-based interactions, straining computing resources.

Q: How have hardware shortages contributed to the AI price increases?

A: Shortages of high-performance computing chips and high-bandwidth memory (HBM) have created a supply bottleneck. These components are critical for AI model training and inference, and their limited availability has raised marginal costs, making it difficult for cloud providers to scale affordably and prompting price hikes.

Q: What shift in pricing logic is occurring in the AI and cloud computing industry?

A: The industry is moving away from "exchanging price for volume" and price wars toward a value-based, tiered pricing model. This approach charges based on Token throughput, response speed, and reasoning complexity, aligning prices with the actual value and resource consumption of AI services and promoting healthier competition and sustainable innovation.

Q: What are the implications of the AI price surge for companies and employees using these services?

A: Companies face significantly higher operational costs, and some are imposing Token usage limits or treating Token allocation as an employee benefit. This trend may push businesses to optimize AI usage and prioritize high-value applications, and it could slow adoption if costs become prohibitive for smaller players.
