When Tokens Cost More Than People, 'AI Narrative' Runs Into Trouble

marsbitPublished on 2026-05-29Last updated on 2026-05-29

Abstract

Title: When Tokens Cost More Than People, the "AI Narrative" Hits Trouble The economic sustainability of corporate AI adoption is under scrutiny as token consumption soars while measurable business value remains elusive. Major companies like Uber and Microsoft report struggling to justify rising AI costs, with executives coining terms like "tokenmaxxing" to describe wasteful usage. Data reveals a stark picture: for every dollar spent on AI tokens, only 18 cents translates to user-facing value, with the rest consumed by bug fixes, rework, and friction. The debate splits into bullish and bearish camps. Bulls, like Goldman Sachs analysts, see current inefficiencies as growing pains, predicting a 24-fold increase in token demand by 2030 and a shift towards healthier metrics like "cost per effective action." They point to indicators of real productivity gains and argue current tech valuations are not in bubble territory. Bears, however, highlight an unsustainable model where value is heavily concentrated in semiconductor companies like Nvidia, funded by cloud giants taking on massive debt. Studies show 95% of firms investing in generative AI see zero return. A deeper concern is the circular financial structure between cloud providers (hyperscalers) and AI labs like OpenAI and Anthropic. Billions in cloud service commitments are tied to these labs, which are partly funded by the hyperscalers' own investment. This creates a loop where cloud revenue depends on labs securing contin...

Author: Bao Yilong

Source: Wall Street News

The justification for corporate AI spending is facing a severe test, as Token consumption continues to climb, yet quantifiable commercial value remains elusive.

On May 22, Uber's Chief Operating Officer Andrew Macdonald, whose company is valued at over $200 billion, stated publicly on a podcast that the link between the growth in token consumption and substantial product improvement "doesn't exist yet."

Macdonald pointed out that companies are finding it increasingly difficult to rationalize the continuously rising AI expenditures. He even coined a term for the wasteful phenomenon within engineering teams: "tokenmaxxing."

Earlier in mid-May, Microsoft began cutting internal Claude Code licenses, citing token bills as "unsustainable."

The combination of these two events forces the market to confront a previously overlooked variable. Token economics, specifically the unit economics of token consumption at enterprise scale, has evolved from a peripheral issue to the central load-bearing pillar of the entire AI investment thesis.

Five Data Points, Painting a New Picture

Since April, multiple data points have emerged successively, collectively sketching an alarming picture.

In April this year, Uber's Chief Technology Officer publicly stated that the company had burned through its annual Claude Code budget in just four months.

Among 5,000 engineers, monthly usage rates ranged from 84% to 95%, with individual monthly bills varying from $150 to $2,000. The CTO himself reportedly consumed $1,200 worth of tokens during a two-hour internal demonstration.

Macdonald described being "speechless" upon hearing this number.

Regarding Microsoft, according to a report in The Verge's Tom Warren's Notepad newsletter, Claude Code quickly became popular among Microsoft's internal engineering teams. However, the token-based billing model made scaled spending unsustainable, prompting Microsoft to proceed with cutting related licenses.

GitHub announced that starting June 1, all Copilot plans would shift from a fixed subscription model to usage-based billing.

The official discussion thread garnered nearly 900 downvotes, as users calculated that a single AI programming session typically consumes $30 to $40, meaning a $10 monthly subscription could be exhausted in a single use.

Developer productivity platform Entelligence.AI aggregated data from 2,444 companies and found:

For every $1 spent on AI token costs, only 18 cents generated actual value reaching users.
44 cents were used to fix bugs introduced by the AI itself; 27 cents went to rework; 11 cents were consumed by review friction.

According to Bloomberg's Silicon Data LLM Token Expenditure Index, token prices have risen about 65% since the end of February this year, and US AI software prices have increased by 20% to 37% cumulatively over the past year.

Bull vs. Bear Debate: One Fact, Two Interpretations

The same data points to starkly different conclusions under different analytical frameworks.

The bullish view argues that the current chaos is merely the growing pains of a successful transformation.

According to Goldman Sachs' Jim Schneider in early May, by 2030, agentic AI will drive a 24-fold increase in token consumption, reaching approximately 120 sextillion tokens per month. The gross margins of hyperscale cloud providers and model vendors will turn positive within the next 3 to 12 months.

Goldman's Rich Privorotsky believes that Q1 2026 might have been the peak for "token maximization" as a KPI. The industry is shifting from pursuing consumption volume to the healthier metric of "cost per effective action."

JP Morgan's economic research also found a jump in new and updated Python packages on PyPI in early 2026, a trend not seen when ChatGPT launched in 2022, indicating that real productivity gains are occurring.

Furthermore, the Magnificent 7 currently trades at about 20 times forward earnings, far below the 52 times at the peak of the 2000 tech bubble, 67 times for Japan in 1989, and 34 times during the "Nifty Fifty" era. By historical bubble standards, this does not constitute a bubble.

The bearish view was most systematically articulated by Goldman Sachs semiconductor analyst Jim Covello in an April report.

He pointed out that almost all value in the AI supply chain flows to semiconductor companies, a phenomenon unprecedented and unsustainable in history. Chip companies should benefit when their customers benefit, but in this cycle, their prosperity comes at the expense of consumption across the entire upstream industry chain.

Nvidia's net profit has grown about 20-fold since ChatGPT's launch; major hyperscale cloud providers have burned through their operating cash flow and are turning to debt—data center-related debt issuance in 2025 was approximately $182 billion, doubling from 2024.

MIT Nanda research shows 95% of enterprises investing in generative AI see zero return. This decoupling may persist for a while, but cannot last forever.

Concerns of the Circular Financing Structure

This discussion touches on a more complex level: the financial loop between hyperscale cloud providers and AI labs.

According to corporate disclosure documents compiled by The Information, OpenAI and Anthropic account for more than half of the approximately $2 trillion in future cloud service commitments from Microsoft, Oracle, Google, and Amazon. Specifically:

Of Microsoft's $627 billion cloud service backlog, $280 billion is tied to OpenAI;
Of Oracle's $553 billion pipeline business, 54% (approx. $300 billion) is committed by OpenAI;
Of Google's $467.6 billion, Anthropic accounts for 43% (approx. $200 billion);
Amazon's corresponding exposure also reaches 51% of its $464 billion backlog.

This financing structure is inherently circular. Microsoft's $13 billion investment in OpenAI was largely delivered in the form of Azure credits, which OpenAI used to purchase Azure compute. Microsoft then booked this as cloud revenue.

The same hyperscale cloud providers are both equity investors in the AI labs and service providers collecting compute bills.

This structure is also reflected in profit data. Alphabet reported a record Q1 profit of $62.6 billion, of which about $28.7 billion, nearly half, came from the paper appreciation of its Anthropic stake.

Amazon's Q1 profit of $30.3 billion included $16.8 billion in pre-tax unrealized gains from Anthropic, while its free cash flow plummeted 95% to $1.2 billion due to data center capital expenditures of $44.2 billion in the same period.

The sustainability of this system depends on AI labs' continued ability to secure external financing to fulfill cloud computing commitments, which in turn relies on enterprise customers' continued willingness to pay rising token bills.

It is reported that Anthropic currently incurs costs of $3 for every $1 of revenue. Once the pace of financing slows, the credibility of cloud revenue projections will decline, and the valuation multiples of hyperscale cloud vendors will also face re-evaluation pressure.

This chain transmits in both directions and will break in both directions.

This Isn't 1999, But the Problem is Real

The current situation does not constitute a typical bubble setup.

From a valuation multiple perspective, the Tech 7 currently trades at about 20 times forward price-to-earnings, far below the 52 times at the peak of the 2000 tech bubble, 67 times for the Japanese market in 1989, or the 34 times during the "Nifty Fifty" era.

AI technology itself is real. For heavy user groups, data on productivity gains is verifiable. OpenAI has an annualized revenue of about $20 billion, Anthropic about $4.3 billion; these two labs are not going to disappear.

Today, token cost (compute expense) has become the key determinant of AI success or failure. Six months ago, people weren't even discussing this topic.

Back then, people only cared about "whether the technology works." Now the answer is clear: in the eyes of specific jobs and specific people, the technology indeed works.

But a new question arises: Can the money saved by downstream companies using AI be transmitted upward in time to outrun the valuation window the capital market has left for AI labs and cloud giants?

Those bullish on AI believe that as long as the technology continues to mature, corporate ROI (Return on Investment) will turn positive within 1 to 1.5 years.

The bearish believe more executives will follow Macdonald's lead, publicly complaining about low AI ROI and starting to cut budgets.

Both scenarios are playing out; the outcome is undecided. The only certainty is that the old lie—"as long as token consumption is rising, it means the AI transformation is successful"—has been shattered.

High token consumption does not equal commercial value; this bubble must eventually be squeezed out. The bill for AI has come due, but who will ultimately pay for it? That remains an unknown for now.

Snowflake's stock price surges 33%, AI infrastructure expands from chips to the data layer

Snowflake's stock surged over 33% after the company raised its full-year revenue forecast and announced a $6 billion, five-year partnership with AWS. This agreement, which secures AWS Graviton chip supply, underscores Snowflake's evolving role in the AI infrastructure landscape. The market's reaction reflects a reassessment of data platforms as critical enablers for enterprise AI adoption. As companies move beyond conceptual AI to building operational, data-integrated AI workflows, demand for robust data storage, processing, and analysis capabilities is increasing. The deal strengthens AWS's custom chip ecosystem while positioning Snowflake not just as a data warehouse, but as a key data layer for deploying and managing AI applications. Analysts have rapidly revised their valuations upwards, signaling a shift in sentiment where proven AI-driven revenue growth is now commanding premium multiples.

marsbit18m ago

Snowflake's stock price surges 33%, AI infrastructure expands from chips to the data layer

marsbit18m ago

Ripple Makes New Demands From SEC, What Are They Asking For?

Ripple has sent a letter to the U.S. SEC's Crypto Task Force requesting regulatory clarity on payment stablecoins and tokenized securities. This follows up on a meeting held in March. Key demands include clarification on how stablecoins should be treated as collateral on balance sheets under amended Rule 15c3-1, and defining "Qualified Payment Stablecoins" for custody under Rule 15c3-3. Ripple also asked for confirmation that other major crypto non-securities, beyond Bitcoin and Ethereum, receive equivalent regulatory treatment, suggesting a revision to related SEC FAQs. Furthermore, the firm proposed a 0% haircut for stablecoins under certain conditions and requested that on-chain registries be designated as the single authoritative legal record for ownership, eliminating ambiguity in digital twin structures. In a separate reaction, Ripple CEO Brad Garlinghouse declared the "anti-crypto army" defeated, citing support from courts, voters, and former U.S. President Donald Trump, who criticized former SEC Chair Gary Gensler and vowed to codify pro-crypto legislation.

bitcoinist1h ago

Ripple Makes New Demands From SEC, What Are They Asking For?

bitcoinist1h ago

Trump Rejects Iran Deal — Bitcoin Reacts With Sharp Drop Below $74K

A fabricated report from Iranian state media about a tentative US-Iran deal triggered a sharp sell-off in Bitcoin, briefly pushing its price below $74,000. The report, detailing a supposed memorandum of understanding to ease tensions and restore shipping through the Strait of Hormuz, caused Bitcoin to drop from above $75,000 to a daily low near $73,200. The White House quickly dismissed the report as a "complete fabrication," and President Trump reinforced that the US was unsatisfied with any Iranian proposals. This highlights the heightened sensitivity of cryptocurrency markets to geopolitical news, especially developments affecting global shipping and energy supplies, as traders reacted swiftly to the conflicting signals.

bitcoinist4h ago

Trump Rejects Iran Deal — Bitcoin Reacts With Sharp Drop Below $74K

bitcoinist4h ago

Bitcoin Bulls Face Growing Market Headwinds

Bitcoin briefly reclaimed $72,500, triggering $342 million in long liquidations. Analysts note mounting bearish pressure ahead of a $9 billion monthly options expiry, with put options significantly outnumbering calls. On-chain data reveals a worrying shift: large "whale" balances are stagnating or declining, while "dolphin" accumulation has slowed sharply—a pattern historically linked to sustained price weakness. The long-term holder supply has reached a record high, suggesting a lack of new buyers entering the market amid deteriorating macroeconomic and geopolitical conditions.

TheNewsCrypto6h ago

Bitcoin Bulls Face Growing Market Headwinds

TheNewsCrypto6h ago

Biology's Paradigm Shift: Zuckerberg's New Open-Source Model Completely Overturns Google's AlphaFold Throne

The AlphaFold era faces a major challenge. A new open-source AI model, ESMFold2, from Meta CEO Mark Zuckerberg's Biohub, has been released alongside a massive database of 11 billion predicted protein structures—surpassing the AlphaFold database by 8 billion entries. Published in Nature, the model is reported to outperform AlphaFold3 in key areas, particularly in predicting protein complexes. Crucially, it is fully open-source with no commercial restrictions. ESMFold2 takes a different technical approach, building on a protein language model trained on billions of sequences, including microbial data from diverse environments like soil and ocean—areas less covered by AlphaFold. The team validated its utility by designing and successfully synthesizing novel functional proteins in the lab. The decision to open-source everything is seen as a strategic move, similar to Meta's approach with its Llama models, aiming to build an ecosystem and accelerate global research. While scientists welcome the resource, some urge caution, noting the need for independent validation of predictions and questioning its performance on entirely novel protein folds. The development signals intensified competition in protein AI, rapidly evolving much like the large language model field, and represents a significant step forward in using AI to decode and engineer the machinery of life.

marsbit6h ago

Biology's Paradigm Shift: Zuckerberg's New Open-Source Model Completely Overturns Google's AlphaFold Throne

marsbit6h ago

Trading

Spot

Futures

Hot Articles

Beoble: A Social App for Web3 People

Beoble is a communication infrastructure and ecosystem.

33.7k Total ViewsPublished 2024.03.13Updated 2024.03.13

How to Buy PEOPLE

Welcome to HTX.com! We've made purchasing ConstitutionDAO (PEOPLE) simple and convenient. Follow our step-by-step guide to embark on your crypto journey.Step 1: Create Your HTX AccountUse your email or phone number to sign up for a free account on HTX. Experience a hassle-free registration journey and unlock all features.Get My AccountStep 2: Go to Buy Crypto and Choose Your Payment MethodCredit/Debit Card: Use your Visa or Mastercard to buy ConstitutionDAO (PEOPLE) instantly.Balance: Use funds from your HTX account balance to trade seamlessly.Third Parties: We've added popular payment methods such as Google Pay and Apple Pay to enhance convenience.P2P: Trade directly with other users on HTX.Over-the-Counter (OTC): We offer tailor-made services and competitive exchange rates for traders.Step 3: Store Your ConstitutionDAO (PEOPLE)After purchasing your ConstitutionDAO (PEOPLE), store it in your HTX account. Alternatively, you can send it elsewhere via blockchain transfer or use it to trade other cryptocurrencies.Step 4: Trade ConstitutionDAO (PEOPLE)Easily trade ConstitutionDAO (PEOPLE) on HTX's spot market. Simply access your account, select your trading pair, execute your trades, and monitor in real-time. We offer a user-friendly experience for both beginners and seasoned traders.

6.9k Total ViewsPublished 2024.03.29Updated 2025.03.21

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of PEOPLE (PEOPLE) are presented below.