When Tokens Cost More Than People, 'AI Narrative' Runs Into Trouble

marsbitPublished on 2026-05-29Last updated on 2026-05-29

Abstract

Title: When Tokens Cost More Than People, the "AI Narrative" Hits Trouble The economic sustainability of corporate AI adoption is under scrutiny as token consumption soars while measurable business value remains elusive. Major companies like Uber and Microsoft report struggling to justify rising AI costs, with executives coining terms like "tokenmaxxing" to describe wasteful usage. Data reveals a stark picture: for every dollar spent on AI tokens, only 18 cents translates to user-facing value, with the rest consumed by bug fixes, rework, and friction. The debate splits into bullish and bearish camps. Bulls, like Goldman Sachs analysts, see current inefficiencies as growing pains, predicting a 24-fold increase in token demand by 2030 and a shift towards healthier metrics like "cost per effective action." They point to indicators of real productivity gains and argue current tech valuations are not in bubble territory. Bears, however, highlight an unsustainable model where value is heavily concentrated in semiconductor companies like Nvidia, funded by cloud giants taking on massive debt. Studies show 95% of firms investing in generative AI see zero return. A deeper concern is the circular financial structure between cloud providers (hyperscalers) and AI labs like OpenAI and Anthropic. Billions in cloud service commitments are tied to these labs, which are partly funded by the hyperscalers' own investment. This creates a loop where cloud revenue depends on labs securing contin...

Author: Bao Yilong

Source: Wall Street News

The justification for corporate AI spending is facing a severe test, as Token consumption continues to climb, yet quantifiable commercial value remains elusive.

On May 22, Uber's Chief Operating Officer Andrew Macdonald, whose company is valued at over $200 billion, stated publicly on a podcast that the link between the growth in token consumption and substantial product improvement "doesn't exist yet."

Macdonald pointed out that companies are finding it increasingly difficult to rationalize the continuously rising AI expenditures. He even coined a term for the wasteful phenomenon within engineering teams: "tokenmaxxing."

Earlier in mid-May, Microsoft began cutting internal Claude Code licenses, citing token bills as "unsustainable."

The combination of these two events forces the market to confront a previously overlooked variable. Token economics, specifically the unit economics of token consumption at enterprise scale, has evolved from a peripheral issue to the central load-bearing pillar of the entire AI investment thesis.

Five Data Points, Painting a New Picture

Since April, multiple data points have emerged successively, collectively sketching an alarming picture.

In April this year, Uber's Chief Technology Officer publicly stated that the company had burned through its annual Claude Code budget in just four months.

Among 5,000 engineers, monthly usage rates ranged from 84% to 95%, with individual monthly bills varying from $150 to $2,000. The CTO himself reportedly consumed $1,200 worth of tokens during a two-hour internal demonstration.

Macdonald described being "speechless" upon hearing this number.

Regarding Microsoft, according to a report in The Verge's Tom Warren's Notepad newsletter, Claude Code quickly became popular among Microsoft's internal engineering teams. However, the token-based billing model made scaled spending unsustainable, prompting Microsoft to proceed with cutting related licenses.

GitHub announced that starting June 1, all Copilot plans would shift from a fixed subscription model to usage-based billing.

The official discussion thread garnered nearly 900 downvotes, as users calculated that a single AI programming session typically consumes $30 to $40, meaning a $10 monthly subscription could be exhausted in a single use.

Developer productivity platform Entelligence.AI aggregated data from 2,444 companies and found:

  • For every $1 spent on AI token costs, only 18 cents generated actual value reaching users.
  • 44 cents were used to fix bugs introduced by the AI itself; 27 cents went to rework; 11 cents were consumed by review friction.

According to Bloomberg's Silicon Data LLM Token Expenditure Index, token prices have risen about 65% since the end of February this year, and US AI software prices have increased by 20% to 37% cumulatively over the past year.

Bull vs. Bear Debate: One Fact, Two Interpretations

The same data points to starkly different conclusions under different analytical frameworks.

The bullish view argues that the current chaos is merely the growing pains of a successful transformation.

According to Goldman Sachs' Jim Schneider in early May, by 2030, agentic AI will drive a 24-fold increase in token consumption, reaching approximately 120 sextillion tokens per month. The gross margins of hyperscale cloud providers and model vendors will turn positive within the next 3 to 12 months.

Goldman's Rich Privorotsky believes that Q1 2026 might have been the peak for "token maximization" as a KPI. The industry is shifting from pursuing consumption volume to the healthier metric of "cost per effective action."

JP Morgan's economic research also found a jump in new and updated Python packages on PyPI in early 2026, a trend not seen when ChatGPT launched in 2022, indicating that real productivity gains are occurring.

Furthermore, the Magnificent 7 currently trades at about 20 times forward earnings, far below the 52 times at the peak of the 2000 tech bubble, 67 times for Japan in 1989, and 34 times during the "Nifty Fifty" era. By historical bubble standards, this does not constitute a bubble.

The bearish view was most systematically articulated by Goldman Sachs semiconductor analyst Jim Covello in an April report.

He pointed out that almost all value in the AI supply chain flows to semiconductor companies, a phenomenon unprecedented and unsustainable in history. Chip companies should benefit when their customers benefit, but in this cycle, their prosperity comes at the expense of consumption across the entire upstream industry chain.

Nvidia's net profit has grown about 20-fold since ChatGPT's launch; major hyperscale cloud providers have burned through their operating cash flow and are turning to debt—data center-related debt issuance in 2025 was approximately $182 billion, doubling from 2024.

MIT Nanda research shows 95% of enterprises investing in generative AI see zero return. This decoupling may persist for a while, but cannot last forever.

Concerns of the Circular Financing Structure

This discussion touches on a more complex level: the financial loop between hyperscale cloud providers and AI labs.

According to corporate disclosure documents compiled by The Information, OpenAI and Anthropic account for more than half of the approximately $2 trillion in future cloud service commitments from Microsoft, Oracle, Google, and Amazon. Specifically:

  • Of Microsoft's $627 billion cloud service backlog, $280 billion is tied to OpenAI;
  • Of Oracle's $553 billion pipeline business, 54% (approx. $300 billion) is committed by OpenAI;
  • Of Google's $467.6 billion, Anthropic accounts for 43% (approx. $200 billion);
  • Amazon's corresponding exposure also reaches 51% of its $464 billion backlog.

This financing structure is inherently circular. Microsoft's $13 billion investment in OpenAI was largely delivered in the form of Azure credits, which OpenAI used to purchase Azure compute. Microsoft then booked this as cloud revenue.

The same hyperscale cloud providers are both equity investors in the AI labs and service providers collecting compute bills.

This structure is also reflected in profit data. Alphabet reported a record Q1 profit of $62.6 billion, of which about $28.7 billion, nearly half, came from the paper appreciation of its Anthropic stake.

Amazon's Q1 profit of $30.3 billion included $16.8 billion in pre-tax unrealized gains from Anthropic, while its free cash flow plummeted 95% to $1.2 billion due to data center capital expenditures of $44.2 billion in the same period.

The sustainability of this system depends on AI labs' continued ability to secure external financing to fulfill cloud computing commitments, which in turn relies on enterprise customers' continued willingness to pay rising token bills.

It is reported that Anthropic currently incurs costs of $3 for every $1 of revenue. Once the pace of financing slows, the credibility of cloud revenue projections will decline, and the valuation multiples of hyperscale cloud vendors will also face re-evaluation pressure.

This chain transmits in both directions and will break in both directions.

This Isn't 1999, But the Problem is Real

The current situation does not constitute a typical bubble setup.

From a valuation multiple perspective, the Tech 7 currently trades at about 20 times forward price-to-earnings, far below the 52 times at the peak of the 2000 tech bubble, 67 times for the Japanese market in 1989, or the 34 times during the "Nifty Fifty" era.

AI technology itself is real. For heavy user groups, data on productivity gains is verifiable. OpenAI has an annualized revenue of about $20 billion, Anthropic about $4.3 billion; these two labs are not going to disappear.

Today, token cost (compute expense) has become the key determinant of AI success or failure. Six months ago, people weren't even discussing this topic.

Back then, people only cared about "whether the technology works." Now the answer is clear: in the eyes of specific jobs and specific people, the technology indeed works.

But a new question arises: Can the money saved by downstream companies using AI be transmitted upward in time to outrun the valuation window the capital market has left for AI labs and cloud giants?

Those bullish on AI believe that as long as the technology continues to mature, corporate ROI (Return on Investment) will turn positive within 1 to 1.5 years.

The bearish believe more executives will follow Macdonald's lead, publicly complaining about low AI ROI and starting to cut budgets.

Both scenarios are playing out; the outcome is undecided. The only certainty is that the old lie—"as long as token consumption is rising, it means the AI transformation is successful"—has been shattered.

High token consumption does not equal commercial value; this bubble must eventually be squeezed out. The bill for AI has come due, but who will ultimately pay for it? That remains an unknown for now.

Related Questions

QAccording to the article, what is the major problem that enterprise AI spending is currently facing?

AThe major problem is that token consumption is rapidly increasing, but quantifiable business value is hard to find. The article states that 'the line between the growth of token consumption and substantive product improvement... does not yet exist.' Executives are finding it difficult to justify the escalating costs.

QWhat key finding did the developer platform Entelligence.AI discover regarding the value generated from AI token spending?

AEntelligence.AI found that for every dollar spent on AI token fees, only 18 cents generated tangible value that reached end-users. The rest was consumed by other costs: 44 cents for fixing AI-introduced bugs, 27 cents for rework, and 11 cents for review friction.

QWhat is the critical concern regarding the financial structure between hyperscale cloud providers and AI labs, as described in the article?

AThe concern is a potentially unsustainable, cyclical financing structure. Hyperscale cloud providers (like Microsoft, Amazon) are both equity investors in and service providers for AI labs (like OpenAI, Anthropic). The labs use cloud credits from the investments to purchase cloud compute, which the providers book as revenue. This structure depends on continuous external funding for the labs, which itself relies on enterprise clients' willingness to pay rising token bills.

QBased on the bull argument presented, what metric is the AI industry supposedly shifting towards from 'tokenmaxxing'?

AAccording to the bull argument, the industry is shifting from focusing on 'tokenmaxxing' (maximizing token consumption as a KPI) towards a healthier metric: the 'cost per effective action' or the return on investment (ROI) of AI deployments.

QWhat does the article conclude is the 'new question' now that the technical capability of AI is proven for specific tasks?

AThe new question is: 'Can the money saved by downstream companies using AI be transmitted upwards quickly enough to outpace the valuation window that capital markets have left for AI labs and cloud giants?' In other words, can the business value and cost savings materialize fast enough to justify the high costs and valuations before investor patience runs out?

Related Reads

Claude Code Introduces Dynamic Workflows: Enabling AI to Form Teams and Collaborate

Claude Code introduces dynamic workflows, enabling AI to coordinate teams of specialized agents for complex tasks. This transforms Claude from a code assistant into a programmable workbench. Workflows address key limitations of single-agent systems: agentic laziness (premature task completion), self-preferential bias (favoring own outputs), and goal drift (losing sight of original objectives). The system allows Claude to dynamically create execution frameworks using JavaScript. It can split tasks, dispatch parallel agents for isolated work (e.g., in separate worktrees), implement adversarial validation, run tournaments, and synthesize results. This multi-agent approach is valuable for tasks requiring deep research, factual verification, code migration, root cause analysis, large-scale triage, and qualitative sorting. Key patterns include: classify-and-route, fan-out-and-synthesize, adversarial verification, generate-and-filter, tournaments, and loop-until-done. While token usage is higher, workflows excel where tasks resemble programming—needing problem decomposition, isolated context, hypothesis testing, and handling many details. They extend Claude Code's utility beyond technical work to areas like business plan review, resume screening, and naming brainstorm. The feature is not a universal solution but points to a future where AI tool competitiveness depends on organizing reliable, reusable, and auditable execution flows for complex goals.

marsbit14m ago

Claude Code Introduces Dynamic Workflows: Enabling AI to Form Teams and Collaborate

marsbit14m ago

Hyperliquid, Wall Street's 24/7 Trading Convenience Store

Hyperliquid: The 24/7 Trading "Convenience Store" for Wall Street Hyperliquid, a decentralized cryptocurrency exchange, has become a go-to platform for Wall Street traders seeking to trade around the clock, especially during traditional market closures. Founded by Jeff Yan, a former quantitative trader, after the FTX collapse, the platform emphasizes user self-custody of assets. It offers a wide range of perpetual contracts—leveraged derivatives with no expiry—on assets from Bitcoin and crude oil to the S&P 500 and even pre-IPO companies like SpaceX. A notable example involves a hedge fund trader who capitalized on geopolitical news over a weekend, securing a 243% return on oil derivatives before markets reopened. The platform, run by just 11 employees, generated approximately $800 million in revenue last year, and its native token HYPE has seen significant growth. Its rise highlights the merging of traditional finance and crypto. While U.S. users are currently restricted, recent CFTC rule changes could open access. The platform is known for its transparency, having processed $10 billion in liquidations during a market crash while competitors faltered. Regulators warn of the high risks and complexity of perpetual contracts for retail investors. Key to its appeal is a strong community culture, direct engagement with founders, and a simple interface. Despite rules against VPN use, it attracts global users with its permissionless approach. Hyperliquid plans to expand into prediction markets and options, aiming to eventually host all financial activity.

marsbit14m ago

Hyperliquid, Wall Street's 24/7 Trading Convenience Store

marsbit14m ago

Who Funds the Agents?

**Summary: Who Funds AI Agents?** OpenAI recently shut down a feature allowing AI agents to shop for users, highlighting the challenge of creating a secure and regulated environment for agent-driven transactions. While payment infrastructure exists, a crucial governance layer—defining spending limits, fraud detection, tax handling, and return policies—is largely missing. The potential is enormous: AI agents already processed $73M across 176M transactions last year, with McKinsey forecasting this could grow to $3-5T in global consumer commerce by 2030. The core competition isn't just about processing payments, which can be very cheap (especially with crypto-based settlement), but about controlling the rules that govern agent spending. Key players like Stripe and Coinbase are racing to dominate this governance layer. Stripe's acquisition of wallet provider Privy allows it to set spending policies, identity checks, and human-in-the-loop approvals directly at the wallet level. Similarly, Coinbase's stack, including its x402 protocol and AgentKit, embeds governance rules. This vertical integration across settlement, wallet, and governance layers is becoming the dominant strategy. Control over the governance layer is where significant future value lies. If agents handle trillions in transactions, even a small fee for managing compliance, fraud prevention, and policy enforcement could generate billions in annual revenue. The companies that successfully integrate across the payment stack will capture value from idle agent balances, transaction fees, and governance services, positioning themselves as the foundational banks of the AI agent economy.

marsbit41m ago

Who Funds the Agents?

marsbit41m ago

Trading

Spot
Futures

Hot Articles

How to Buy PEOPLE

Welcome to HTX.com! We've made purchasing ConstitutionDAO (PEOPLE) simple and convenient. Follow our step-by-step guide to embark on your crypto journey.Step 1: Create Your HTX AccountUse your email or phone number to sign up for a free account on HTX. Experience a hassle-free registration journey and unlock all features.Get My AccountStep 2: Go to Buy Crypto and Choose Your Payment MethodCredit/Debit Card: Use your Visa or Mastercard to buy ConstitutionDAO (PEOPLE) instantly.Balance: Use funds from your HTX account balance to trade seamlessly.Third Parties: We've added popular payment methods such as Google Pay and Apple Pay to enhance convenience.P2P: Trade directly with other users on HTX.Over-the-Counter (OTC): We offer tailor-made services and competitive exchange rates for traders.Step 3: Store Your ConstitutionDAO (PEOPLE)After purchasing your ConstitutionDAO (PEOPLE), store it in your HTX account. Alternatively, you can send it elsewhere via blockchain transfer or use it to trade other cryptocurrencies.Step 4: Trade ConstitutionDAO (PEOPLE)Easily trade ConstitutionDAO (PEOPLE) on HTX's spot market. Simply access your account, select your trading pair, execute your trades, and monitor in real-time. We offer a user-friendly experience for both beginners and seasoned traders.

6.9k Total ViewsPublished 2024.03.29Updated 2026.06.02

How to Buy PEOPLE

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of PEOPLE (PEOPLE) are presented below.

活动图片