When Tokens Cost More Than People, 'AI Narrative' Runs Into Trouble

marsbitОпубликовано 2026-05-29Обновлено 2026-05-29

Введение

Title: When Tokens Cost More Than People, the "AI Narrative" Hits Trouble The economic sustainability of corporate AI adoption is under scrutiny as token consumption soars while measurable business value remains elusive. Major companies like Uber and Microsoft report struggling to justify rising AI costs, with executives coining terms like "tokenmaxxing" to describe wasteful usage. Data reveals a stark picture: for every dollar spent on AI tokens, only 18 cents translates to user-facing value, with the rest consumed by bug fixes, rework, and friction. The debate splits into bullish and bearish camps. Bulls, like Goldman Sachs analysts, see current inefficiencies as growing pains, predicting a 24-fold increase in token demand by 2030 and a shift towards healthier metrics like "cost per effective action." They point to indicators of real productivity gains and argue current tech valuations are not in bubble territory. Bears, however, highlight an unsustainable model where value is heavily concentrated in semiconductor companies like Nvidia, funded by cloud giants taking on massive debt. Studies show 95% of firms investing in generative AI see zero return. A deeper concern is the circular financial structure between cloud providers (hyperscalers) and AI labs like OpenAI and Anthropic. Billions in cloud service commitments are tied to these labs, which are partly funded by the hyperscalers' own investment. This creates a loop where cloud revenue depends on labs securing contin...

Author: Bao Yilong

Source: Wall Street News

The justification for corporate AI spending is facing a severe test, as Token consumption continues to climb, yet quantifiable commercial value remains elusive.

On May 22, Uber's Chief Operating Officer Andrew Macdonald, whose company is valued at over $200 billion, stated publicly on a podcast that the link between the growth in token consumption and substantial product improvement "doesn't exist yet."

Macdonald pointed out that companies are finding it increasingly difficult to rationalize the continuously rising AI expenditures. He even coined a term for the wasteful phenomenon within engineering teams: "tokenmaxxing."

Earlier in mid-May, Microsoft began cutting internal Claude Code licenses, citing token bills as "unsustainable."

The combination of these two events forces the market to confront a previously overlooked variable. Token economics, specifically the unit economics of token consumption at enterprise scale, has evolved from a peripheral issue to the central load-bearing pillar of the entire AI investment thesis.

Five Data Points, Painting a New Picture

Since April, multiple data points have emerged successively, collectively sketching an alarming picture.

In April this year, Uber's Chief Technology Officer publicly stated that the company had burned through its annual Claude Code budget in just four months.

Among 5,000 engineers, monthly usage rates ranged from 84% to 95%, with individual monthly bills varying from $150 to $2,000. The CTO himself reportedly consumed $1,200 worth of tokens during a two-hour internal demonstration.

Macdonald described being "speechless" upon hearing this number.

Regarding Microsoft, according to a report in The Verge's Tom Warren's Notepad newsletter, Claude Code quickly became popular among Microsoft's internal engineering teams. However, the token-based billing model made scaled spending unsustainable, prompting Microsoft to proceed with cutting related licenses.

GitHub announced that starting June 1, all Copilot plans would shift from a fixed subscription model to usage-based billing.

The official discussion thread garnered nearly 900 downvotes, as users calculated that a single AI programming session typically consumes $30 to $40, meaning a $10 monthly subscription could be exhausted in a single use.

Developer productivity platform Entelligence.AI aggregated data from 2,444 companies and found:

  • For every $1 spent on AI token costs, only 18 cents generated actual value reaching users.
  • 44 cents were used to fix bugs introduced by the AI itself; 27 cents went to rework; 11 cents were consumed by review friction.

According to Bloomberg's Silicon Data LLM Token Expenditure Index, token prices have risen about 65% since the end of February this year, and US AI software prices have increased by 20% to 37% cumulatively over the past year.

Bull vs. Bear Debate: One Fact, Two Interpretations

The same data points to starkly different conclusions under different analytical frameworks.

The bullish view argues that the current chaos is merely the growing pains of a successful transformation.

According to Goldman Sachs' Jim Schneider in early May, by 2030, agentic AI will drive a 24-fold increase in token consumption, reaching approximately 120 sextillion tokens per month. The gross margins of hyperscale cloud providers and model vendors will turn positive within the next 3 to 12 months.

Goldman's Rich Privorotsky believes that Q1 2026 might have been the peak for "token maximization" as a KPI. The industry is shifting from pursuing consumption volume to the healthier metric of "cost per effective action."

JP Morgan's economic research also found a jump in new and updated Python packages on PyPI in early 2026, a trend not seen when ChatGPT launched in 2022, indicating that real productivity gains are occurring.

Furthermore, the Magnificent 7 currently trades at about 20 times forward earnings, far below the 52 times at the peak of the 2000 tech bubble, 67 times for Japan in 1989, and 34 times during the "Nifty Fifty" era. By historical bubble standards, this does not constitute a bubble.

The bearish view was most systematically articulated by Goldman Sachs semiconductor analyst Jim Covello in an April report.

He pointed out that almost all value in the AI supply chain flows to semiconductor companies, a phenomenon unprecedented and unsustainable in history. Chip companies should benefit when their customers benefit, but in this cycle, their prosperity comes at the expense of consumption across the entire upstream industry chain.

Nvidia's net profit has grown about 20-fold since ChatGPT's launch; major hyperscale cloud providers have burned through their operating cash flow and are turning to debt—data center-related debt issuance in 2025 was approximately $182 billion, doubling from 2024.

MIT Nanda research shows 95% of enterprises investing in generative AI see zero return. This decoupling may persist for a while, but cannot last forever.

Concerns of the Circular Financing Structure

This discussion touches on a more complex level: the financial loop between hyperscale cloud providers and AI labs.

According to corporate disclosure documents compiled by The Information, OpenAI and Anthropic account for more than half of the approximately $2 trillion in future cloud service commitments from Microsoft, Oracle, Google, and Amazon. Specifically:

  • Of Microsoft's $627 billion cloud service backlog, $280 billion is tied to OpenAI;
  • Of Oracle's $553 billion pipeline business, 54% (approx. $300 billion) is committed by OpenAI;
  • Of Google's $467.6 billion, Anthropic accounts for 43% (approx. $200 billion);
  • Amazon's corresponding exposure also reaches 51% of its $464 billion backlog.

This financing structure is inherently circular. Microsoft's $13 billion investment in OpenAI was largely delivered in the form of Azure credits, which OpenAI used to purchase Azure compute. Microsoft then booked this as cloud revenue.

The same hyperscale cloud providers are both equity investors in the AI labs and service providers collecting compute bills.

This structure is also reflected in profit data. Alphabet reported a record Q1 profit of $62.6 billion, of which about $28.7 billion, nearly half, came from the paper appreciation of its Anthropic stake.

Amazon's Q1 profit of $30.3 billion included $16.8 billion in pre-tax unrealized gains from Anthropic, while its free cash flow plummeted 95% to $1.2 billion due to data center capital expenditures of $44.2 billion in the same period.

The sustainability of this system depends on AI labs' continued ability to secure external financing to fulfill cloud computing commitments, which in turn relies on enterprise customers' continued willingness to pay rising token bills.

It is reported that Anthropic currently incurs costs of $3 for every $1 of revenue. Once the pace of financing slows, the credibility of cloud revenue projections will decline, and the valuation multiples of hyperscale cloud vendors will also face re-evaluation pressure.

This chain transmits in both directions and will break in both directions.

This Isn't 1999, But the Problem is Real

The current situation does not constitute a typical bubble setup.

From a valuation multiple perspective, the Tech 7 currently trades at about 20 times forward price-to-earnings, far below the 52 times at the peak of the 2000 tech bubble, 67 times for the Japanese market in 1989, or the 34 times during the "Nifty Fifty" era.

AI technology itself is real. For heavy user groups, data on productivity gains is verifiable. OpenAI has an annualized revenue of about $20 billion, Anthropic about $4.3 billion; these two labs are not going to disappear.

Today, token cost (compute expense) has become the key determinant of AI success or failure. Six months ago, people weren't even discussing this topic.

Back then, people only cared about "whether the technology works." Now the answer is clear: in the eyes of specific jobs and specific people, the technology indeed works.

But a new question arises: Can the money saved by downstream companies using AI be transmitted upward in time to outrun the valuation window the capital market has left for AI labs and cloud giants?

Those bullish on AI believe that as long as the technology continues to mature, corporate ROI (Return on Investment) will turn positive within 1 to 1.5 years.

The bearish believe more executives will follow Macdonald's lead, publicly complaining about low AI ROI and starting to cut budgets.

Both scenarios are playing out; the outcome is undecided. The only certainty is that the old lie—"as long as token consumption is rising, it means the AI transformation is successful"—has been shattered.

High token consumption does not equal commercial value; this bubble must eventually be squeezed out. The bill for AI has come due, but who will ultimately pay for it? That remains an unknown for now.

Связанные с этим вопросы

QAccording to the article, what is the major problem that enterprise AI spending is currently facing?

AThe major problem is that token consumption is rapidly increasing, but quantifiable business value is hard to find. The article states that 'the line between the growth of token consumption and substantive product improvement... does not yet exist.' Executives are finding it difficult to justify the escalating costs.

QWhat key finding did the developer platform Entelligence.AI discover regarding the value generated from AI token spending?

AEntelligence.AI found that for every dollar spent on AI token fees, only 18 cents generated tangible value that reached end-users. The rest was consumed by other costs: 44 cents for fixing AI-introduced bugs, 27 cents for rework, and 11 cents for review friction.

QWhat is the critical concern regarding the financial structure between hyperscale cloud providers and AI labs, as described in the article?

AThe concern is a potentially unsustainable, cyclical financing structure. Hyperscale cloud providers (like Microsoft, Amazon) are both equity investors in and service providers for AI labs (like OpenAI, Anthropic). The labs use cloud credits from the investments to purchase cloud compute, which the providers book as revenue. This structure depends on continuous external funding for the labs, which itself relies on enterprise clients' willingness to pay rising token bills.

QBased on the bull argument presented, what metric is the AI industry supposedly shifting towards from 'tokenmaxxing'?

AAccording to the bull argument, the industry is shifting from focusing on 'tokenmaxxing' (maximizing token consumption as a KPI) towards a healthier metric: the 'cost per effective action' or the return on investment (ROI) of AI deployments.

QWhat does the article conclude is the 'new question' now that the technical capability of AI is proven for specific tasks?

AThe new question is: 'Can the money saved by downstream companies using AI be transmitted upwards quickly enough to outpace the valuation window that capital markets have left for AI labs and cloud giants?' In other words, can the business value and cost savings materialize fast enough to justify the high costs and valuations before investor patience runs out?

Похожее

Bloomberg Uncovered: How Do China's Wealthy Circumvent the Annual $50,000 Limit to Transfer Assets?

**Summary: How Wealthy Chinese Circumvent $50,000 Annual Foreign Exchange Limits** Despite China's strict capital controls, including an annual $50,000 per person foreign exchange quota, an estimated $150 billion in funds still leaves the country annually via various gray and underground channels. This report outlines the evolution of China's "capital wall" and the methods used to bypass it. **The Evolving Capital Controls:** * **Foundation (1994):** The system of "current account convertibility with strict capital account controls" was established. * **Quota Set (2007):** The $50,000 individual annual forex purchase limit was formalized. * **Crackdown Begins (2015-2017):** Following market volatility, enforcement tightened. Banks were required to scrutinize transactions, and channels like using UnionPay cards for Hong Kong insurance premiums or buying overseas property were blocked. * **Digital & Legal Upgrades (2024-2026):** Enhanced algorithms now flag suspicious patterns (e.g., "smurfing"). The Common Reporting Standard (CRS) provides Chinese tax authorities with data on citizens' offshore accounts. Unlicensed cross-border brokers have been targeted. **Five Primary Methods for Moving Capital:** 1. **Underground Banking / "Hawala" (Duiqiao):** The largest-scale method. No money crosses borders. Clients pay RMB to a domestic account; an overseas associate deposits equivalent foreign currency into the client's offshore account. Risks include high fees, account freezes, and legal penalties. 2. **"Smurfing" or "Ant Moving":** Using multiple individuals' $50,000 quotas to pool funds for one offshore recipient. Increasingly detected by anti-money laundering algorithms. 3. **Trade Invoice Manipulation:** Businesses over-invoice imports or under-invoice exports via offshore shell companies, creating a pretext to transfer excess funds abroad under the guise of trade. 4. **Channel Migration:** After a crackdown on internet brokers, funds flow toward more compliant but costly channels like major banks' cross-border wealth management services or Qualified Domestic Institutional Investor (QDII) quotas. 5. **Structural Arrangements:** High-net-worth individuals use complex, high-cost legal structures involving offshore trusts, insurance, and investment migration programs to transfer asset ownership. **Regulatory Response: Focusing on People, Not Just Money** The current strategy extends oversight from enterprises to **individual residents**. Tools like CRS allow retroactive visibility into offshore assets. Cryptocurrencies, once seen as a potential loophole, are now actively monitored and prosecuted as an illegal channel. The underlying driver remains: with significant wealth concentrated among millions of affluent households seeking diversification amid domestic economic shifts, the incentive to move assets offshore persists despite regulatory barriers.

marsbit5 мин. назад

Bloomberg Uncovered: How Do China's Wealthy Circumvent the Annual $50,000 Limit to Transfer Assets?

marsbit5 мин. назад

Ethereum's Ballmer Moment: As Everyone Is Bearish, the Circulating Supply Is Disappearing

"Ethereum's Ballmer Moment: Circulation Shrinks Amid Bearish Sentiment" Amid widespread bearish sentiment, with prominent figures like Bankless founder David Hoffman selling ETH and young developers flocking to Solana, some argue Ethereum is entering its "Ballmer era"—akin to Microsoft's perceived stagnation under Steve Ballmer. While surface-level criticisms about slow protocol development, cautious leadership, and competitive pressure are valid, underlying fundamentals tell a different story. Approximately 30% of ETH is staked, major holders like BitMine are accumulating, and spot ETFs continue to absorb supply. Regulatory clarity, including the SEC/CFTC's March ruling on staking rewards and the potential passage of the CLARITY Act, is transforming crypto from a regulatory threat into a legitimized framework. This institutionalization, alongside a shrinking circulating supply (with net issuance around 0.23% annually), creates significant buy-side pressure independent of fee-based value capture. The broader crypto total addressable market is expanding through regulated stablecoins, tokenized assets, and institutional adoption. While public chains face competition from permissioned alternatives, the winning model appears to be permissioned assets settling on public chains like Ethereum and Solana. The author advocates a non-maximalist, barbell strategy: holding ETH for its institutional role and supply squeeze, SOL for consumer/throughput trends, BTC as a macro hedge, and a basket of next-gen L1s. Key bullish drivers for ETH include rapid circulation shrinkage, potential Q2 staked ETF approvals, regulatory tailwinds solidifying its role as a default settlement layer, and the optionality of an eventual "Satya moment" leadership shift. Despite bearish consensus, the current setup—where crypto is "not hot" and regulatory groundwork is being laid—presents a compelling investment opportunity. The crypto cycle's focus may have shifted to AI, but blockchain infrastructure is gaining a legal and institutional foothold precisely while attention is elsewhere.

marsbit5 мин. назад

Ethereum's Ballmer Moment: As Everyone Is Bearish, the Circulating Supply Is Disappearing

marsbit5 мин. назад

Claude Code Introduces Dynamic Workflows: Enabling AI to Form Teams and Collaborate

Claude Code introduces dynamic workflows, enabling AI to coordinate teams of specialized agents for complex tasks. This transforms Claude from a code assistant into a programmable workbench. Workflows address key limitations of single-agent systems: agentic laziness (premature task completion), self-preferential bias (favoring own outputs), and goal drift (losing sight of original objectives). The system allows Claude to dynamically create execution frameworks using JavaScript. It can split tasks, dispatch parallel agents for isolated work (e.g., in separate worktrees), implement adversarial validation, run tournaments, and synthesize results. This multi-agent approach is valuable for tasks requiring deep research, factual verification, code migration, root cause analysis, large-scale triage, and qualitative sorting. Key patterns include: classify-and-route, fan-out-and-synthesize, adversarial verification, generate-and-filter, tournaments, and loop-until-done. While token usage is higher, workflows excel where tasks resemble programming—needing problem decomposition, isolated context, hypothesis testing, and handling many details. They extend Claude Code's utility beyond technical work to areas like business plan review, resume screening, and naming brainstorm. The feature is not a universal solution but points to a future where AI tool competitiveness depends on organizing reliable, reusable, and auditable execution flows for complex goals.

marsbit46 мин. назад

Claude Code Introduces Dynamic Workflows: Enabling AI to Form Teams and Collaborate

marsbit46 мин. назад

Hyperliquid, Wall Street's 24/7 Trading Convenience Store

Hyperliquid: The 24/7 Trading "Convenience Store" for Wall Street Hyperliquid, a decentralized cryptocurrency exchange, has become a go-to platform for Wall Street traders seeking to trade around the clock, especially during traditional market closures. Founded by Jeff Yan, a former quantitative trader, after the FTX collapse, the platform emphasizes user self-custody of assets. It offers a wide range of perpetual contracts—leveraged derivatives with no expiry—on assets from Bitcoin and crude oil to the S&P 500 and even pre-IPO companies like SpaceX. A notable example involves a hedge fund trader who capitalized on geopolitical news over a weekend, securing a 243% return on oil derivatives before markets reopened. The platform, run by just 11 employees, generated approximately $800 million in revenue last year, and its native token HYPE has seen significant growth. Its rise highlights the merging of traditional finance and crypto. While U.S. users are currently restricted, recent CFTC rule changes could open access. The platform is known for its transparency, having processed $10 billion in liquidations during a market crash while competitors faltered. Regulators warn of the high risks and complexity of perpetual contracts for retail investors. Key to its appeal is a strong community culture, direct engagement with founders, and a simple interface. Despite rules against VPN use, it attracts global users with its permissionless approach. Hyperliquid plans to expand into prediction markets and options, aiming to eventually host all financial activity.

marsbit47 мин. назад

Hyperliquid, Wall Street's 24/7 Trading Convenience Store

marsbit47 мин. назад

Торговля

Спот
Фьючерсы

Популярные статьи

Как купить PEOPLE

Добро пожаловать на HTX.com! Мы сделали приобретение ConstitutionDAO (PEOPLE) простым и удобным. Следуйте нашему пошаговому руководству и отправляйтесь в свое крипто-путешествие.Шаг 1: Создайте аккаунт на HTXИспользуйте свой адрес электронной почты или номер телефона, чтобы зарегистрироваться и бесплатно создать аккаунт на HTX. Пройдите удобную регистрацию и откройте для себя весь функционал.Создать аккаунтШаг 2: Перейдите в Купить криптовалюту и выберите свой способ оплатыКредитная/Дебетовая Карта: Используйте свою карту Visa или Mastercard для мгновенной покупки ConstitutionDAO (PEOPLE).Баланс: Используйте средства с баланса вашего аккаунта HTX для простой торговли.Третьи Лица: Мы добавили популярные способы оплаты, такие как Google Pay и Apple Pay, для повышения удобства.P2P: Торгуйте напрямую с другими пользователями на HTX.Внебиржевая Торговля (OTC): Мы предлагаем индивидуальные услуги и конкурентоспособные обменные курсы для трейдеров.Шаг 3: Хранение ConstitutionDAO (PEOPLE)После приобретения вами ConstitutionDAO (PEOPLE) храните их в своем аккаунте на HTX. В качестве альтернативы вы можете отправить их куда-либо с помощью перевода в блокчейне или использовать для торговли с другими криптовалютами.Шаг 4: Торговля ConstitutionDAO (PEOPLE)С легкостью торгуйте ConstitutionDAO (PEOPLE) на спотовом рынке HTX. Просто зайдите в свой аккаунт, выберите торговую пару, совершайте сделки и следите за ними в режиме реального времени. Мы предлагаем удобный интерфейс как для начинающих, так и для опытных трейдеров.

758 просмотров всегоОпубликовано 2024.04.12Обновлено 2026.06.02

Как купить PEOPLE

Обсуждения

Добро пожаловать в Сообщество HTX. Здесь вы сможете быть в курсе последних новостей о развитии платформы и получить доступ к профессиональной аналитической информации о рынке. Мнения пользователей о цене на PEOPLE (PEOPLE) представлены ниже.

活动图片