After Burning Tens of Billions of Dollars in Tokens, Silicon Valley Giants Start Limiting Employee Token Usage

marsbitОпубликовано 2026-06-01Обновлено 2026-06-01

Введение

After burning tens of billions of dollars on AI tokens, major Silicon Valley firms are now restricting employee usage. Companies like Microsoft, Uber, and Salesforce, which heavily promoted AI for "efficiency," are facing a cost crisis. The practice of "tokenmaxxing"—pushing employees to maximize AI tool usage—led to wasteful spending on trivial tasks like checking the weather or writing birthday messages, with studies showing significant hidden costs for bug fixes and code rewrites. The core issue is a misalignment between individual productivity gains and actual business value. While employees use AI to automate tasks they dislike, such as writing reports, this often doesn't translate to increased company revenue or improved core business outcomes. For instance, AI-generated code speeds up development but also sees an 800% increase in "code churn" (code being discarded or rewritten). As a result, only 14% of CFOs report seeing a clear, measurable return on AI investments. Firms are now shifting strategies. Microsoft has revoked most internal licenses for Claude Code, while others are implementing monitoring and cost controls. New tools from companies like Harness and CloudZero aim to track AI spending and tie costs to business results. Some AI vendors, like HubSpot, are moving from token-based pricing to charging based on outcomes, such as "resolved conversations" or "leads generated." This represents a necessary correction in the AI adoption cycle. The challenge now is ...

AI automates the tasks employees 'hate,' not the ones that 'make money.'

A few days ago, GeekPark reported that Microsoft, which has placed heavy bets on AI, quietly discontinued Claude Code licenses for most of its employees internally.

This is quite bizarre because one of the biggest selling points of this wave of AI adoption for enterprise users is 'increased efficiency.' If it can increase efficiency, why would Microsoft stop its employees from using Claude Code?

Microsoft is not the only one doing this. 'Tightening token usage' and no longer encouraging employees to go all out on Vibe Coding has become the new trend among Silicon Valley giants.

Uber spent its entire year's AI token budget in four months. Salesforce writes a check to Anthropic for about $300 million annually. An AI consultant revealed that one of his clients had a monthly AI spend as high as $500 million. Meta even quietly took down its internal 'tokenmaxxing leaderboard'—a board originally designed to encourage employees to use AI more.

Now, companies are doing something unthinkable a few years ago:

Limiting, and monitoring, employee use of AI.

Why are major companies shifting their stance?

"Tokenmaxxing," A Reflection of the Times

To understand today's cost crisis, we must first understand what 'tokenmaxxing' is.

This term started gaining popularity around 2025, literally meaning 'maximizing token usage.' Behind it lies a management logic—since the company spent big money on AI tools, employees should use them frantically. The more you use, the more 'digitally transformed' you prove to be. The less you use, you're wasting resources. As a result, many companies set usage quotas, leaderboards, and even performance reviews, pushing employees to use AI.

And the result?

Employees started using the company's enterprise-grade AI models to check the weather, write birthday greetings, and ask what to eat today.

A study of 2,444 companies found that for every dollar a company spends on AI tokens, $0.44 is used to fix bugs generated by AI, $0.27 to rewrite AI-produced code, and $0.11 is consumed in review and merge delays.

In other words, behind every dollar of AI procurement cost lies nearly 80% in hidden losses.

Investor Shruti Gandhi used an apt analogy: "A tokenmaxxing enterprise is like a company measuring productivity by keeping all the lights on—spending more money doesn't equal producing more."

More ironically, most of these companies have no idea what their employees are using AI for, let alone whether the completion of those tasks brought about any change because of AI.

This 'money-burning race' burned from 2024 into 2025, finally igniting this year. JPMorgan issued a sternly worded report with a title uncomfortably blunt—'AI Token Costs Are Eating Up Internet Profits'.

Shopify, Spotify, ServiceNow, and Roku all mentioned in their earnings calls that AI has become a major pressure point on operating expenses. The overall industry sentiment is starting to shift from 'how cool it is to use AI' to 'is this money well spent?'

When CEOs Start Questioning ROI

Only 14% of CFOs say they can see a clear, measurable return on AI investment.

Uber's Chief Operating Officer, Andrew Macdonald, said something very candid in a podcast—they find it difficult to link the productivity gains of individual employees to the overall business impact on the company. "If you can't see how AI helps you push more valuable features to users, token costs are even harder to justify."

This statement highlights the core of the enterprise AI dilemma: Improving individual efficiency does not equal increasing company profits.

An employee writes weekly reports three times faster with AI, but company revenue remains unchanged. An engineer generates code twice as fast with AI, but the code 'churn rate'—the proportion of code abandoned or rewritten—increases by 800%.

Microsoft's former Chief AI Officer, Sophia Velastegui, said something that makes many managers uncomfortable: "Most people default to automating the tasks they dislike, not the tasks most valuable to the company."

Put simply, companies are automating the tasks employees 'hate,' not the ones that 'make money.'

This is not a technical problem; it's a problem of priorities. It's also why about 30% of generative AI projects get abandoned at the proof-of-concept stage—costs are unclear, value is unclear, so the boss naturally stops paying.

Salesforce CEO Marc Benioff's approach is quite representative. Faced with an annual Anthropic bill of $300 million, his expectation is an 'intelligent router': something that can judge which queries are worth using a top-tier model for and which can use a cheaper, smaller model.

This idea itself isn't novel—as early as the cloud computing era, 'pay-as-you-go' and 'resource optimization' were standard practices. But this wave of AI came too fast; everyone bought first and thought later, only now starting to catch up.

Rational Return, or Prelude to Winter?

Microsoft recently canceled most enterprise licenses for Claude Code, with the official reason pointing to cost factors. This has sparked considerable discussion within the industry—after all, Microsoft itself is the largest investor in OpenAI, while simultaneously cutting subscriptions to a competitor. How much of this is cost consideration and how much is strategic planning is hard to say.

But regardless, it represents a signal: enterprises are starting to vote with their feet.

Harness and CloudZero both released AI cost management tools almost on the same day—May 28th. One focuses on real-time monitoring of AI spending and ROI, while the other launched an 'AI Financial Control Plane' to help companies link every dollar of AI spending to specific business outcomes.

The emergence of these two products itself illustrates the problem: there is market demand, and it's urgent.

Starting in April this year, HubSpot adjusted the pricing model for its AI agents, no longer charging by token, but instead charging by 'conversations resolved' or 'leads generated'—a directional shift aligning the seller's interests with the buyer's actual output. ServiceNow is making similar adjustments. AI vendors are realizing that if they continue to sell 'usage' instead of 'results,' enterprise clients will eventually push back collectively.

Is this adjustment a necessary growing pain for AI industrialization, or the prelude to a larger crisis?

I tend to think it's the former. But one detail is somewhat concerning: Global AI software spending is projected to reach $2.59 trillion in 2026, a 47% year-on-year increase. Yet, at the same time, 94% of engineering leaders say key ROI metrics are still missing. More money is being spent, but no one knows where it's burning or if it's worth it—if this contradiction isn't resolved, the next 'tokenmaxxing moment' is only a matter of time.

A Fortune magazine analysis put it bluntly: "Tokenmaxxing is easy; redesigning workflows is hard." What most companies are doing now is optimizing existing processes, not reinventing business models. This is where the real value of AI lies, and it's also a place most enterprises haven't reached yet.

A rational return is a good thing. But after this rational return, companies still need to answer a more difficult question: Should AI be a hammer for our business, or a new framework for thinking?

If you only use AI to do old jobs faster, the bill will eventually force you back to face this question.

This article is from WeChat public account "GeekPark" (ID: geekpark), author: Huilin Dance King, editor: Jingyu

Связанные с этим вопросы

QWhat is the phenomenon of 'tokenmaxxing' in Silicon Valley companies, and why has it become a problem?

ATokenmaxxing is a management trend where companies encourage employees to maximize their usage of AI tokens to demonstrate digital transformation and justify large AI investments. It has become a problem because it leads to significant wasteful spending, as employees use expensive AI models for trivial tasks like checking the weather or writing birthday greetings. Studies show that for every dollar spent on AI tokens, up to 80% can be lost to hidden costs like fixing AI-generated bugs or rewriting code, making the return on investment unclear and prompting a shift towards cost control.

QAccording to the article, what is the core issue with how companies are implementing AI automation?

AThe core issue is that companies are often using AI to automate tasks that employees dislike or find tedious, rather than automating the most valuable, revenue-generating tasks for the business. This misalignment means that while individual productivity metrics might improve, it doesn't translate into measurable business growth or profit, leading CEOs and CFOs to question the ROI of their massive AI expenditures.

QHow are some AI vendors and companies responding to the AI cost crisis?

AIn response to the cost crisis, some AI vendors and companies are shifting their pricing and usage models. For example, HubSpot changed its pricing from a per-token model to charging based on business outcomes like 'solved conversations' or 'generated leads.' Similarly, companies like Microsoft are restricting employee access to certain AI tools like Claude Code, and new tools from Harness and CloudZero are emerging to help businesses monitor AI spending in real-time and tie costs directly to business results.

QWhat did the JPMorgan report indicate about the impact of AI token costs?

AA JPMorgan report, titled 'AI Token Costs Are Eating Internet Profits,' indicated that the substantial costs associated with AI token consumption are becoming a major pressure on the operational expenses and overall profitability of internet and tech companies. This report contributed to a broader industry shift from enthusiasm about AI adoption to serious scrutiny of whether the spending is justified.

QWhat does the article suggest is the harder question companies face after the 'rational return' in AI spending?

AThe article suggests that after the rational return—where companies start controlling costs and seeking clearer ROI—they face a harder, more fundamental question: whether AI should merely be a tool to perform existing tasks faster (a 'hammer') or if it should serve as a new framework for entirely reinventing business models and workflows. The article implies that failing to address this question will lead to recurring cost crises.

Похожее

The Midlife Crisis of Crypto GPs: No PMF, No Next Check from LPs

The article "The Midlife Crisis of Crypto GPs: No PMF, No Next LP Check" analyzes the shifting crypto fundraising landscape. It argues the era of selling grand visions to LPs is over; GPs must now offer products with clear Product-Market Fit (PMF). The author categorizes crypto fundraising products into three types: Primary (VC funds), Liquid (trading strategies), and CeFi/DeFi Native Yield. This summary focuses on the Primary market. Key points include: * **Market Shift:** LPs are impatient, demand immediate returns, and are skeptical of future promises. The "easy money" narrative has faded. * **GP Value Erosion:** LP learning curves have shortened (aided by AI), reducing the value of a GP's basic "crypto knowledge." Superior judgment is now rare. * **Weakened LP Motivations:** Traditional reasons for LPs to invest in crypto VC funds (capturing industry beta, gaining access, leveraging GP judgment) have weakened due to new products like ETFs and increased LP sophistication. * **Surviving in Primary:** The primary market will likely persist for: 1) large funds in endowment mandates treating it as a lottery ticket, 2) family offices/HNWIs using proprietary capital, 3) a few funds with proven recent outperformance, and 4) funds with strong ecosystem "deal-making" capabilities. * **Conclusion:** For most GPs, rebuilding trust requires starting over in a niche, demonstrating alpha-generating ability, or providing concrete value/services to LPs.

marsbit3 ч. назад

The Midlife Crisis of Crypto GPs: No PMF, No Next Check from LPs

marsbit3 ч. назад

Crypto GPs' Midlife Crisis: No PMF, No LP's Next Check

The article "The Midlife Crisis of Crypto GPs: No PMF, No LP's Next Check" analyzes the shifting crypto fundraising landscape. It argues that the era of LPs funding vague "vision" is over; GPs must now offer products with clear Product-Market Fit (PMF) to secure capital. The market has matured. LPs, disillusioned by the last cycle's failures and wary of long lock-up periods, now demand tangible, near-term returns rather than speculative narratives. The proliferation of accessible crypto ETFs and other liquid products has reduced the need for VC blind pools as an entry point. The author categorizes crypto fundraising products into three types: Primary (VC funds, with blind pools or clear pipelines), Liquid (alpha/beta, directional/market-neutral strategies), and CeFi/DeFi Native Yield (crypto-specific mechanisms like staking, farming). Focusing on the Primary market, the piece details why traditional LP rationales for investing in crypto VCs have weakened: easier beta access via ETFs, diminished "access" and "judgement" premiums as LPs build internal teams, and a widespread lack of proven superior returns from GPs. Ultimately, only specific players are likely to remain at the primary VC table: large funds with access to patient endowment capital, family offices/HNWIs investing proprietary capital, the few funds with demonstrable excess returns from the last cycle, and those with clear "deal-making" or ecosystem resource advantages. For others, the path forward is to rebuild trust by proving alpha-generation capability in a niche or providing concrete, valuable services.

链捕手3 ч. назад

Crypto GPs' Midlife Crisis: No PMF, No LP's Next Check

链捕手3 ч. назад

Торговля

Спот
Фьючерсы

Популярные статьи

Неделя обучения по популярным токенам (2): 2026 может стать годом приложений реального времени, сектор AI продолжает оставаться в тренде

2025 год — год институциональных инвесторов, в будущем он будет доминировать в приложениях реального времени.

1.8k просмотров всегоОпубликовано 2025.12.16Обновлено 2025.12.16

Неделя обучения по популярным токенам (2): 2026 может стать годом приложений реального времени, сектор AI продолжает оставаться в тренде

Обсуждения

Добро пожаловать в Сообщество HTX. Здесь вы сможете быть в курсе последних новостей о развитии платформы и получить доступ к профессиональной аналитической информации о рынке. Мнения пользователей о цене на AI (AI) представлены ниже.

活动图片