After Burning Tens of Billions of Dollars in Tokens, Silicon Valley Giants Start Limiting Employee Token Usage

marsbitPublished on 2026-06-01Last updated on 2026-06-01

Abstract

After burning tens of billions of dollars on AI tokens, major Silicon Valley firms are now restricting employee usage. Companies like Microsoft, Uber, and Salesforce, which heavily promoted AI for "efficiency," are facing a cost crisis. The practice of "tokenmaxxing"—pushing employees to maximize AI tool usage—led to wasteful spending on trivial tasks like checking the weather or writing birthday messages, with studies showing significant hidden costs for bug fixes and code rewrites. The core issue is a misalignment between individual productivity gains and actual business value. While employees use AI to automate tasks they dislike, such as writing reports, this often doesn't translate to increased company revenue or improved core business outcomes. For instance, AI-generated code speeds up development but also sees an 800% increase in "code churn" (code being discarded or rewritten). As a result, only 14% of CFOs report seeing a clear, measurable return on AI investments. Firms are now shifting strategies. Microsoft has revoked most internal licenses for Claude Code, while others are implementing monitoring and cost controls. New tools from companies like Harness and CloudZero aim to track AI spending and tie costs to business results. Some AI vendors, like HubSpot, are moving from token-based pricing to charging based on outcomes, such as "resolved conversations" or "leads generated." This represents a necessary correction in the AI adoption cycle. The challenge now is ...

AI automates the tasks employees 'hate,' not the ones that 'make money.'

A few days ago, GeekPark reported that Microsoft, which has placed heavy bets on AI, quietly discontinued Claude Code licenses for most of its employees internally.

This is quite bizarre because one of the biggest selling points of this wave of AI adoption for enterprise users is 'increased efficiency.' If it can increase efficiency, why would Microsoft stop its employees from using Claude Code?

Microsoft is not the only one doing this. 'Tightening token usage' and no longer encouraging employees to go all out on Vibe Coding has become the new trend among Silicon Valley giants.

Uber spent its entire year's AI token budget in four months. Salesforce writes a check to Anthropic for about $300 million annually. An AI consultant revealed that one of his clients had a monthly AI spend as high as $500 million. Meta even quietly took down its internal 'tokenmaxxing leaderboard'—a board originally designed to encourage employees to use AI more.

Now, companies are doing something unthinkable a few years ago:

Limiting, and monitoring, employee use of AI.

Why are major companies shifting their stance?

"Tokenmaxxing," A Reflection of the Times

To understand today's cost crisis, we must first understand what 'tokenmaxxing' is.

This term started gaining popularity around 2025, literally meaning 'maximizing token usage.' Behind it lies a management logic—since the company spent big money on AI tools, employees should use them frantically. The more you use, the more 'digitally transformed' you prove to be. The less you use, you're wasting resources. As a result, many companies set usage quotas, leaderboards, and even performance reviews, pushing employees to use AI.

And the result?

Employees started using the company's enterprise-grade AI models to check the weather, write birthday greetings, and ask what to eat today.

A study of 2,444 companies found that for every dollar a company spends on AI tokens, $0.44 is used to fix bugs generated by AI, $0.27 to rewrite AI-produced code, and $0.11 is consumed in review and merge delays.

In other words, behind every dollar of AI procurement cost lies nearly 80% in hidden losses.

Investor Shruti Gandhi used an apt analogy: "A tokenmaxxing enterprise is like a company measuring productivity by keeping all the lights on—spending more money doesn't equal producing more."

More ironically, most of these companies have no idea what their employees are using AI for, let alone whether the completion of those tasks brought about any change because of AI.

This 'money-burning race' burned from 2024 into 2025, finally igniting this year. JPMorgan issued a sternly worded report with a title uncomfortably blunt—'AI Token Costs Are Eating Up Internet Profits'.

Shopify, Spotify, ServiceNow, and Roku all mentioned in their earnings calls that AI has become a major pressure point on operating expenses. The overall industry sentiment is starting to shift from 'how cool it is to use AI' to 'is this money well spent?'

When CEOs Start Questioning ROI

Only 14% of CFOs say they can see a clear, measurable return on AI investment.

Uber's Chief Operating Officer, Andrew Macdonald, said something very candid in a podcast—they find it difficult to link the productivity gains of individual employees to the overall business impact on the company. "If you can't see how AI helps you push more valuable features to users, token costs are even harder to justify."

This statement highlights the core of the enterprise AI dilemma: Improving individual efficiency does not equal increasing company profits.

An employee writes weekly reports three times faster with AI, but company revenue remains unchanged. An engineer generates code twice as fast with AI, but the code 'churn rate'—the proportion of code abandoned or rewritten—increases by 800%.

Microsoft's former Chief AI Officer, Sophia Velastegui, said something that makes many managers uncomfortable: "Most people default to automating the tasks they dislike, not the tasks most valuable to the company."

Put simply, companies are automating the tasks employees 'hate,' not the ones that 'make money.'

This is not a technical problem; it's a problem of priorities. It's also why about 30% of generative AI projects get abandoned at the proof-of-concept stage—costs are unclear, value is unclear, so the boss naturally stops paying.

Salesforce CEO Marc Benioff's approach is quite representative. Faced with an annual Anthropic bill of $300 million, his expectation is an 'intelligent router': something that can judge which queries are worth using a top-tier model for and which can use a cheaper, smaller model.

This idea itself isn't novel—as early as the cloud computing era, 'pay-as-you-go' and 'resource optimization' were standard practices. But this wave of AI came too fast; everyone bought first and thought later, only now starting to catch up.

Rational Return, or Prelude to Winter?

Microsoft recently canceled most enterprise licenses for Claude Code, with the official reason pointing to cost factors. This has sparked considerable discussion within the industry—after all, Microsoft itself is the largest investor in OpenAI, while simultaneously cutting subscriptions to a competitor. How much of this is cost consideration and how much is strategic planning is hard to say.

But regardless, it represents a signal: enterprises are starting to vote with their feet.

Harness and CloudZero both released AI cost management tools almost on the same day—May 28th. One focuses on real-time monitoring of AI spending and ROI, while the other launched an 'AI Financial Control Plane' to help companies link every dollar of AI spending to specific business outcomes.

The emergence of these two products itself illustrates the problem: there is market demand, and it's urgent.

Starting in April this year, HubSpot adjusted the pricing model for its AI agents, no longer charging by token, but instead charging by 'conversations resolved' or 'leads generated'—a directional shift aligning the seller's interests with the buyer's actual output. ServiceNow is making similar adjustments. AI vendors are realizing that if they continue to sell 'usage' instead of 'results,' enterprise clients will eventually push back collectively.

Is this adjustment a necessary growing pain for AI industrialization, or the prelude to a larger crisis?

I tend to think it's the former. But one detail is somewhat concerning: Global AI software spending is projected to reach $2.59 trillion in 2026, a 47% year-on-year increase. Yet, at the same time, 94% of engineering leaders say key ROI metrics are still missing. More money is being spent, but no one knows where it's burning or if it's worth it—if this contradiction isn't resolved, the next 'tokenmaxxing moment' is only a matter of time.

A Fortune magazine analysis put it bluntly: "Tokenmaxxing is easy; redesigning workflows is hard." What most companies are doing now is optimizing existing processes, not reinventing business models. This is where the real value of AI lies, and it's also a place most enterprises haven't reached yet.

A rational return is a good thing. But after this rational return, companies still need to answer a more difficult question: Should AI be a hammer for our business, or a new framework for thinking?

If you only use AI to do old jobs faster, the bill will eventually force you back to face this question.

This article is from WeChat public account "GeekPark" (ID: geekpark), author: Huilin Dance King, editor: Jingyu

Related Questions

QWhat is the phenomenon of 'tokenmaxxing' in Silicon Valley companies, and why has it become a problem?

ATokenmaxxing is a management trend where companies encourage employees to maximize their usage of AI tokens to demonstrate digital transformation and justify large AI investments. It has become a problem because it leads to significant wasteful spending, as employees use expensive AI models for trivial tasks like checking the weather or writing birthday greetings. Studies show that for every dollar spent on AI tokens, up to 80% can be lost to hidden costs like fixing AI-generated bugs or rewriting code, making the return on investment unclear and prompting a shift towards cost control.

QAccording to the article, what is the core issue with how companies are implementing AI automation?

AThe core issue is that companies are often using AI to automate tasks that employees dislike or find tedious, rather than automating the most valuable, revenue-generating tasks for the business. This misalignment means that while individual productivity metrics might improve, it doesn't translate into measurable business growth or profit, leading CEOs and CFOs to question the ROI of their massive AI expenditures.

QHow are some AI vendors and companies responding to the AI cost crisis?

AIn response to the cost crisis, some AI vendors and companies are shifting their pricing and usage models. For example, HubSpot changed its pricing from a per-token model to charging based on business outcomes like 'solved conversations' or 'generated leads.' Similarly, companies like Microsoft are restricting employee access to certain AI tools like Claude Code, and new tools from Harness and CloudZero are emerging to help businesses monitor AI spending in real-time and tie costs directly to business results.

QWhat did the JPMorgan report indicate about the impact of AI token costs?

AA JPMorgan report, titled 'AI Token Costs Are Eating Internet Profits,' indicated that the substantial costs associated with AI token consumption are becoming a major pressure on the operational expenses and overall profitability of internet and tech companies. This report contributed to a broader industry shift from enthusiasm about AI adoption to serious scrutiny of whether the spending is justified.

QWhat does the article suggest is the harder question companies face after the 'rational return' in AI spending?

AThe article suggests that after the rational return—where companies start controlling costs and seeking clearer ROI—they face a harder, more fundamental question: whether AI should merely be a tool to perform existing tasks faster (a 'hammer') or if it should serve as a new framework for entirely reinventing business models and workflows. The article implies that failing to address this question will lead to recurring cost crises.

Related Reads

Bitcoin's Weak Rebound Fails to Conceal Adjustment Trend; HYPE's Top Signals Warn of Short-Term Risks | Guest Analysis

Bitcoin's Weak Bounce Fails to Mask Correction Trend; HYPE Top Signals Warn of Short-Term Risks | Invited Analysis Core Weekly View: Bitcoin's daily chart structure has weakened. The key question is whether its short-term rebound can effectively break above the upper boundary of the descending channel. Has HYPE's seven-wave advance reached its conclusion? This analysis systematically examines the current market structure across multiple timeframes and outlines operational strategies for the week. **Bitcoin (BTC) Analysis:** The daily chart shows BTC trading within a long-term rising channel (yellow) since February but has recently broken below its midline, indicating structural weakness. It is currently confined within a short-term descending channel (blue) originating from the May 6 high. The ongoing bounce appears to be a weak technical correction targeting the blue channel's upper rail (approx. $75,000-$76,000). The 4-hour chart reveals a complex 10-segment corrective structure from the May high, containing two downward pivot zones (Central D and E). The current rebound (segment 36-37) is expected to face resistance in the $75,000-$76,000 area. A failure to break above could lead to a resumption of the downtrend, testing support at $69,500-$70,500 and potentially $65,000. **BTC Weekly Strategy:** The price is currently below the "Bull-Bear Channel," placing it in a technically weak zone. The core focus is on the test of the $75,000-$76,000 resistance and $69,500-$70,500 support. * *Medium-term*: Consider initiating short positions (up to 30% allocation) if the price rejects the $75,000-$76,000 area. Increase exposure to 60% if the long-term rising channel's lower support fails. * *Short-term (30% allocation)*: Two scenarios are outlined: * *Plan A (Sell on Rally)*: Short on rejection at $75,000-$76,000, with a stop-loss above $77,000. * *Plan B (Breakdown Sell)*: Short on a confirmed breakdown below $69,500-$70,500, with a stop-loss above $72,000. **HYPE Analysis:** The 4-hour chart shows HYPE has completed a seven-wave advance from its May 14 low, including a central consolidation zone. A bearish divergence was noted at the prior high (point 45), leading to a 13% correction. The current rally leg (46-47) shows weakening momentum compared to the initial leg (42-43), suggesting a potential momentum divergence. Furthermore, the proprietary "Spread Trading Model" has triggered a strong top warning signal at point 47. A confirmed top here, combined with the momentum divergence, could signal the end of the current uptrend. **HYPE Weekly Strategy:** The core is observing whether a confirmed top at point 47 coincides with the momentum divergence. * Monitor the key support zone of $62.5-$64.75. A hold and bounce from this area, supported by model buy signals, could allow for a light long position (<30% allocation). * A decisive break below this support would indicate a shift to a larger-degree correction, targeting the $54-$56.3 area. **Trade Review:** A previous short trade on BTC was executed at $77,449 based on model top signals (bearish candlestick pattern, spread model warning, momentum divergence) and closed at $73,519 for a 5.07% profit. **Risk Management Reminder:** Always set an initial stop-loss immediately upon entry. Move the stop-loss to breakeven once a 1% profit is achieved, and trail it upwards to lock in profits as the trade progresses. *Disclaimer: Market conditions change rapidly. All views, models, and strategies are for educational purposes and personal trading logs only, not investment advice. Trading carries significant risk.*

Odaily星球日报3m ago

Bitcoin's Weak Rebound Fails to Conceal Adjustment Trend; HYPE's Top Signals Warn of Short-Term Risks | Guest Analysis

Odaily星球日报3m ago

How to Define "Real U.S. Stocks": Differences Between On-Chain Tokens, Price Contracts, and Direct Broker Connections

**Title:** Defining "Real US Stocks": Differences Among On-Chain Tokens, Price Contracts, and Broker-Direct Access **Summary:** In 2026, using stablecoins to purchase US stocks is mainstream, but products marketed as "buying US stocks with USDT" offer fundamentally different assets. This article analyzes three primary models. **1. Tokenized Stocks:** These are on-chain tokens representing economic exposure to underlying stocks, held by an issuer or custodian. They offer benefits like 24/7 trading and DeFi composability (e.g., use as loan collateral). However, users lack direct legal shareholder status; dividends may not be paid in cash, and voting rights are typically non-binding advisory expressions. Examples include platforms like Ondo Finance. **2. Stock Futures / Equity Perpetuals:** These are derivative contracts tracking a stock's price, allowing leveraged long/short positions 24/7, similar to crypto perpetuals. They offer high efficiency and flexibility but involve funding fees, which can be a significant long-term cost, especially during strong trends. Crucially, they confer no ownership rights (dividends, voting) to the holder. **3. Broker-Direct Model:** This model provides access to real securities via licensed broker-dealers. Stocks/ETFs are bought and held within the US clearing and custodial system (e.g., DTCC), making it the only path to genuine stock ownership. Users receive cash dividends and formal proxy voting rights (where applicable). It supports thousands of stocks and ETFs, far exceeding the coverage of the other two models. Key advantages include no funding fees, a clean cost structure for long-term holds, and the potential to transfer holdings to other brokers. Some platforms facilitate stablecoin (USDT/USDC) deposits, reducing reliance on traditional banking. A critical distinction exists *within* the broker-direct model: the underlying brokerage architecture (e.g., Fully Disclosed IB, Omnibus IB, Self-Clearing) determines how client assets are held, protected, and how safeguards like SIPC insurance are conveyed. Users should verify the specific clearing structure and regulatory compliance of any platform. In conclusion, "buying US stocks with USDT" can mean holding an on-chain economic proxy (Tokenized Stocks), trading a price derivative (Stock Futures), or owning the actual security (Broker-Direct). For users seeking full ownership rights and long-term investment, the broker-direct model is the definitive choice, though its implementation details require careful scrutiny.

marsbit1h ago

How to Define "Real U.S. Stocks": Differences Between On-Chain Tokens, Price Contracts, and Direct Broker Connections

marsbit1h ago

NVIDIA Launches DSX Platform, Expanding into AI Factory Infrastructure

NVIDIA has unveiled the DSX platform at its GTC Taipei event, marking a strategic expansion from GPU sales into comprehensive AI factory infrastructure solutions. The platform addresses challenges like power supply, cooling, and resource orchestration as AI models scale, shifting the industry focus from single-chip performance to overall infrastructure efficiency. DSX integrates NVIDIA's chips, systems, software, and partner technologies to cover the entire AI factory lifecycle—from design and simulation to deployment and operations. It aims to accelerate deployment, improve reliability and operational efficiency, and reduce the cost per generated token in AI inference. The software suite includes DSX MaxLPS, which uses 45°C liquid cooling and rack-level optimization to allow up to 40% more GPUs per megawatt, and DSX OS, an open-source platform for AI factory operations. The platform also encompasses reference designs, digital twin simulation (DSX Sim), dynamic workload adjustment based on grid conditions (DSX Flex), and data exchange between systems. Early adopters include cloud providers like CoreWeave and Lambda. Major hardware partners, including Dell, HPE, Lenovo, and Supermicro, are developing DSX-ready systems. Pilot projects for DSX Flex are underway with energy providers. Strategically, DSX represents NVIDIA's ongoing transition from an AI chip supplier to a full-stack AI infrastructure platform provider, aiming to set industry standards and solidify its market leadership.

marsbit1h ago

NVIDIA Launches DSX Platform, Expanding into AI Factory Infrastructure

marsbit1h ago

Trading

Spot
Futures

Hot Articles

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of AI (AI) are presented below.

活动图片