After Burning Tens of Billions of Dollars in Tokens, Silicon Valley Giants Start Limiting Employee Token Usage

marsbitОпубликовано 2026-06-01Обновлено 2026-06-01

Введение

After burning tens of billions of dollars on AI tokens, major Silicon Valley firms are now restricting employee usage. Companies like Microsoft, Uber, and Salesforce, which heavily promoted AI for "efficiency," are facing a cost crisis. The practice of "tokenmaxxing"—pushing employees to maximize AI tool usage—led to wasteful spending on trivial tasks like checking the weather or writing birthday messages, with studies showing significant hidden costs for bug fixes and code rewrites. The core issue is a misalignment between individual productivity gains and actual business value. While employees use AI to automate tasks they dislike, such as writing reports, this often doesn't translate to increased company revenue or improved core business outcomes. For instance, AI-generated code speeds up development but also sees an 800% increase in "code churn" (code being discarded or rewritten). As a result, only 14% of CFOs report seeing a clear, measurable return on AI investments. Firms are now shifting strategies. Microsoft has revoked most internal licenses for Claude Code, while others are implementing monitoring and cost controls. New tools from companies like Harness and CloudZero aim to track AI spending and tie costs to business results. Some AI vendors, like HubSpot, are moving from token-based pricing to charging based on outcomes, such as "resolved conversations" or "leads generated." This represents a necessary correction in the AI adoption cycle. The challenge now is ...

AI automates the tasks employees 'hate,' not the ones that 'make money.'

A few days ago, GeekPark reported that Microsoft, which has placed heavy bets on AI, quietly discontinued Claude Code licenses for most of its employees internally.

This is quite bizarre because one of the biggest selling points of this wave of AI adoption for enterprise users is 'increased efficiency.' If it can increase efficiency, why would Microsoft stop its employees from using Claude Code?

Microsoft is not the only one doing this. 'Tightening token usage' and no longer encouraging employees to go all out on Vibe Coding has become the new trend among Silicon Valley giants.

Uber spent its entire year's AI token budget in four months. Salesforce writes a check to Anthropic for about $300 million annually. An AI consultant revealed that one of his clients had a monthly AI spend as high as $500 million. Meta even quietly took down its internal 'tokenmaxxing leaderboard'—a board originally designed to encourage employees to use AI more.

Now, companies are doing something unthinkable a few years ago:

Limiting, and monitoring, employee use of AI.

Why are major companies shifting their stance?

"Tokenmaxxing," A Reflection of the Times

To understand today's cost crisis, we must first understand what 'tokenmaxxing' is.

This term started gaining popularity around 2025, literally meaning 'maximizing token usage.' Behind it lies a management logic—since the company spent big money on AI tools, employees should use them frantically. The more you use, the more 'digitally transformed' you prove to be. The less you use, you're wasting resources. As a result, many companies set usage quotas, leaderboards, and even performance reviews, pushing employees to use AI.

And the result?

Employees started using the company's enterprise-grade AI models to check the weather, write birthday greetings, and ask what to eat today.

A study of 2,444 companies found that for every dollar a company spends on AI tokens, $0.44 is used to fix bugs generated by AI, $0.27 to rewrite AI-produced code, and $0.11 is consumed in review and merge delays.

In other words, behind every dollar of AI procurement cost lies nearly 80% in hidden losses.

Investor Shruti Gandhi used an apt analogy: "A tokenmaxxing enterprise is like a company measuring productivity by keeping all the lights on—spending more money doesn't equal producing more."

More ironically, most of these companies have no idea what their employees are using AI for, let alone whether the completion of those tasks brought about any change because of AI.

This 'money-burning race' burned from 2024 into 2025, finally igniting this year. JPMorgan issued a sternly worded report with a title uncomfortably blunt—'AI Token Costs Are Eating Up Internet Profits'.

Shopify, Spotify, ServiceNow, and Roku all mentioned in their earnings calls that AI has become a major pressure point on operating expenses. The overall industry sentiment is starting to shift from 'how cool it is to use AI' to 'is this money well spent?'

When CEOs Start Questioning ROI

Only 14% of CFOs say they can see a clear, measurable return on AI investment.

Uber's Chief Operating Officer, Andrew Macdonald, said something very candid in a podcast—they find it difficult to link the productivity gains of individual employees to the overall business impact on the company. "If you can't see how AI helps you push more valuable features to users, token costs are even harder to justify."

This statement highlights the core of the enterprise AI dilemma: Improving individual efficiency does not equal increasing company profits.

An employee writes weekly reports three times faster with AI, but company revenue remains unchanged. An engineer generates code twice as fast with AI, but the code 'churn rate'—the proportion of code abandoned or rewritten—increases by 800%.

Microsoft's former Chief AI Officer, Sophia Velastegui, said something that makes many managers uncomfortable: "Most people default to automating the tasks they dislike, not the tasks most valuable to the company."

Put simply, companies are automating the tasks employees 'hate,' not the ones that 'make money.'

This is not a technical problem; it's a problem of priorities. It's also why about 30% of generative AI projects get abandoned at the proof-of-concept stage—costs are unclear, value is unclear, so the boss naturally stops paying.

Salesforce CEO Marc Benioff's approach is quite representative. Faced with an annual Anthropic bill of $300 million, his expectation is an 'intelligent router': something that can judge which queries are worth using a top-tier model for and which can use a cheaper, smaller model.

This idea itself isn't novel—as early as the cloud computing era, 'pay-as-you-go' and 'resource optimization' were standard practices. But this wave of AI came too fast; everyone bought first and thought later, only now starting to catch up.

Rational Return, or Prelude to Winter?

Microsoft recently canceled most enterprise licenses for Claude Code, with the official reason pointing to cost factors. This has sparked considerable discussion within the industry—after all, Microsoft itself is the largest investor in OpenAI, while simultaneously cutting subscriptions to a competitor. How much of this is cost consideration and how much is strategic planning is hard to say.

But regardless, it represents a signal: enterprises are starting to vote with their feet.

Harness and CloudZero both released AI cost management tools almost on the same day—May 28th. One focuses on real-time monitoring of AI spending and ROI, while the other launched an 'AI Financial Control Plane' to help companies link every dollar of AI spending to specific business outcomes.

The emergence of these two products itself illustrates the problem: there is market demand, and it's urgent.

Starting in April this year, HubSpot adjusted the pricing model for its AI agents, no longer charging by token, but instead charging by 'conversations resolved' or 'leads generated'—a directional shift aligning the seller's interests with the buyer's actual output. ServiceNow is making similar adjustments. AI vendors are realizing that if they continue to sell 'usage' instead of 'results,' enterprise clients will eventually push back collectively.

Is this adjustment a necessary growing pain for AI industrialization, or the prelude to a larger crisis?

I tend to think it's the former. But one detail is somewhat concerning: Global AI software spending is projected to reach $2.59 trillion in 2026, a 47% year-on-year increase. Yet, at the same time, 94% of engineering leaders say key ROI metrics are still missing. More money is being spent, but no one knows where it's burning or if it's worth it—if this contradiction isn't resolved, the next 'tokenmaxxing moment' is only a matter of time.

A Fortune magazine analysis put it bluntly: "Tokenmaxxing is easy; redesigning workflows is hard." What most companies are doing now is optimizing existing processes, not reinventing business models. This is where the real value of AI lies, and it's also a place most enterprises haven't reached yet.

A rational return is a good thing. But after this rational return, companies still need to answer a more difficult question: Should AI be a hammer for our business, or a new framework for thinking?

If you only use AI to do old jobs faster, the bill will eventually force you back to face this question.

This article is from WeChat public account "GeekPark" (ID: geekpark), author: Huilin Dance King, editor: Jingyu

Связанные с этим вопросы

QWhat is the phenomenon of 'tokenmaxxing' in Silicon Valley companies, and why has it become a problem?

ATokenmaxxing is a management trend where companies encourage employees to maximize their usage of AI tokens to demonstrate digital transformation and justify large AI investments. It has become a problem because it leads to significant wasteful spending, as employees use expensive AI models for trivial tasks like checking the weather or writing birthday greetings. Studies show that for every dollar spent on AI tokens, up to 80% can be lost to hidden costs like fixing AI-generated bugs or rewriting code, making the return on investment unclear and prompting a shift towards cost control.

QAccording to the article, what is the core issue with how companies are implementing AI automation?

AThe core issue is that companies are often using AI to automate tasks that employees dislike or find tedious, rather than automating the most valuable, revenue-generating tasks for the business. This misalignment means that while individual productivity metrics might improve, it doesn't translate into measurable business growth or profit, leading CEOs and CFOs to question the ROI of their massive AI expenditures.

QHow are some AI vendors and companies responding to the AI cost crisis?

AIn response to the cost crisis, some AI vendors and companies are shifting their pricing and usage models. For example, HubSpot changed its pricing from a per-token model to charging based on business outcomes like 'solved conversations' or 'generated leads.' Similarly, companies like Microsoft are restricting employee access to certain AI tools like Claude Code, and new tools from Harness and CloudZero are emerging to help businesses monitor AI spending in real-time and tie costs directly to business results.

QWhat did the JPMorgan report indicate about the impact of AI token costs?

AA JPMorgan report, titled 'AI Token Costs Are Eating Internet Profits,' indicated that the substantial costs associated with AI token consumption are becoming a major pressure on the operational expenses and overall profitability of internet and tech companies. This report contributed to a broader industry shift from enthusiasm about AI adoption to serious scrutiny of whether the spending is justified.

QWhat does the article suggest is the harder question companies face after the 'rational return' in AI spending?

AThe article suggests that after the rational return—where companies start controlling costs and seeking clearer ROI—they face a harder, more fundamental question: whether AI should merely be a tool to perform existing tasks faster (a 'hammer') or if it should serve as a new framework for entirely reinventing business models and workflows. The article implies that failing to address this question will lead to recurring cost crises.

Похожее

Xiaohongshu's Second Great Voyage, This Time Sailing Towards AI

Xiaohongshu's Second Voyage: Navigating Towards AI Since ChatGPT's emergence, Xiaohongshu's founder Mao Wenchao has been acutely aware of AI's potential threat, recognizing that the life advice people seek from chatbots overlaps directly with his platform's core business. Founded in 2013 as a PDF shopping guide for Chinese tourists, Xiaohongshu evolved into a massive community where millions share authentic, personal experiences—from product reviews to travel tips. This vast repository of "I've tried this" human judgment became its most valuable asset. However, the rise of AI, which delivers instant answers, challenges the very need for users to sift through numerous personal notes. Fearing its treasure trove of lived experience could become mere training data for others, Xiaohongshu is proactively adapting. In 2026, it established a dedicated AI division (Dots), launched RED Skill to turn user experiences into usable AI tools, and acquired the AI search product "Diandian." Its investments now extend to AI firms like MiniMax and hardware startups, moving upstream to address needs before they even become search queries. The platform's commercialization strategy is also evolving. With a newly acquired payment license and tools like the AIPS model to track consumer decision journeys, Xiaohongshu aims to seamlessly integrate recommendations with transactions, embedding commerce within AI-generated answers. Yet, a critical tension remains. While building smarter machines to organize and leverage its human experiences, Xiaohongshu must prevent AI from drowning out the authentic, flawed, and trustworthy "I've tried this" voices that built its community. Its core challenge is to harness AI's power without letting the map—the machine's perfect, synthesized answer—replace the territory of genuine human experience. This balance between technological advancement and preserving human trust defines its current journey and its future.

marsbit6 мин. назад

Xiaohongshu's Second Great Voyage, This Time Sailing Towards AI

marsbit6 мин. назад

SharpLink CEO: How to Interpret Ethereum Developers Surpassing 1 Million?

SharpLink CEO reflects on the milestone of Ethereum surpassing one million historical developers, a figure unmatched in crypto. Based on Electric Capital data and observations from Asian developer hubs, the article argues that Ethereum’s true strength isn't raw speed, but where top builders choose to build long-term. The core advantage is a decade-deep ecosystem of standards, tools, liquidity, and culture. Of the over one million developers, about 232,000 were active in the past year, working on hard problems: protocol scalability (e.g., the upcoming Glamsterdam upgrade), synchronous composability across rollups, and post-quantum security preparedness. Ethereum’s defensibility stems from network effects beyond numbers: deep composability (the "financial Lego" of DeFi), the universal EVM/Solidity stack, credible neutrality (secured by ~900k validators), and its role as the trusted settlement layer for major institutions. The convergence of top researchers, a modular rollup ecosystem, and a culture of long-term thinking create a compounding advantage that is difficult to replicate. The conclusion is that Ethereum has consolidated its position as the default operating system for programmable finance and internet-native capital, attracting the builders who are architecting the next generation of financial infrastructure.

marsbit17 мин. назад

SharpLink CEO: How to Interpret Ethereum Developers Surpassing 1 Million?

marsbit17 мин. назад

SharpLink CEO: How to Understand Ethereum Developers Just Exceeded 1 Million?

SharpLink CEO reflects on the milestone of Ethereum surpassing 1 million historical developers, emphasizing that this figure represents the largest pool of technical talent ever assembled around an open, permissionless blockchain network. While approximately 232,000 developers remain active, the key question for the crypto industry is not which chain is fastest, but where the best builders choose to build long-term. Ethereum's advantage lies in a decade-long accumulation of infrastructure, standards, tools, liquidity, and a cohesive culture, making it the default operating system for programmable finance. This developer base is tackling complex challenges: the Glamsterdam upgrade aims to enhance scalability while preserving core principles; synchronous composability seeks to unify Rollup ecosystems; and significant efforts are underway for post-quantum security. Ethereum's deeper network effects stem from composability and shared standards (like the EVM and Solidity), creating a flywheel of more developers, tools, and liquidity. Three reinforcing strengths cement Ethereum's lead: credible neutrality (secured by ~900k validators), a modular architecture with interconnected Rollups, and a culture that attracts top researchers. The ecosystem is consolidating as the trusted coordination layer for internet-native finance, favored by large institutions valuing security and liquidity. The future of Ethereum is being built by this global community of founders and architects.

链捕手20 мин. назад

SharpLink CEO: How to Understand Ethereum Developers Just Exceeded 1 Million?

链捕手20 мин. назад

A Clod of Chinese Soil Chokes Two Japanese Giants

"Chinese Soil Chokes Japanese Giants" The production of a key electronic specialty gas, tungsten hexafluoride (WF6), vital for manufacturing AI chips, was halted by two leading Japanese producers—Kanto Denka and Central Glass. Their shutdown was not due to a technological failure but a sudden, critical shortage of a raw material they had long taken for granted: ultra-high-purity (6N-grade) tungsten powder, which is almost entirely sourced from China. Following a quiet Chinese export announcement in January 2026, tungsten powder shipments to Japan dropped to zero for months. Despite frantic efforts, Japanese companies found no viable alternative; imported powder was three times more expensive and lacked the required purity. Their existing stockpiles were exhausted by mid-2026. WF6 is essential for depositing tungsten into the microscopic contact holes of High Bandwidth Memory (HBM) chips, which are crucial for advanced processors like those from Nvidia. While Japanese firms had mastered producing ultra-pure WF6 gas, their entire supply chain relied on China's 6N tungsten powder—a dependency now revealed as a fatal vulnerability. China's dominance in this "soil" results from decades of painstaking R&D by companies like Xiamen Tungsten and China Tungsten & Hightech. They overcame immense technical hurdles, such as separating chemically similar molybdenum from tungsten, to achieve mass production of the world's purest tungsten powder. With their primary suppliers gone, Kanto Denka and Central Glass announced a permanent halt to WF6 production starting July 1, 2026. This immediately created a supply crisis for major semiconductor manufacturers like Samsung and SK Hynix, forcing them to urgently seek and certify new Chinese suppliers for WF6 itself. The reversal marks a dramatic shift: China has moved from exporting low-value raw materials to controlling the high-purity foundation of a critical global tech supply chain, upending a long-established industrial hierarchy.

marsbit51 мин. назад

A Clod of Chinese Soil Chokes Two Japanese Giants

marsbit51 мин. назад

From Identity Protocol to AI Gateway: How Ambitious Is World?

The article discusses Worldcoin's (WLD) evolution from a "proof of personhood" network to a potential identity infrastructure for the AI era. Initially focused on unique digital identities via iris scans, Worldcoin faced skepticism over real-world use cases. However, entering "The Simple Plan" Phase 3, it's shifting from token incentives to utility-driven growth. Key applications are emerging: enterprise solutions (e.g., deepfake verification with Zoom, Okta), consumer use cases (e.g., reducing bots on Tinder or in ticket sales), and notably, the AI Agent sector via AgentKit, which aims to establish trusted authorization between humans and AI agents. The author argues AI's proliferation makes authentic human identity a scarce and valuable resource. Worldcoin's strategy is now focused on key markets for network effects and developing self-service Orb hardware to scale sustainably. The launch of World ID 4.0 introduces a fee model for issuers/protocols, creating a potential revenue stream. In conclusion, WLD's rise reflects a market reassessment. Worldcoin's ambition is to become the foundational identity layer and crucial entry point for the coming AI Agent economy, moving crypto narratives beyond finance into verifiable human identity.

marsbit1 ч. назад

From Identity Protocol to AI Gateway: How Ambitious Is World?

marsbit1 ч. назад

Торговля

Спот

Фьючерсы

Обсуждения

Добро пожаловать в Сообщество HTX. Здесь вы сможете быть в курсе последних новостей о развитии платформы и получить доступ к профессиональной аналитической информации о рынке. Мнения пользователей о цене на AI (AI) представлены ниже.

After Burning Tens of Billions of Dollars in Tokens, Silicon Valley Giants Start Limiting Employee Token Usage

Введение

"Tokenmaxxing," A Reflection of the Times

When CEOs Start Questioning ROI

Rational Return, or Prelude to Winter?

Связанные с этим вопросы

Похожее

Xiaohongshu's Second Great Voyage, This Time Sailing Towards AI

SharpLink CEO: How to Interpret Ethereum Developers Surpassing 1 Million?

SharpLink CEO: How to Understand Ethereum Developers Just Exceeded 1 Million?

A Clod of Chinese Soil Chokes Two Japanese Giants

From Identity Protocol to AI Gateway: How Ambitious Is World?

Торговля

Популярные статьи

AI Companions: Новое определение взаимодействия человека с ИИ

HTX Learn: пройдите обучение по "AI Companions" и разделите 10 000 USDT!

Неделя обучения по популярным токенам (2): 2026 может стать годом приложений реального времени, сектор AI продолжает оставаться в тренде

Обсуждения

Топ вопросы

Популярные категории

Популярные теги