From Subsidies to Token-Based Pricing to Price Cuts: Is OpenAI Sparking a Price War? Is the Inflection Point for Token Economics Nearing?

marsbit | Published 2026-06-11 | Updated 2026-06-11

Summary

The commercialization of generative AI is facing a critical inflection point as a potential price war looms. According to The Wall Street Journal, OpenAI is considering a significant cut to its token fees to compete with rival Anthropic, signaling a shift from a growth-at-all-costs model focused on token consumption. This move comes as both companies, reportedly losing billions on compute, prepare for IPOs, and as enterprise customers face "bill shock" from switching to usage-based token billing. Reports indicate poor ROI, with one analysis finding only 18 cents of every dollar spent on AI tokens generates user-facing value. The industry's initial phases—from flat-rate subscriptions to aggressive subsidies—have given way to a reckoning with real costs. Analysts debate the future: some predict a bifurcation between premium, high-cost models for complex tasks and cheaper alternatives for routine work, while others believe overall spending will still rise as agentic AI increases tokens per task. Notably, Chinese model DeepSeek's low-cost API is gaining traction with U.S. enterprises, adding competitive pressure. The core challenge is redefining value beyond token volume ("tokenmaxxing") toward measurable productivity ("valuemaxxing"), as the entire AI value chain, from cloud providers to chipmakers, feels the ripple effects of unsustainable pricing.

When a real Token price war erupts, how will the AI industry make money? The entire valuation logic for AI commercialization is at a moment where it needs to be rewritten. The era of competing on "cost-effectiveness" and "scarcity" may have arrived. For OpenAI, "the situation is deteriorating further." Analysis points out that "if OpenAI goes into decline, it could very likely drag down Nvidia, Oracle, CoreWeave, and others."

The commercialization narrative of generative AI is facing its most profound self-examination in three years. From exchanging subsidies for users, to monthly subscription plans that hide costs, to Token-based billing exposing enterprise bill crises, the AI industry has completed a three-stage leap in commercialization within three years—and a potential price war could reset this entire monetization logic back to zero.

According to The Wall Street Journal, OpenAI is considering significantly lowering the Token fees it charges users, in order to compete for enterprise clients from rival Anthropic. Sources familiar with the matter stated that this move is partly to "seize the initiative," as OpenAI expects Anthropic to take similar price-cutting actions. OpenAI CEO Sam Altman recently admitted at an event that AI usage costs have become "a huge problem," and said the company will "help people get more value for less money."

The timing of this news is particularly sensitive. OpenAI confidentially filed for an IPO this week, while Anthropic is also in the final countdown to its own listing. Meanwhile, Bloomberg's Silicon Data LLM Token Expenditure Index has fallen for seven consecutive trading days, its longest losing streak since January this year, reflecting the market's deep-seated anxiety about the sustainability of AI bills. The report bluntly states that a price war would directly erode the profit margins of both companies, and both are already losing tens of billions of dollars due to the massive computing power required by AI systems.

The core of this discussion is no longer just a single price-cut decision, but a more fundamental question: As the narrative of "more Token consumption is better" reaches its end, who will tell the next commercialization story for the AI industry, and how will it be told?

01

The Initial Three Stages: From Monthly Subscriptions to Token Bills

The commercialization of generative AI has undergone a clear three-phase evolution in just three years.

Stage One: Monthly and annual subscriptions set the industry baseline. In February 2023, OpenAI launched ChatGPT Plus at $20 per month, pioneering paid consumer access to large models; Baidu, Alibaba, and Tencent followed suit, making fixed-fee subscriptions the standard early-stage business model.

Stage Two: The subsidy war erupted in full force. To boost ARR (Annual Recurring Revenue), the core anchor for financing valuations, major players turned to large-scale subsidies: Google offered students 15 months of Gemini Advanced for free, OpenAI launched a Team membership plan at $1 for the first month, ByteDance's Doubao entered the market with pricing "99.3% lower than the industry standard," and Baidu announced its core models would be free. The essence of subsidies was trading losses for growth—reportedly, Microsoft lost an average of over $20 per user per month under the GitHub Copilot subscription model, with some heavy users costing up to $80 a month in losses.

Stage Three: The forced switch to usage-based billing. On June 1, 2026, Microsoft announced that all GitHub Copilot plans would officially transition to Token-based billing, converting the $19 monthly fee directly into an equivalent Token credit. This change brought the true costs long hidden by subscription models into the open—according to user calculations in the Reddit community, a single agent programming session could consume $30 to $40, depleting a monthly plan in just one use.
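The gap between the credit and a single session can be put in rough numbers. The sketch below assumes the $19 fee converts one-to-one into a dollar-denominated credit and takes the $30 to $40 per-session range from the Reddit calculations; the function names are illustrative and not part of any GitHub API.

```python
MONTHLY_CREDIT_USD = 19.0               # monthly fee converted to credit (from the article)
SESSION_COST_RANGE_USD = (30.0, 40.0)   # per-session cost reported by Reddit users

def sessions_covered(credit_usd: float, session_cost_usd: float) -> float:
    """Fraction of one agent session (or number of sessions) the credit pays for."""
    return credit_usd / session_cost_usd

def overage(credit_usd: float, session_cost_usd: float) -> float:
    """Part of a single session's cost not covered by the credit (0 if fully covered)."""
    return max(0.0, session_cost_usd - credit_usd)

for cost in SESSION_COST_RANGE_USD:
    share = sessions_covered(MONTHLY_CREDIT_USD, cost)
    extra = overage(MONTHLY_CREDIT_USD, cost)
    print(f"${cost:.0f} session: credit covers {share:.0%}, overage ${extra:.2f}")
```

Under these assumptions the credit covers less than one full session at either end of the range.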

02

Bills Out of Control: When Tokens Cost More Than People

The implementation of Token-based pay-as-you-go billing fully revealed the true face of enterprise AI expenditure.

The numbers on the enterprise side are staggering. Uber COO Andrew Macdonald publicly stated in May 2026 that the link between the growth in Token consumption and substantive product improvement "does not yet exist," and even coined a term for it: "tokenmaxxing," describing employees performing valueless tasks to boost usage metrics.

More direct data: Uber exhausted its annual Token budget in just the first four months of 2026; Salesforce expects to pay Anthropic around $300 million for the full year.

Anthropic's own developer documentation shows that developers using Claude Code incur an average cost of about $13 per workday, with 90% of users having daily costs below $30. Annualized over roughly 252 workdays, a 10-person development team would spend about $32,760 per year on Token fees at the average rate, and up to about $75,600 at the $30 level.
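The annualized team figure can be checked with a few lines of arithmetic. This is a sketch, not an official calculator: the 252 workdays per year is an assumption (the article does not state it), chosen because it reproduces the $75,600 figure at $30 per day.

```python
# Assumption: 252 working days per year; the article does not state this value.
WORKDAYS_PER_YEAR = 252

def annual_team_cost(daily_cost_usd: float, team_size: int,
                     workdays: int = WORKDAYS_PER_YEAR) -> float:
    """Yearly Token spend for a team where every developer pays daily_cost_usd per workday."""
    return daily_cost_usd * team_size * workdays

average_case = annual_team_cost(13, 10)  # $13/day average from the article
upper_case = annual_team_cost(30, 10)    # $30/day, the level 90% of users stay below

print(f"Average case: ${average_case:,.0f} per year")
print(f"Upper case:   ${upper_case:,.0f} per year")
```

Changing `workdays` or `team_size` shows how sensitive the total is to headcount and usage intensity.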

The return on investment is equally alarming. Enterprise data platform Entelligence.AI, after aggregating data from 2,444 companies, found that for every $1 spent on AI Token fees, only 18 cents generated actual value that reached users; 44 cents were used to fix bugs introduced by the AI itself, 27 cents went towards rework, and 11 cents were consumed by review friction.
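One way to read this breakdown is as the effective spend needed to deliver one dollar of user-facing value. The sketch below uses only the four cent figures quoted above; the category keys and function names are illustrative.

```python
# Cents of each Token dollar by destination, as quoted in the article.
BREAKDOWN_CENTS = {
    "user_facing_value": 18,
    "fixing_ai_bugs": 44,
    "rework": 27,
    "review_friction": 11,
}

def allocate(bill_usd: float) -> dict[str, float]:
    """Split a Token bill across the four categories."""
    return {name: bill_usd * cents / 100 for name, cents in BREAKDOWN_CENTS.items()}

def spend_per_dollar_of_value() -> float:
    """Token dollars spent for each dollar that reaches users."""
    return 100 / BREAKDOWN_CENTS["user_facing_value"]

print(allocate(1000.0))
print(f"Spend per $1 of delivered value: ${spend_per_dollar_of_value():.2f}")
```

The four shares sum to 100 cents, so on these figures an enterprise spends more than five Token dollars for each dollar of value that reaches users.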

Faced with runaway bills, enterprises have begun proactive control measures. Amazon halted internal AI usage leaderboards, instructing employees "not to use AI just for the sake of using it"; Microsoft plans to gradually phase out Claude Code subscriptions for employees in some key product divisions. Goldman Sachs notes that some companies' spending on AI Tokens already accounts for 10% of their total employee labor costs, and this proportion could rise further in the coming quarters. This isn't about demand disappearing, but the end of the era of reckless AI spending.

03

Act Four: Price War Ignited, OpenAI Considers Major Price Cuts

It is against this backdrop that the fuse for a price war was lit.

According to The Wall Street Journal, Altman's consideration of price cuts was directly triggered by the pressure to catch up with Anthropic. Anthropic's revenue has grown significantly recently, its programming tool Claude Code has become popular among software engineers, and this five-year-old startup's valuation has even surpassed OpenAI's for the first time.

However, the cost of this price war will be exceptionally heavy. If prices are significantly lowered, it will further compress the already negative profit margins of both companies, and the competitive landscape offers very limited room for maneuver.

And a fundamental risk long identified by investors is that OpenAI and Anthropic's products are highly substitutable; clients can easily switch from one to the other—meaning price cuts, even if they retain customers in the short term, cannot truly build a moat and only delay market share loss.

This dilemma also transmits outwards through the financial cycle between cloud computing giants and AI labs.

According to corporate disclosure documents compiled by The Information, OpenAI and Anthropic together account for over half of the approximately $2 trillion in future cloud service commitments from Microsoft, Oracle, Google, and Amazon. If price cuts lead to downward revisions in revenue expectations, this transmission chain will face pressure from both ends.

American cognitive scientist and AI expert Gary Marcus said: "This further exposes the fragility of OpenAI and indicates how serious the predicament it faces is. If OpenAI goes into decline, it could very likely drag down companies like Nvidia, Oracle, CoreWeave. The situation is deteriorating rapidly."

A divergence of views is playing out openly on Wall Street. JPMorgan TMT analyst Mark Schilsky believes the current billing anxiety is merely a "minimum speed bump on the road to higher spending": if the average price per million tokens falls, but the penetration rate of paid AI among US companies continues to rise, overall token usage will mathematically increase substantially; coupled with agentic AI pushing single-task token consumption to several times that of traditional Q&A modes, long-term total expenditure is expected to be significantly higher than current levels.

Goldman Sachs semiconductor analyst Jim Covello holds a more pessimistic view, believing that the current industry chain prosperity has directed almost all value towards semiconductor companies, a phenomenon "unprecedented in history and unsustainable." Once enterprises face the true price of usage-based billing, the capital flows supporting GPU procurement and model training could face a reversal.

04

Act Five: The Next Story for Token Economics?

After the price war, the next chapter of AI industry commercialization has yet to be written, but its outline is emerging.

A report from Citadel Securities offers a directional framework: tiered pricing and charging based on scarcity. Its core logic is that inference-intensive frontier AI won't disappear but will increasingly concentrate in the hands of a few large enterprises capable of bearing the compute costs; for the broader range of enterprises, simpler models might be a more productive path until physical constraints ease. This implies AI usage will move towards stratification—high-value, complex tasks will continue to use frontier models, while routine and batch tasks will shift towards cheaper or local models.

JPMorgan holds a relatively optimistic judgment: even if the per-unit token price falls, the proliferation of agentic AI will multiply token consumption per task—existing data shows that after agentification, token consumption per task can become 3.5 times the original—potentially still expanding the overall expenditure scale. The current billing anxiety might just be a "minimum speed bump on the road to higher spending."
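The argument reduces to a simple product: total spend scales with the unit price times the tokens consumed per task times the number of tasks. The sketch below is a toy model under that assumption, not JPMorgan's own methodology; the 50% price cut is a hypothetical input, and only the 3.5x multiplier comes from the article.

```python
def relative_spend(price_cut: float, tokens_per_task_multiplier: float,
                   task_growth: float = 1.0) -> float:
    """Total spend relative to today (1.0 = unchanged) after a fractional price cut."""
    return (1.0 - price_cut) * tokens_per_task_multiplier * task_growth

def break_even_price_cut(tokens_per_task_multiplier: float,
                         task_growth: float = 1.0) -> float:
    """Fractional price cut at which total spend stays exactly flat."""
    return 1.0 - 1.0 / (tokens_per_task_multiplier * task_growth)

AGENTIC_MULTIPLIER = 3.5  # tokens per task after agentification (from the article)

print(f"Spend after a hypothetical 50% cut: {relative_spend(0.5, AGENTIC_MULTIPLIER):.2f}x")
print(f"Break-even price cut: {break_even_price_cut(AGENTIC_MULTIPLIER):.1%}")
```

With a constant task count, prices would have to fall by more than about 71% before total spend declines; any growth in the number of tasks pushes that threshold higher.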

Nebius Chief Revenue Officer Marc Boroditsky proposed the concept of "valuemaxxing," advocating for the industry to shift from pursuing Token consumption maximization to making every Token truly generate value. This direction is gradually becoming an industry consensus—but true commercial implementation still requires AI labs to find a pricing system that both reflects the true cost and is acceptable to enterprise clients. This is precisely the core unresolved proposition in all current debates.

However, perhaps the most overlooked variable in this price war is Chinese models.

According to June data from American corporate spend management platform Ramp, DeepSeek has topped the list for growth in US enterprise software subscriptions. Ramp Chief Economist Ara Kharazian specifically emphasized that this is not about local deployment of open-source models, but rather that "enterprises are directly sending and receiving data through DeepSeek," indicating real, paid, direct usage. He admitted that he "did not anticipate US companies would go and use DeepSeek." According to third-party estimates, DeepSeek V4-Pro's API price is about one-tenth that of GPT-5.5 and about one-eleventh that of Claude Opus 4.7.

As the two tigers, OpenAI and Anthropic, fight it out, the ultimate beneficiary might be the player that has long embedded "accessible pricing" into its DNA and doesn't need to answer to IPO investors about profit margins. This may not be the most popular ending for this price war, but it is becoming an increasingly hard-to-ignore reality.

This article is from the WeChat public account "Hard AI," author: Xu Chao

Related Questions

Q: According to the article, what are the three main stages of generative AI's commercialization evolution?

A: The article outlines three stages: 1) Monthly/annual subscription plans establishing the baseline, like ChatGPT Plus. 2) A comprehensive subsidy war to drive user growth and valuations, with companies offering free or heavily discounted access. 3) The mandatory switch to usage-based (Token) billing, which exposed the true costs previously hidden by flat-rate subscriptions.

Q: What is 'tokenmaxxing' as mentioned in the article, and what problem does it highlight for businesses?

A: 'Tokenmaxxing' is a term coined by Uber's COO, describing the behavior where employees perform valueless tasks just to increase AI Token usage. It highlights the problem that increased Token consumption does not necessarily correlate with meaningful productivity gains or product improvements, leading to wasteful and uncontrolled AI spending for enterprises.

Q: Why is a potential price war between OpenAI and Anthropic described as particularly costly and risky in the article?

A: A price war is risky because both companies are already losing billions on the massive compute power required for AI systems. Lowering prices would further squeeze their negative profit margins. Furthermore, their products are highly substitutable, meaning price cuts may temporarily retain customers but do not build a sustainable competitive moat, potentially just delaying market share loss without solving the core profitability issue.

Q: What new pricing or usage frameworks are suggested as potential next chapters for AI commercialization after a price war?

A: The article suggests frameworks like tiered pricing based on scarcity/value, where advanced, compute-intensive models are reserved for high-value tasks at large enterprises, while simpler, cheaper models handle routine tasks. Another concept is 'valuemaxxing': shifting the industry focus from maximizing Token consumption to ensuring each Token delivers actual value. JPMorgan also suggests that the rise of agentic AI could multiply Token use per task, potentially offsetting unit price declines.

Q: How does the article characterize the role of Chinese AI models, specifically DeepSeek, in the current competitive landscape?

A: The article presents Chinese models like DeepSeek as a significant and often overlooked variable. It notes DeepSeek V4-Pro's API is priced at roughly one-tenth of GPT-5.5 and one-eleventh of Claude Opus 4.7. Data shows DeepSeek topping U.S. enterprise software subscription growth, indicating real paid adoption. The article suggests that while OpenAI and Anthropic compete, the ultimate beneficiary might be a player like DeepSeek, which has 'accessible pricing' in its DNA and less immediate pressure to show profitability to IPO investors.

