Token Inefficient, Economy Tokenless

marsbitPubblicato 2026-06-05Pubblicato ultima volta 2026-06-05

Introduzione

The article "Tokens Aren't Economical, Economics Aren't Tokenized" analyzes a pivotal shift in the AI industry from a technology-driven narrative to one dominated by capital efficiency. It highlights two concurrent trends: a severe capital shortage due to the exorbitant and recurring costs of compute (e.g., OpenAI's high burn rate) and a wave of corporate spin-offs where major tech companies are separating their AI units (like Kuaishou's Kling and Baidu's Kunlunxin). The core argument is that AI's "anti-internet" business model, where user growth increases costs rather than profits, has created a disconnect between high valuations and actual cash flow. Spin-offs address this by allowing AI assets to be valued independently. Within a parent company, they are seen as cost centers, but as standalone entities, they are priced based on their growth potential and scarcity in the primary market, leading to massive valuation premiums (e.g., Kling's estimated value tripling post-spin-off). The industry is at an inflection point, moving from "model worship" to "value realization." The competition is evolving from a pure compute (GPU) race to a broader focus on systemic efficiency and full-stack engineering (involving CPUs and orchestration) to achieve viable commercialization. The year 2026 is framed as a critical moment where the industry must definitively answer how to economically translate AI capability into tangible business value, reshaping the sector's future power structure.

Recently, OpenAI's IPO plans have thrust the lab, long operating under a 'capped-profit' structure, into the spotlight of the public market. Meanwhile, Alphabet, Google's parent company, launched an $80 billion financing plan, with Berkshire Hathaway alone subscribing to $10 billion.

The conservative investment giant's first foray into tech stocks signals that the AI capital game has reached a new peak. Only now can we say that the AI industry is undergoing a profound paradigm shift.

The most direct manifestation is that "shortage of capital" and "spin-offs" have become the two dominant narratives for current AI companies.

The reason for the former is unsurprising: computing power is a heavy asset. Google's capital expenditure for 2026 is projected to be as high as $180 to $190 billion, with Microsoft, Meta, and Amazon also investing hundreds of billions. An H100 chip, a transformer for a data center, a grid connection – each step requires real money.

The latter has become a major strategic move for large domestic companies recently. Kuaishou's Kling AI had an internal valuation of around $6 billion within the group. After spinning off independently, its pre-money valuation skyrocketed to $18 billion, a threefold increase. Baidu's Kunlunxin was spun off for a separate listing, with external estimates suggesting it could contribute nearly $30 billion in incremental market cap to Baidu, equivalent to over 60% of its current total market value.

This phenomenon reflects the capital market's redefinition of AI assets. Within a conglomerate's consolidated financials, the AI business is seen as a profit-consuming investment. Once independent, it's priced based on the scarcity of the sector, revenue growth rate, and future potential, where price-to-sales ratios of several dozen times are not uncommon.

These two threads, seemingly independent, actually point to the same core: AI is transitioning from a technology-narrative-driven landscape to a new competitive environment dominated by capital efficiency.

The End of the Computing Power Race: The Breakdown and Reconfiguration of Financing Logic

Behind the "capital shortage" lies a fundamental logical chain. Today's large language model competition is essentially no longer a product competition but a heavy-asset race of computing power scale. OpenAI has committed to about $600 billion in future expenditure for computing power expansion. Even after completing a $122 billion financing round, this capital is expected to be depleted within three years.

More directly, OpenAI's CFO, John F. Ray, previously disclosed that while annualized revenue for 2025 has exceeded $20 billion, it's still insufficient to cover massive losses. The company incurs about $1.22 in losses for every $1 of revenue it generates.

The crux of the problem is that the cost curve of the AI business is fundamentally different from that of the traditional internet.

Adding one more user to WeChat brings marginal costs close to zero. But the more popular ChatGPT is, the more calls are made, and the higher the inference costs. User growth becomes pure benefit but also a cost pressure. This "anti-internet" business model means that economies of scale not only fail to bring profits but actually amplify cash flow pressure – user growth no longer directly equates to value growth.

Deeper still is the phenomenon of "circular accounting" in the AI era: Microsoft's $13 billion investment in OpenAI was not delivered in cash but in the form of "cloud credits." OpenAI uses these credits to train models, while Microsoft books them as new cloud revenue.

This closed-loop operation of "using investment to buy cloud services" appears on the surface as healthy revenue growth but is essentially paying oneself with one's own money and then classifying it as sales revenue. Estimates suggest OpenAI's annual cloud service bill has ballooned to over $60 billion, more than double its actual revenue of $25 billion.

This is the essential contradiction of the "capital shortage": the disconnect between valuation bubbles and actual cash flow. When investors start caring about "free cash flow" rather than "paper profits," the valuation system previously propped up by mutual investment promises and circular orders faces the risk of valuation deflation.

OpenAI plans for a $14 billion loss in 2026, expecting profitability only by 2029. Google's projected $180-$190 billion capital expenditure for 2026 indicates that the current AI "capital shortage" is not a cyclical liquidity issue but a dilemma of the entire business model at the capital structure level.

Why Is One Financial Statement Worth Three Times More?

One of the most noteworthy signals for 2026 is the concentrated spin-off of core AI assets by major tech companies.

Kuaishou's AI video product, Kling, plans a Pre-IPO round with a valuation of $20 billion, close to 70% of Kuaishou's parent company market cap. Concurrently, Baidu is pushing its AI chip company, Kunlunxin, towards a dual listing on A-share and Hong Kong markets, with 2025 revenue expected to exceed 3.5 billion RMB and achieve breakeven. Alibaba is reportedly planning to spin off Pingtouge, and ByteDance's Doubao could follow the same path at any time.

Think about it: before the spin-off, Morgan Stanley valued Kling at only around $6 billion. After the spin-off, targeting a $20 billion financing, the same assets, same revenue, same team – just by changing the financial statement – the valuation instantly differed by over three times.

The change in valuation logic here reveals a structural divide at the mechanism level: the primary market is different from the secondary market. Its game rules follow the unconventional pricing mechanism of "consensus determines value." The primary market looks at the future, sector positioning, imagination, and whether there will be buyers in the next round, but pays little attention to current profits and revenue.

The core logic for Kling's $20 billion valuation lies in the scarcity of such top-tier assets. After Sora's closure, the number of leading players in the AI video generation sector can be counted on one hand. The label of "AI infrastructure for the content industry" itself commands a premium.

So, what qualifies as a top-tier asset currently? In the current AI landscape, it's companies that possess self-developed foundation models (whether language, video, or multimodal), not wrapper or fine-tuning models; have proven at least one vertical scenario with large-scale users or revenue (not a demo, not proof-of-concept); and have "takeover expectations" for subsequent financing – either a strategic buyer (major tech company) or an IPO channel (US, Hong Kong, or A-share markets).

Companies meeting these three criteria globally can be counted on two hands: OpenAI, Anthropic, xAI, Google DeepMind (if independent), China's Zhipu AI, Moonshot AI, MiniMax, ByteDance's Doubao (if independent), Kuaishou's Kling (in spin-off process), Baidu's Kunlunxin (chip side). Each is a scarce target, each in a state of "buyers queuing, sellers raising prices."

The underlying logic of this "revaluation" is the cognitive shift of AI assets within major tech companies from "cost centers" to "value centers."

Within a conglomerate, the AI business is treated as part of the group's operations. Inside a major company, AI business is typically categorized as "strategic investment," meaning its costs (computing power, R&D, data annotation) are co-mingled with the group's mature cash flow businesses (like ads, e-commerce, gaming). The group CFO looks at consolidated statements. As long as the AI business is burning cash, it's constantly asked to explain "when it will contribute net profit."

In this context, AI teams are forced into short-term ROI justifications, and the valuation logic is naturally suppressed under the shadow of the group's overall P/E multiple – mature internet companies typically only get 10-15x P/E. Even high-growth businesses only enjoy a 20% premium, not the 3-5x P/S ratio of an independent sector.

Once spun off independently, the independent financial statements can redefine the boundaries of "costs" and "revenue." For example, the computing power costs previously consumed internally by the group can now be repriced at market rates as "related-party transaction revenue"; model training previously booked as R&D expenses can now be capitalized as "intangible assets" and amortized over periods.

In other words, these assets gain the pricing model of a "growth enterprise." The spun-off AI company can proceed with financing and strategy more flexibly, avoiding the constraints of internal resource allocation within the group, and receive independent pricing in the capital market based on its own growth prospects.

Simultaneously, this involves a further differentiation of valuation systems. The growth potential and forward-looking valuation of existing businesses in major companies, now tagged with AI, are beginning to command new premiums in the secondary market.

This also explains why traditional internet giants (like Baidu at $47.5 billion, Kuaishou at $27 billion) are being caught up with or even surpassed by AI newcomers in absolute market cap terms – Zhipu AI's latest market cap translates to approximately $58.6 billion, already surpassing Baidu to become China's ninth-largest AI tech stock.

From "Model Worship" to "Value Realization": The Industry Narrative Has Structurally Shifted

Some professionals suggest that the rapid development of the current AI era is reminiscent of the previous mobile internet explosion. This analogy is accurate, but the key difference lies in the essential difference in cost structures.

The mobile internet explosion relied on smartphone proliferation and continuously declining bandwidth costs – marginal costs trended downward. The AI explosion faces hard constraints like rising computing power costs, surging electricity consumption, and long data center construction cycles.

One observation is that the current AI industry is in a state of "85-degree water temperature – about to boil but not yet boiling."

The direction of technological breakthroughs (agents, multimodal) is clear, computing infrastructure investment is unprecedented (large US hyperscale cloud companies' 2026 capital expenditure will reach $805 billion, nearly double predictions from a year ago). But true commercialization, monetization, and adoption scale are still at a critical point of about to begin.

Currently, only a small fraction of CFOs saw actual financial value from AI in 2025, and even fewer Chinese enterprises achieved revenue growth through AI. This tension of "high investment, low return" is precisely the growing pain signal of the industry shifting from hype to practical implementation.

Many may not have noticed that the weight in the AI value chain has shifted from the GPU side to the entire system side. The latest Morgan Stanley research points out, "Agent AI marks a structural shift from computing to orchestration." In agent workflows, CPU-side orchestration time can account for 50% to 90% of total latency, leading to projections of an incremental $32.5 to $60 billion CPU market space by 2030.

This means the industry's core contradiction is shifting from "insufficient computing power" to "insufficient system efficiency." Correspondingly, the investment logic will expand from a "single-chip computing power race" to "full-stack system engineering." GPUs determine "if it can be done," but CPUs and systems determine "if it can be profitable."

If the mobile internet explosion was driven by "connectivity" as the core, then the AI explosion will likely have "intelligence" at its core, and the breadth of its value chain will likely surpass that of mobile internet, covering the entire chain of computing power, models, applications, and data.

Some economists point out that 2026 is becoming the singularity year for AI's leap from "assisting thinking" to "autonomous execution." The core contradiction at this stage is shifting from "who can train the strongest model" to "who can be the first to transform AI capabilities into tangible commercial value and user benefits in the most economical way, fastest speed, and widest coverage."

"It's not just about redefinition, but also about revaluation." Everything happening in the AI industry in 2026 – giants short on capital, frenzied financing, corporate spin-offs, IPO clusters – is essentially the concentrated release of the same capital logic: when the path of "burning money for growth" reaches its end, the industry must answer a most fundamental question: How much is this technology really worth?

The answer to this question will determine the power structure of the AI industry for the next decade. And 2026 is precisely the moment when this game of capital and technology fully unfolds.

This article is from WeChat public account "New Eyes" (ID: xinmouls), author: Li Xiaodong.

Domande pertinenti

QAccording to the article, what is the fundamental reason behind the 'lack of money' faced by AI companies like OpenAI?

AThe article states the fundamental reason is a structural flaw in the business model. Unlike traditional internet services with near-zero marginal costs, AI models like ChatGPT have costs that scale directly with usage and growth, creating negative cash flow. Furthermore, practices like 'circular accounting' (e.g., Microsoft investing 'cloud credits' which OpenAI spends, counting as Microsoft revenue) create valuation bubbles disconnected from actual cash flow, leaving companies reliant on continuous heavy investment.

QWhat valuation change occurred when Kuaishou's Kling AI was spun off, and what is the primary reason given for this dramatic increase?

AKuaishou's Kling AI was valued at around $6 billion within the parent company. Upon announcing its spin-off for independent financing, its pre-money valuation target jumped to approximately $20 billion, a more than three-fold increase. The primary reason is the shift in valuation logic. Within a conglomerate, AI is a cost center judged by its profit contribution. As an independent entity, it is valued as a scarce, high-growth asset based on its market potential, growth narrative, and scarcity, allowing for much higher price-to-sales multiples.

QThe article describes a shift in the core focus of the AI industry. What is this shift according to the analysis provided?

AThe analysis indicates a structural migration in the industry narrative from 'model supremacy' or 'model worship' to 'value realization.' The core contradiction is shifting from 'who can train the strongest model' to 'who can most efficiently, quickly, and broadly convert AI capabilities into tangible commercial value and user benefits.' The industry is moving from a technology narrative to a capital efficiency and business model-driven competitive landscape.

QWhat is the '85-degree water' analogy used in the article meant to describe about the current state of the AI industry?

AThe analogy of '85-degree water' is used to describe the current AI industry as being in a state of high tension and imminent change that has not yet fully materialized. It signifies that key directions like multi-modal AI and agents are clear, and massive infrastructure investment is underway. However, widespread commercial adoption and significant revenue generation are still at a critical tipping point, creating a phase of 'high investment, low return' before the industry fully 'boils over' into mainstream value creation.

QHow does the article contrast the cost structure of the AI boom with that of the mobile internet boom?

AThe article contrasts them as fundamentally opposite. The mobile internet boom was driven by falling marginal costs—smartphone普及 and decreasing bandwidth costs allowed user growth to scale profitably. In contrast, the AI boom faces rising marginal costs: compute (GPU) costs, soaring power consumption, and long data center construction cycles mean that more users and usage directly increase costs, challenging traditional scale economies.

Letture associate

Just now, DeepSeek V4 updates with DSpark, improving inference speed by 80%

DeepSeek has updated its DeepSeek V4 model with the DSpark speculative decoding framework, achieving a significant 60-85% speedup in generation for Flash models and 57-78% for Pro models while maintaining the same overall throughput. This engineering-focused update, rather than a core architectural change, introduces DSpark to address latency and throughput bottlenecks in high-concurrency production environments. DSpark combines high-throughput parallel generation with adaptive load-aware verification. Its key innovations include a semi-autoregressive generation architecture to model dependencies within token blocks and a hardware-aware confidence-scheduled verification system. This system uses a confidence head to predict token acceptance probabilities, allowing it to dynamically optimize verification length per request and allocate compute only to tokens with the highest expected payoff. The asynchronous scheduler is designed for real-world deployment, ensuring zero-overhead scheduling and continuous CUDA graph replay while preserving the target model's output distribution. In tests across mathematical reasoning, code generation, and daily dialogue, DSpark outperformed state-of-the-art models like Eagle3 and DFlash, increasing average acceptance length by 26.7%-30.9% and 16.3%-18.4% respectively on Qwen3 target models. DeepSeek also open-sourced DeepSpec, a full-stack codebase for training and evaluating speculative decoding draft models, providing a standardized toolkit that includes data preparation tools, model implementations, training code, and evaluation scripts.

marsbit3 h fa

Just now, DeepSeek V4 updates with DSpark, improving inference speed by 80%

marsbit3 h fa

BIT Research: The 2028 Halving Is Not the End, the Real Shake-Up of the Bitcoin Mining Industry Is Just Beginning

The Bitcoin mining industry is undergoing its most complex structural adjustment since inception. Despite Bitcoin's price holding near $61,000 and the network hash rate approaching a record 1 ZH/s, miner profitability is deteriorating. The industry is operating close to its breakeven point, with the 2028 halving expected to accelerate consolidation. The challenges extend beyond the halving's subsidy reduction; the industry's revenue model has yet to successfully transition towards a fee-driven structure. Increasingly, mining companies are evolving from simple Bitcoin producers into infrastructure and energy operators, including providers of AI/HPC computing power. Competition is shifting from pure hash rate expansion to business model upgrades. Economic pressure is evident. The theoretical daily mining revenue at current prices is around $78 million, yet the actual figure is only about $33 million—a 136% gap. Transaction fees remain low at roughly $220k daily, far below historical implied levels. With a current estimated industry-wide breakeven price near $65,000, mining alone is struggling to generate ideal profits. The 2028 halving is projected to push the fundamental production cost floor to approximately $93,289. This will likely accelerate a shift towards consolidation among larger, well-capitalized miners with diversified revenue streams. Competitive advantage will belong to institutionalized players with access to low-cost energy, AI/HPC hosting operations, and stronger balance sheets. In essence, Bitcoin mining is transitioning from a "mining business" to an "infrastructure business." Future profitability and resilience will depend less on block rewards and more on diversified income sources like energy management and computational infrastructure services. For investors, the key question is not the halving itself, but which miners can successfully navigate this business model transformation.

marsbit4 h fa

BIT Research: The 2028 Halving Is Not the End, the Real Shake-Up of the Bitcoin Mining Industry Is Just Beginning

marsbit4 h fa

This is How God Karpathy Uses Claude?

Andrej Karpathy, a prominent figure in AI, has reportedly joined Anthropic, leading to a noticeable decrease in his open-source contributions and social media activity. A document claiming to be his personal "CLAUDE.md" file—a set of instructions for the Claude AI to follow within a specific codebase—has been circulating online. While its authenticity is unverified, the content aligns closely with Karpathy's publicly shared principles on effective AI-assisted programming. The document outlines key rules for AI coding assistants, emphasizing the importance of reading existing code thoroughly before writing new code to maintain consistency. It advises against over-engineering, advocating for simple, surgical modifications that match the project's existing style. Other guidelines include clarifying assumptions upfront, writing meaningful tests, thoughtful debugging, and carefully considering dependencies. The core message is that these principles help prevent common AI coding failures, such as introducing unnecessary abstractions, style drift, or making invisible architectural decisions. The community has noted that even experts like Karpathy require detailed instructions to guide AI effectively, akin to managing a junior developer. A related GitHub repository, "andrej-karpathy-skills," which encapsulates these ideas, is reported to significantly reduce Claude's code error rate. Ultimately, the advice stresses that the best CLAUDE.md is tailored to one's own tech stack and coding practices.

marsbit4 h fa

This is How God Karpathy Uses Claude?

marsbit4 h fa

Trading

Spot
活动图片