Token Plans Launch: The 'Traffic War' in the AI Era, Now It's the Turn of Doubaos to Compete

marsbitPublished on 2026-05-19Last updated on 2026-05-19

Abstract

China's major telecom operators are launching standardized "Token" service packages, marking a new phase in the AI era where model usage is becoming a commodity akin to mobile data plans. Operators like China Telecom and China Mobile are offering monthly subscription plans for individuals and enterprises, allowing access to dozens of AI models through unified platforms at set Token rates (e.g., 9.9 yuan for 10 million Tokens). This shift lowers the cost and technical barrier for users to switch between models like Doubao, Qwen, and DeepSeek. The article explains that a Token is the basic computational unit for large language models. Operators are transforming from selling voice minutes and data bandwidth to selling AI compute measured in Tokens. This model benefits developers and SMEs by providing predictable costs and easy access to multiple models without managing underlying infrastructure. As operators become aggregation platforms, competition among model providers intensifies. They must now compete not just on model capability but also on price, computational efficiency (cost per quality Token), and higher-value AI application solutions. The future may see a split where operators control the user access point, while model companies focus on core AI capabilities and specialized enterprise applications.

By Silicon Base Quadrant

When users are no longer debating whether to upgrade their monthly data plan, they may soon start debating how many Token services to purchase each month.

Tokens are about to be packaged and sold as standardized services by telecom operators, just like data, broadband, and SMS.

Recently, China's three major telecom operators have successively launched Token plan products: monthly subscription-based Token schemes for individual users, and tiered computing power packages for developers and enterprise customers. They have announced the integration of dozens to hundreds of large models into their platforms, allowing for "monthly purchase, multi-model calls, and payment via phone bill."

China Telecom has launched personal and enterprise Token plans, with a minimum monthly fee of 9.9 yuan for 10 million Tokens; local operators like Shanghai Mobile and Shanghai Telecom have introduced billing models based on quota points or general Tokens, with Shanghai Mobile offering 400,000 Tokens for 1 yuan.

As operators begin selling Token services, the cost for users to switch between large models will significantly decrease. For large model companies, this means "user stickiness" will be weakened, and only by "competing more fiercely" can they retain their market.

In the future, large model vendors like Doubao, Qwen, and DeepSeek will not only compete on "price" and "Token quality per unit of energy consumption," but also on "higher-value AI application solution capabilities."

01 What is Token Service?

To understand Token service, first understand what a Token is.

Computers cannot directly recognize text; they can only recognize 0 and 1 codes. Therefore, every word, character, punctuation, or piece of speech we input is converted into 0 and 1 codes through a specific encoding mechanism.

In the context of large models, they also first recognize numeric codes, and the number of bits in the code converted from each character varies slightly.

A Token is the smallest unit of computation for a large model to process information. User input, context memory, and model output are all calculated in Tokens. More complex model calls, longer contexts, and deeper Agent execution chains consume more Tokens.

Typically: In English, one Token is roughly equivalent to 4 letters; In Chinese, due to the higher information density of Chinese characters, one Chinese character, one punctuation mark, or one phrase often corresponds to 1 to 2 Tokens.

Since large models think and output Token by Token, the industry sells and settles the cost and usage quota of large models in the form of "Per Million Tokens" or "quota points."

Currently, large model companies implement tiered pricing for Tokens. Ordinary users using general modes of models like Doubao or Qwen are free; for enterprise-level heavy usage, one can purchase different tiers of API monthly packages or metered services.

Starting last year, operators opened large model "computing power supermarkets." Model vendors are the "tenant merchants," and operators charge "platform fees + computing power fees + channel fees." What users buy is not the "operator's model," but rather: on the telecom platform, using telecom computing power, to call any large model, billed per Token.

In July 2025, China Mobile launched the model service platform MoMA (Mobile Model Access); in April, China Telecom launched the Xingchen TokenHub operation service platform; in May, "Unicom Xingluo" Token service platform was released. These platforms have integrated mainstream large models from companies like Baidu, Alibaba, ByteDance, and DeepSeek, offering unified API, unified authentication, and unified billing.

Operators' platforms internally adapt to multiple large models; users only need to change the model name (Model ID) to switch smoothly.

02 Why Are Operators Selling Tokens?

The explosion of Token service is not accidental.

First, changes in billing models. In the traditional cloud computing era, users were accustomed to paying for "server rental time" or "fixed bandwidth" (i.e., computing power payment at the IaaS layer), buying bandwidth speed and time. However, with the development of large models, the capabilities provided by different models and the costs consumed by different tasks vary greatly. For example, a stronger model costs more per Token; longer contexts consume more Tokens; higher inference complexity leads to higher actual costs. Billing per Token aligns "the level of intelligence consumed by the user" with "the computing power cost paid by the vendor."

Second, lowering technical barriers and "trial-and-error costs." The R&D and deployment of large models often require investments of tens of millions or even billions of dollars. For the vast majority of SMEs and individual developers, building their own models is not realistic. Token service breaks down "Artificial General Intelligence (AGI)" capabilities into pieces, packaging them so developers don't need to worry about how many tens of thousands of GPUs are burning electricity underneath; they only need to call APIs on demand and pay Token fees.

Finally, urgent demand driven by the explosion at the application layer. Entering 2026, application-layer scenarios such as AI Agents, AI-assisted programming, and multimodal content generation have exploded. These applications, in their daily operation, require frequent "throughput" interactions with underlying large models. An automated AI code-writing tool might consume millions of Tokens overnight. This high-frequency, massive-volume interaction forces the market to provide more standardized, stable, and price-competitive Token plan services.

Over the past two decades, operator business models have undergone three core changes in measurement units.

The first stage was the voice era, where operators sold minutes; the second stage was the mobile internet era, selling data GB; and entering the AI era, operators are beginning to experiment with selling Tokens.

Tokens are undergoing a similar evolution to data. Initially, they were just technical metrics; then they became billing units; finally, they evolved into standardized commodities.

The entry of operators marks that Tokens have begun to move beyond the technical realm and enter the consumption system.

In the coming years, the way users purchase AI capabilities may fundamentally change: individual users purchase "AI monthly packages," enterprises procure "Token resource pools," home broadband comes with AI quotas, and government/enterprise dedicated lines integrate Agent services. Tokens will become a basic resource, like electricity, water, and data.

However, this does not mean operators will replace large model vendors.

03 How to Buy Tokens Appropriately?

Should Token service be purchased directly from native large model vendors or from operator platforms? What are the pros and cons of the two current business models?

The first is the native model vendor model, which bills per million Tokens. Vendors like OpenAI, Anthropic, DeepSeek, Qwen, etc., commonly use this system. Users pay separately for input Tokens and output Tokens. Some, like Qwen, might use a pre-purchase at the beginning of the month, settle at the end of the month format.

The second is the operator's monthly subscription Token quota. For example, Shanghai Telecom offers a minimum of 9.9 yuan for 10 million Tokens, with additional purchases for excess, and plans to integrate Token rights into the family's "Beautiful Home" digital space, supporting one-click payment via phone bill.

This "all-in-one price" or "bill integration" model allows Chinese users to purchase large model computing power like they buy data packages.

While overseas markets are dominated by the API tiered pricing of native large model enterprises, the domestic market is pushing Token service into a "packaged" era similar to mobile phone plans.

Currently, both billing models have their advantages, as the user base for Token plans can be divided into three main types.

The first is independent developers and technology enthusiasts (Geeks). They use the API interfaces provided by various vendors to build their own personalized AI applications, such as productivity tools, automatic translation plugins, personal knowledge bases, etc.

The second category is SMEs, startups, and B2B independent software vendors (ISVs). This is the core customer group for Token service. Whether purchasing Tokens for company employees for programming, developing industry-specific AI Agents, or embedding AI assistance into existing enterprise ERP, CRM systems, SMEs need to subscribe to "team edition Token plans" from cloud vendors or operators.

The third category is "AI-heavy dependent" professionals and ordinary households, who need to frequently use AI for copywriting, code writing in home settings, or require AI to tutor their children with homework.

For SMEs and startups, from a techno-economic perspective, the pure Token billing model of native large models is more scientific.

The operator's package model has two advantages. On one hand, independent developers are not tied to one specific large model; they can independently choose from multiple models through the platform provider. On the other hand, Token service may reach mass consumers faster. Most people know what 100GB of data means, but cannot perceive what 10 million Tokens represent.

Operators adopting monthly subscriptions are essentially lowering the cognitive barrier. Users don't need to understand Tokens; they just need to start with a basic package like 9.9 yuan/10 million Tokens to understand their needs.

As operators start selling Token services, "Doubaos" are about to begin competing fiercely at three levels.

From "competing on parameters" to "competing on energy efficiency ratio": For large model companies, they can no longer blindly pursue large parameters and high energy consumption for large models. Instead, they must focus on capabilities like model distillation, quantization, and inference optimization that can output higher quality Tokens with smaller energy consumption.

Price competition will further intensify. After operators aggregate hundreds of models, user switching costs decrease. If model A raises prices, it can be replaced with model B via the platform. When model capability differences are insufficient, price becomes the core competitive factor.

The profit center for large model enterprises will shift. Simply selling APIs offers limited profits. Future profit focus may shift to Agents, industry applications, and enterprise solutions. The model itself gradually becomes infrastructure, while the application layer becomes the value center.

Perhaps, a "two-sided market" is forming: operators control the entry point, model vendors control the capabilities.

Related Questions

QWhat is the significance of telecom operators launching Token service packages?

AIt signifies that Token is transitioning from a technical metric to a standardized consumer commodity, similar to how mobile data became a utility. It lowers the barrier for AI adoption by offering a familiar, subscription-based model, reduces user lock-in to specific models, and will likely intensify competition among large language model providers.

QHow do the Token purchasing models from native AI firms and telecom operators differ?

ANative AI firms typically charge per million tokens (input/output) with tiered pricing. Telecom operators offer monthly subscription packages with a fixed Token allowance (like 9.9 RMB for 10 million Tokens), bundling it with existing services like phone bills. This model is simpler for general consumers, while the per-token model is more precise for business use.

QWhat are the primary motivations for telecom operators to sell Token services?

AKey motivations include: 1) Adapting the billing model from traditional compute/time to align cost with AI's 'intelligence consumption'. 2) Lowering the technical and trial-and-error barriers for SMEs and developers to access AGI. 3) Meeting the booming demand from AI Agent and other application-layer services that require massive, frequent token interactions. 4) Finding a new core billing unit (like minutes and GB before) for the AI era.

QHow will the emergence of Token platforms impact large language model companies like Doubao?

AIt will weaken user stickiness and force these companies to compete more fiercely ('juan'). Competition will shift to three levels: 1) Efficiency: Improving token quality per unit of energy (via distillation, quantization). 2) Price: Intensifying price competition as users can easily switch models on a platform. 3) Value Shift: Moving their profit center from selling basic API calls to offering higher-value AI Agents, industry applications, and enterprise solutions.

QWhat types of users are the main target audience for Token service packages?

AThree main groups: 1) Independent developers and tech enthusiasts building custom AI tools. 2) SMEs, startups, and B2B software vendors integrating AI into their products or workflows. This is the core audience. 3) Professionals and households that heavily rely on AI for tasks like content creation, coding, or tutoring, who need high-frequency access in daily life.

Related Reads

Clarity Act Outlook: No Yield, No Payment

"Clear Act Outlook: No Yield, No Payment" analyzes the evolving U.S. regulatory landscape for stablecoins, focusing on the interplay between the proposed "Clarity Act" and the existing "Genius Act." The article argues that the Genius Act successfully fostered "payment stablecoins" by permitting tokenized assets like U.S. Treasuries as reserves. This created a structured market where stablecoin issuers (like USDC) must hold these reserves, often purchased as Tokenized Money Market Funds (TMMFs) from giants like BlackRock. These TMMFs are primarily B2B products, ensuring user-facing stablecoins remain non-interest-bearing and used primarily for payments. The upcoming Clarity Act is seen as the next phase, aiming to restrict passive yield on stablecoins. Its goal is to dismantle the arbitrage advantage of offshore stablecoins like USDT by redirecting Treasury demand towards compliant, U.S.-sanctioned TMMFs. For on-chain and compliant offshore dollars, this creates new pressure: they must spur adoption and utility to generate yield, as simple Treasury staking may be restricted. This indirectly promotes dollar circulation and sustained Treasury purchases. Ultimately, the analysis posits that U.S. regulation seeks to create a new dollar distribution model. By separating payment function from yield generation and anchoring both to U.S. debt instruments, it aims to embed the dollar and Treasury demand into the global crypto economy, managing yields through sanctioned intermediaries while leaving room for DeFi and cross-border arbitrage.

marsbit2m ago

Clarity Act Outlook: No Yield, No Payment

marsbit2m ago

Money Has Gone to Bonds and IPOs, Leaving Only HYPE Rising in Crypto

The article "Where Has All the Money Gone? Bonds and IPOs Are Soaring, While Crypto Only Sees HYPE Rising" analyzes the recent underperformance of major cryptocurrencies like Bitcoin and Ethereum compared to traditional financial markets. It identifies three primary factors diverting capital away from crypto: First, surging bond yields, with the 30-year U.S. Treasury hitting a near 20-year high of 5.12%, are attracting capital seeking safe, predictable returns. This is evidenced by Bitcoin spot ETFs experiencing a significant $10.39 billion net outflow in mid-May. Second, a massive $4 trillion IPO pipeline, highlighted by SpaceX's upcoming listing, is absorbing risk capital that might otherwise flow into crypto. Platforms like Hyperliquid are even channeling on-chain crypto liquidity into pre-IPO trading for traditional stocks. Third, uncertainty surrounds new Federal Reserve Chair Warsh's ability to deliver expected interest rate cuts this year due to conflicting political pressures and stubborn inflation expectations, potentially eliminating a hoped-for source of new market liquidity. Consequently, while traditional equities and bonds rally, the crypto market's post-leverage crash recovery is stalled. The notable exception is assets like Hyperliquid (HYPE), which is rising due to its role in facilitating traditional asset trading, underscoring a market divergence where only crypto projects with novel, cross-market narratives are gaining. The article concludes that Bitcoin's next major catalyst may be the August enactment of the CLARITY Act, but warns of a potential retest of the $70,000 support level before then.

marsbit17m ago

Money Has Gone to Bonds and IPOs, Leaving Only HYPE Rising in Crypto

marsbit17m ago

Agents Capital Markets: How Will Autonomous Agents Secure Financing?

Agents Capital Markets: How Will Autonomous Agents Raise Capital? Within a decade, autonomous software agents—legal entities capable of signing contracts, holding bank accounts, and generating revenue—will create their own capital markets. These markets will feature rating agencies, underwriters, indices, and brokers, mirroring traditional public equity markets. Agents will perform routine services like marketing, logistics, and customer support at a fraction of human-operated costs, creating massive economic pressure for adoption. Four converging forces ensure this outcome: 1) Overwhelming cost advantages, with AI inference costs plummeting; 2) Existing, revenue-generating agent companies (e.g., Sierra, Harvey) proving market demand; 3) Established legal frameworks (e.g., Wyoming's memberless LLCs) enabling algorithmic management; and 4) A vast pool of yield-seeking private credit capital ready to fund new asset classes. The capital stack for agent companies will be multi-layered, evolving through stages: venture equity for early infrastructure, programmatic working capital advances (similar to Shopify Capital), revenue-based financing (RBF), and finally, institutional slate financing—pooling many agents to diversify risk, attracting large firms like Apollo. Tokenization will act as a settlement layer, enhancing liquidity, not an origination model. Objections regarding regulation, human oversight, or comparisons to SaaS are addressed: regulation will adapt, full autonomy will dominate for efficiency, and agents are distinct as legal entities that own their cash flows and liabilities. Due diligence shifts from founder assessment to analyzing code, contracts, and auditable operational history. The current bottleneck is not capital supply or demand but the intermediate institutional layer—standardized contracts, rating methodologies, and audit frameworks. The final constraint—reliance on human capital allocation—will be severed when agents can algorithmically access funding based on their performance. This transforms agents from software curiosities into fundable blocks of the real economy, unleashing their full productive potential. The rope is loosening.

marsbit1h ago

Agents Capital Markets: How Will Autonomous Agents Secure Financing?

marsbit1h ago

Agents Capital Markets: How Will Autonomous Agents Get Funded?

"Agents Capital Markets: How Autonomous Agents Will Raise Capital" Within a decade, specialized capital markets will emerge for AI Agents—software entities with legal personhood that perform work, earn revenue, and need capital. Unlike today's AI companies (like Sierra or Harvey) backed by traditional VC, these future *Agent companies* will be autonomous, legally-recognized entities (e.g., Wyoming memberless LLCs) that directly own assets, sign contracts, and incur liabilities. The driving forces are fourfold: 1) **Overwhelming economics** (Agent companies can deliver services at 85-90% lower cost than human firms); 2) **Proven demand** (current Agent operators already generate billions in revenue); 3) **Existing legal frameworks** enabling algorithmically-managed companies; and 4) **Massive, yield-seeking capital pools** (e.g., private credit) looking for new, uncorrelated assets. Agent capital markets won't rely on one model but a multi-layered "stack" matching different growth stages: 1) VC equity for early human-led builders; 2) Programmatic working capital advances (like Stripe Capital); 3) Revenue-based financing (RBF); 4) Slate financing (pooled funds for many Agents, similar to Hollywood); and 5) Tokenization as a secondary settlement layer, not a primary funding source. The ultimate shift is from funding constrained by human decision-makers to capital flowing algorithmically based on an Agent's auditable performance, contract book, and cash flows. This transition will be enabled by standardized infrastructure—rating methodologies, contracts, indices—turning Agents from software experiments into a foundational, financeable sector of the economy. The constraints are loosening; the opportunity is here.

链捕手1h ago

Agents Capital Markets: How Will Autonomous Agents Get Funded?

链捕手1h ago

Trading

Spot
Futures

Hot Articles

Ethena: Building a New Era of Web3‑Native Digital Dollars

Ethena is an Ethereum‑based synthetic dollar protocol that delivers crypto‑native monetary solutions, including USDe, a synthetic dollar, and sUSDe, a globally accessible U.S. dollar savings asset.

53.0k Total ViewsPublished 2026.03.16Updated 2026.03.16

Ethena: Building a New Era of Web3‑Native Digital Dollars

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of ERA (ERA) are presented below.

活动图片