Token Plans Launch: The 'Traffic War' in the AI Era, Now It's the Turn of Doubaos to Compete

marsbitPublicado em 2026-05-19Última atualização em 2026-05-19

Resumo

China's major telecom operators are launching standardized "Token" service packages, marking a new phase in the AI era where model usage is becoming a commodity akin to mobile data plans. Operators like China Telecom and China Mobile are offering monthly subscription plans for individuals and enterprises, allowing access to dozens of AI models through unified platforms at set Token rates (e.g., 9.9 yuan for 10 million Tokens). This shift lowers the cost and technical barrier for users to switch between models like Doubao, Qwen, and DeepSeek. The article explains that a Token is the basic computational unit for large language models. Operators are transforming from selling voice minutes and data bandwidth to selling AI compute measured in Tokens. This model benefits developers and SMEs by providing predictable costs and easy access to multiple models without managing underlying infrastructure. As operators become aggregation platforms, competition among model providers intensifies. They must now compete not just on model capability but also on price, computational efficiency (cost per quality Token), and higher-value AI application solutions. The future may see a split where operators control the user access point, while model companies focus on core AI capabilities and specialized enterprise applications.

By Silicon Base Quadrant

When users are no longer debating whether to upgrade their monthly data plan, they may soon start debating how many Token services to purchase each month.

Tokens are about to be packaged and sold as standardized services by telecom operators, just like data, broadband, and SMS.

Recently, China's three major telecom operators have successively launched Token plan products: monthly subscription-based Token schemes for individual users, and tiered computing power packages for developers and enterprise customers. They have announced the integration of dozens to hundreds of large models into their platforms, allowing for "monthly purchase, multi-model calls, and payment via phone bill."

China Telecom has launched personal and enterprise Token plans, with a minimum monthly fee of 9.9 yuan for 10 million Tokens; local operators like Shanghai Mobile and Shanghai Telecom have introduced billing models based on quota points or general Tokens, with Shanghai Mobile offering 400,000 Tokens for 1 yuan.

As operators begin selling Token services, the cost for users to switch between large models will significantly decrease. For large model companies, this means "user stickiness" will be weakened, and only by "competing more fiercely" can they retain their market.

In the future, large model vendors like Doubao, Qwen, and DeepSeek will not only compete on "price" and "Token quality per unit of energy consumption," but also on "higher-value AI application solution capabilities."

01 What is Token Service?

To understand Token service, first understand what a Token is.

Computers cannot directly recognize text; they can only recognize 0 and 1 codes. Therefore, every word, character, punctuation, or piece of speech we input is converted into 0 and 1 codes through a specific encoding mechanism.

In the context of large models, they also first recognize numeric codes, and the number of bits in the code converted from each character varies slightly.

A Token is the smallest unit of computation for a large model to process information. User input, context memory, and model output are all calculated in Tokens. More complex model calls, longer contexts, and deeper Agent execution chains consume more Tokens.

Typically: In English, one Token is roughly equivalent to 4 letters; In Chinese, due to the higher information density of Chinese characters, one Chinese character, one punctuation mark, or one phrase often corresponds to 1 to 2 Tokens.

Since large models think and output Token by Token, the industry sells and settles the cost and usage quota of large models in the form of "Per Million Tokens" or "quota points."

Currently, large model companies implement tiered pricing for Tokens. Ordinary users using general modes of models like Doubao or Qwen are free; for enterprise-level heavy usage, one can purchase different tiers of API monthly packages or metered services.

Starting last year, operators opened large model "computing power supermarkets." Model vendors are the "tenant merchants," and operators charge "platform fees + computing power fees + channel fees." What users buy is not the "operator's model," but rather: on the telecom platform, using telecom computing power, to call any large model, billed per Token.

In July 2025, China Mobile launched the model service platform MoMA (Mobile Model Access); in April, China Telecom launched the Xingchen TokenHub operation service platform; in May, "Unicom Xingluo" Token service platform was released. These platforms have integrated mainstream large models from companies like Baidu, Alibaba, ByteDance, and DeepSeek, offering unified API, unified authentication, and unified billing.

Operators' platforms internally adapt to multiple large models; users only need to change the model name (Model ID) to switch smoothly.

02 Why Are Operators Selling Tokens?

The explosion of Token service is not accidental.

First, changes in billing models. In the traditional cloud computing era, users were accustomed to paying for "server rental time" or "fixed bandwidth" (i.e., computing power payment at the IaaS layer), buying bandwidth speed and time. However, with the development of large models, the capabilities provided by different models and the costs consumed by different tasks vary greatly. For example, a stronger model costs more per Token; longer contexts consume more Tokens; higher inference complexity leads to higher actual costs. Billing per Token aligns "the level of intelligence consumed by the user" with "the computing power cost paid by the vendor."

Second, lowering technical barriers and "trial-and-error costs." The R&D and deployment of large models often require investments of tens of millions or even billions of dollars. For the vast majority of SMEs and individual developers, building their own models is not realistic. Token service breaks down "Artificial General Intelligence (AGI)" capabilities into pieces, packaging them so developers don't need to worry about how many tens of thousands of GPUs are burning electricity underneath; they only need to call APIs on demand and pay Token fees.

Finally, urgent demand driven by the explosion at the application layer. Entering 2026, application-layer scenarios such as AI Agents, AI-assisted programming, and multimodal content generation have exploded. These applications, in their daily operation, require frequent "throughput" interactions with underlying large models. An automated AI code-writing tool might consume millions of Tokens overnight. This high-frequency, massive-volume interaction forces the market to provide more standardized, stable, and price-competitive Token plan services.

Over the past two decades, operator business models have undergone three core changes in measurement units.

The first stage was the voice era, where operators sold minutes; the second stage was the mobile internet era, selling data GB; and entering the AI era, operators are beginning to experiment with selling Tokens.

Tokens are undergoing a similar evolution to data. Initially, they were just technical metrics; then they became billing units; finally, they evolved into standardized commodities.

The entry of operators marks that Tokens have begun to move beyond the technical realm and enter the consumption system.

In the coming years, the way users purchase AI capabilities may fundamentally change: individual users purchase "AI monthly packages," enterprises procure "Token resource pools," home broadband comes with AI quotas, and government/enterprise dedicated lines integrate Agent services. Tokens will become a basic resource, like electricity, water, and data.

However, this does not mean operators will replace large model vendors.

03 How to Buy Tokens Appropriately?

Should Token service be purchased directly from native large model vendors or from operator platforms? What are the pros and cons of the two current business models?

The first is the native model vendor model, which bills per million Tokens. Vendors like OpenAI, Anthropic, DeepSeek, Qwen, etc., commonly use this system. Users pay separately for input Tokens and output Tokens. Some, like Qwen, might use a pre-purchase at the beginning of the month, settle at the end of the month format.

The second is the operator's monthly subscription Token quota. For example, Shanghai Telecom offers a minimum of 9.9 yuan for 10 million Tokens, with additional purchases for excess, and plans to integrate Token rights into the family's "Beautiful Home" digital space, supporting one-click payment via phone bill.

This "all-in-one price" or "bill integration" model allows Chinese users to purchase large model computing power like they buy data packages.

While overseas markets are dominated by the API tiered pricing of native large model enterprises, the domestic market is pushing Token service into a "packaged" era similar to mobile phone plans.

Currently, both billing models have their advantages, as the user base for Token plans can be divided into three main types.

The first is independent developers and technology enthusiasts (Geeks). They use the API interfaces provided by various vendors to build their own personalized AI applications, such as productivity tools, automatic translation plugins, personal knowledge bases, etc.

The second category is SMEs, startups, and B2B independent software vendors (ISVs). This is the core customer group for Token service. Whether purchasing Tokens for company employees for programming, developing industry-specific AI Agents, or embedding AI assistance into existing enterprise ERP, CRM systems, SMEs need to subscribe to "team edition Token plans" from cloud vendors or operators.

The third category is "AI-heavy dependent" professionals and ordinary households, who need to frequently use AI for copywriting, code writing in home settings, or require AI to tutor their children with homework.

For SMEs and startups, from a techno-economic perspective, the pure Token billing model of native large models is more scientific.

The operator's package model has two advantages. On one hand, independent developers are not tied to one specific large model; they can independently choose from multiple models through the platform provider. On the other hand, Token service may reach mass consumers faster. Most people know what 100GB of data means, but cannot perceive what 10 million Tokens represent.

Operators adopting monthly subscriptions are essentially lowering the cognitive barrier. Users don't need to understand Tokens; they just need to start with a basic package like 9.9 yuan/10 million Tokens to understand their needs.

As operators start selling Token services, "Doubaos" are about to begin competing fiercely at three levels.

From "competing on parameters" to "competing on energy efficiency ratio": For large model companies, they can no longer blindly pursue large parameters and high energy consumption for large models. Instead, they must focus on capabilities like model distillation, quantization, and inference optimization that can output higher quality Tokens with smaller energy consumption.

Price competition will further intensify. After operators aggregate hundreds of models, user switching costs decrease. If model A raises prices, it can be replaced with model B via the platform. When model capability differences are insufficient, price becomes the core competitive factor.

The profit center for large model enterprises will shift. Simply selling APIs offers limited profits. Future profit focus may shift to Agents, industry applications, and enterprise solutions. The model itself gradually becomes infrastructure, while the application layer becomes the value center.

Perhaps, a "two-sided market" is forming: operators control the entry point, model vendors control the capabilities.

Perguntas relacionadas

QWhat is the significance of telecom operators launching Token service packages?

AIt signifies that Token is transitioning from a technical metric to a standardized consumer commodity, similar to how mobile data became a utility. It lowers the barrier for AI adoption by offering a familiar, subscription-based model, reduces user lock-in to specific models, and will likely intensify competition among large language model providers.

QHow do the Token purchasing models from native AI firms and telecom operators differ?

ANative AI firms typically charge per million tokens (input/output) with tiered pricing. Telecom operators offer monthly subscription packages with a fixed Token allowance (like 9.9 RMB for 10 million Tokens), bundling it with existing services like phone bills. This model is simpler for general consumers, while the per-token model is more precise for business use.

QWhat are the primary motivations for telecom operators to sell Token services?

AKey motivations include: 1) Adapting the billing model from traditional compute/time to align cost with AI's 'intelligence consumption'. 2) Lowering the technical and trial-and-error barriers for SMEs and developers to access AGI. 3) Meeting the booming demand from AI Agent and other application-layer services that require massive, frequent token interactions. 4) Finding a new core billing unit (like minutes and GB before) for the AI era.

QHow will the emergence of Token platforms impact large language model companies like Doubao?

AIt will weaken user stickiness and force these companies to compete more fiercely ('juan'). Competition will shift to three levels: 1) Efficiency: Improving token quality per unit of energy (via distillation, quantization). 2) Price: Intensifying price competition as users can easily switch models on a platform. 3) Value Shift: Moving their profit center from selling basic API calls to offering higher-value AI Agents, industry applications, and enterprise solutions.

QWhat types of users are the main target audience for Token service packages?

AThree main groups: 1) Independent developers and tech enthusiasts building custom AI tools. 2) SMEs, startups, and B2B software vendors integrating AI into their products or workflows. This is the core audience. 3) Professionals and households that heavily rely on AI for tasks like content creation, coding, or tutoring, who need high-frequency access in daily life.

Leituras Relacionadas

Vitalik's Latest Long Read: In the AI Era, How Can Code Become More Secure?

Vitalik Buterin explores the role of formal verification as a critical tool for software security, especially in the AI era and for blockchain systems. He defines formal verification as using machine-checkable mathematical proofs to verify that code meets specified properties, moving beyond manual auditing. The article highlights that while AI can generate code and find vulnerabilities rapidly, it also makes formal verification more accessible by assisting in writing proofs. This is crucial for Ethereum's complex components like STARKs, ZK-EVMs, consensus algorithms, and high-performance EVM implementations, where bugs can lead to irreversible losses. Vitalik argues that formal verification enables a powerful "separation of concerns": AI can write highly optimized (e.g., assembly) code for efficiency, while a separate, human-readable specification defines correctness. A machine-checked proof then verifies their equivalence. This paradigm can create a more secure "trusted core" of software. However, he cautions that formal verification is not a panacea. "Proven correctness" depends on the accuracy of the specifications and proofs themselves, which can be wrong or incomplete. Risks include unverified code sections, hardware-level side-channel attacks, and overlooked assumptions. The true goal is not absolute proof but increased confidence through redundant expressions of intent—using code, tests, types, and formal proofs—and automatically checking their consistency. The article concludes that AI and formal verification are complementary: AI enables scale, while verification ensures accuracy. For critical systems, this combination offers a path toward stronger security in a future with powerful AI adversaries, helping to maintain the defensive advantage essential for a decentralized internet.

marsbitHá 3m

Vitalik's Latest Long Read: In the AI Era, How Can Code Become More Secure?

marsbitHá 3m

海面收比特币，海下扬言断网，伊朗正“绞杀”美元体系

Odaily星球日报Há 8m

IOSG: After the Number of Developers Halved, Crypto Did Not Die

The crypto development community has undergone a significant transformation, with monthly active developers on GitHub halving from 45K in 2022 to approximately 23K by 2026. This decline is largely attributed to the departure of newcomers, whose roles were often tied to market-driven hype cycles like NFTs and forked DeFi protocols, leading to a 52% churn rate among those with less than a year of experience. However, the core of the industry has strengthened. Established developers with over two years of experience have reached a record high, contributing about 70% of the code. They are consolidating around ecosystems with genuine users and revenue, such as Bitcoin and Solana, while moving away from narrative-driven projects. The talent shift represents a "deleveraging" and an increase in core density. This core group has developed a unique skillset by operating in an environment of "code is law," with zero tolerance for error and no external recourse. They have learned to build trust and functional systems from the ground up without central authorities, as demonstrated by protocols like Uniswap and MakerDAO. These capabilities are now being repriced and leveraged in the AI era. The structural challenges of AI scaling—such as trust, coordination, and verification—mirror those long addressed in crypto. Examples include CoreWeave pivoting from GPU mining to AI compute, OpenSea's founder applying NFT market logic to AI model routing with OpenRouter, and projects like NEAR and Catena Labs transitioning crypto-native architectural and financial insights into AI infrastructure and agent banking. Key areas where crypto-bred skills are directly applicable to AI include: 1. **Compute Aggregation & Optimization**: Using token incentives and cryptographic verification (e.g., Proof of Sampling & Privacy) to create trusted, decentralized GPU networks, as seen with Hyperbolic. 2. **AI Governance & Incentive Design**: Applying economic mechanism design from DAOs and tokenomics to align the goals of multiple, fast-acting AI agents, a direction explored by EigenLayer's EigenCloud. 3. **AI Agent Autonomous Payments**: Leveraging stablecoins and programmable, permissionless blockchains to enable the micro-transactions required for AI agent economies, exemplified by protocols like x402. The role of the crypto builder is evolving from writing smart contracts to designing trust mechanisms for autonomous AI systems. This convergence is reflected in hiring trends at major firms and significant capital allocation from funds like Paradigm and a16z crypto, which are investing at the intersection of crypto and AI. Regional differences exist, with the US favoring foundational protocol innovation and Asia focusing on compliant application-layer integration, but the underlying trend is clear. The industry's "deleveraging" has not signaled its demise but rather a maturation, positioning its core builders to solve critical trust and coordination problems in the age of AI.

marsbitHá 31m

IOSG: After the Number of Developers Halved, Crypto Did Not Die

marsbitHá 31m

Currency and Stock Market Barometer: Strategy Invested Over $2 Billion to Buy Over 24,800 BTC Last Week; Bitmine's ETH Holdings Increase to 4.37% of Total Supply (May 19)

Crypto & Stock Market Watch: Institutional BTC Buying Surges, ETH Holdings Grow Major listed companies aggressively accumulated Bitcoin last week, with net purchases skyrocketing over 44x to $2.03 billion. Strategy (formerly MicroStrategy) led the charge, spending approximately $2.01 billion to buy 24,869 BTC, bringing its total holdings to 843,738 BTC. Overall, listed firms (excluding miners) now hold 1,113,841 BTC, valued at ~$86.16 billion. On the Ethereum front, Bitmine purchased 71,672 ETH in the past week. It now holds 5,278,462 ETH, worth $11.56 billion and representing 4.37% of ETH's total supply. A significant portion (4,712,917 ETH) is staked, generating an annualized yield of $289 million. Industry leaders note a divergence from the MicroStrategy model, with ETH treasury firms increasingly focusing on staking yields and simpler balance sheets. In traditional markets, Morgan Stanley warns of a potential significant U.S. stock market correction if bond yields and volatility continue rising. Investment giants like Berkshire Hathaway and Bridgewater adjusted portfolios in Q1, with Bridgewater notably increasing its stakes in chipmakers like Nvidia, Broadcom, and Micron while shedding software stocks. Among other crypto-focused public companies, Solana treasury firm Upexi reported a widened net loss of $109 million for its fiscal Q3, driven by a decline in its crypto holdings' value. Meanwhile, Hyperion DeFi, a HYPE token treasury company, reported a Q1 net profit of $8.8 million and increased its HYPE holdings past 2 million tokens.

marsbitHá 32m

Currency and Stock Market Barometer: Strategy Invested Over $2 Billion to Buy Over 24,800 BTC Last Week; Bitmine's ETH Holdings Increase to 4.37% of Total Supply (May 19)

marsbitHá 32m

24-Year-Old "Wall Street Newcomer" Portfolio Adjustments Revealed: Shorts Chips Heavily in Q1, Bullish on Energy and AI Infrastructure

A 24-year-old Wall Street prodigy, Leopold Aschenbrenner, has disclosed the Q1 portfolio adjustments for his fund, Situational Awareness LP. The fund's assets under management skyrocketed from $5.52 billion to $13.7 billion. The most significant move was a massive bearish bet on the semiconductor sector. The fund established $8.46 billion in put options, targeting chipmakers like NVIDIA ($1.6B in puts) and the VanEck Semiconductor ETF (SMH, $2B in puts). This bearish stance extended to Broadcom, Oracle, AMD, Micron, ASML, Intel, Corning, and TSMC. However, the fund made a selective bullish exception, adding shares and call options for memory chip maker SanDisk. The fund maintained its core bullish thesis on energy and AI infrastructure. Bloom Energy remained its largest long equity holding. It also increased stakes in cryptocurrency mining/data center firms like CleanSpark, Riot Platforms, Applied Digital, and IREN Limited, viewing them as providers of critical ready-to-use infrastructure—land, power, and grid permits—for AI expansion. The 13F filing was submitted one day late. Overall, the fund's strategy involves substantial bearish semiconductor bets while maintaining concentrated, high-volatility investments in selective tech, computing, and infrastructure plays aligned with AI growth.

marsbitHá 51m

24-Year-Old "Wall Street Newcomer" Portfolio Adjustments Revealed: Shorts Chips Heavily in Q1, Bullish on Energy and AI Infrastructure

marsbitHá 51m

Trading

Spot

Futuros

Artigos em Destaque

Como comprar ERA

Bem-vindo à HTX.com!Tornámos a compra de Caldera (ERA) simples e conveniente.Segue o nosso guia passo a passo para iniciar a tua jornada no mundo das criptos.Passo 1: cria a tua conta HTXUtiliza o teu e-mail ou número de telefone para te inscreveres numa conta gratuita na HTX.Desfruta de um processo de inscrição sem complicações e desbloqueia todas as funcionalidades.Obter a minha contaPasso 2: vai para Comprar Cripto e escolhe o teu método de pagamentoCartão de crédito/débito: usa o teu visa ou mastercard para comprar Caldera (ERA) instantaneamente.Saldo: usa os fundos da tua conta HTX para transacionar sem problemas.Terceiros: adicionamos métodos de pagamento populares, como Google Pay e Apple Pay, para aumentar a conveniência.P2P: transaciona diretamente com outros utilizadores na HTX.Mercado de balcão (OTC): oferecemos serviços personalizados e taxas de câmbio competitivas para os traders.Passo 3: armazena teu Caldera (ERA)Depois de comprar o teu Caldera (ERA), armazena-o na tua conta HTX.Alternativamente, podes enviá-lo para outro lugar através de transferência blockchain ou usá-lo para transacionar outras criptomoedas.Passo 4: transaciona Caldera (ERA)Transaciona facilmente Caldera (ERA) no mercado à vista da HTX.Acede simplesmente à tua conta, seleciona o teu par de trading, executa as tuas transações e monitoriza em tempo real.Oferecemos uma experiência de fácil utilização tanto para principiantes como para traders experientes.

459 Visualizações TotaisPublicado em {updateTime}Atualizado em 2025.07.17

Discussões

Bem-vindo à Comunidade HTX. Aqui, pode manter-se informado sobre os mais recentes desenvolvimentos da plataforma e obter acesso a análises profissionais de mercado. As opiniões dos utilizadores sobre o preço de ERA (ERA) são apresentadas abaixo.