Token Plans Launch: The 'Traffic War' in the AI Era, Now It's the Turn of Doubaos to Compete

marsbitОпубликовано 2026-05-19Обновлено 2026-05-19

Введение

China's major telecom operators are launching standardized "Token" service packages, marking a new phase in the AI era where model usage is becoming a commodity akin to mobile data plans. Operators like China Telecom and China Mobile are offering monthly subscription plans for individuals and enterprises, allowing access to dozens of AI models through unified platforms at set Token rates (e.g., 9.9 yuan for 10 million Tokens). This shift lowers the cost and technical barrier for users to switch between models like Doubao, Qwen, and DeepSeek. The article explains that a Token is the basic computational unit for large language models. Operators are transforming from selling voice minutes and data bandwidth to selling AI compute measured in Tokens. This model benefits developers and SMEs by providing predictable costs and easy access to multiple models without managing underlying infrastructure. As operators become aggregation platforms, competition among model providers intensifies. They must now compete not just on model capability but also on price, computational efficiency (cost per quality Token), and higher-value AI application solutions. The future may see a split where operators control the user access point, while model companies focus on core AI capabilities and specialized enterprise applications.

By Silicon Base Quadrant

When users are no longer debating whether to upgrade their monthly data plan, they may soon start debating how many Token services to purchase each month.

Tokens are about to be packaged and sold as standardized services by telecom operators, just like data, broadband, and SMS.

Recently, China's three major telecom operators have successively launched Token plan products: monthly subscription-based Token schemes for individual users, and tiered computing power packages for developers and enterprise customers. They have announced the integration of dozens to hundreds of large models into their platforms, allowing for "monthly purchase, multi-model calls, and payment via phone bill."

China Telecom has launched personal and enterprise Token plans, with a minimum monthly fee of 9.9 yuan for 10 million Tokens; local operators like Shanghai Mobile and Shanghai Telecom have introduced billing models based on quota points or general Tokens, with Shanghai Mobile offering 400,000 Tokens for 1 yuan.

As operators begin selling Token services, the cost for users to switch between large models will significantly decrease. For large model companies, this means "user stickiness" will be weakened, and only by "competing more fiercely" can they retain their market.

In the future, large model vendors like Doubao, Qwen, and DeepSeek will not only compete on "price" and "Token quality per unit of energy consumption," but also on "higher-value AI application solution capabilities."

01 What is Token Service?

To understand Token service, first understand what a Token is.

Computers cannot directly recognize text; they can only recognize 0 and 1 codes. Therefore, every word, character, punctuation, or piece of speech we input is converted into 0 and 1 codes through a specific encoding mechanism.

In the context of large models, they also first recognize numeric codes, and the number of bits in the code converted from each character varies slightly.

A Token is the smallest unit of computation for a large model to process information. User input, context memory, and model output are all calculated in Tokens. More complex model calls, longer contexts, and deeper Agent execution chains consume more Tokens.

Typically: In English, one Token is roughly equivalent to 4 letters; In Chinese, due to the higher information density of Chinese characters, one Chinese character, one punctuation mark, or one phrase often corresponds to 1 to 2 Tokens.

Since large models think and output Token by Token, the industry sells and settles the cost and usage quota of large models in the form of "Per Million Tokens" or "quota points."

Currently, large model companies implement tiered pricing for Tokens. Ordinary users using general modes of models like Doubao or Qwen are free; for enterprise-level heavy usage, one can purchase different tiers of API monthly packages or metered services.

Starting last year, operators opened large model "computing power supermarkets." Model vendors are the "tenant merchants," and operators charge "platform fees + computing power fees + channel fees." What users buy is not the "operator's model," but rather: on the telecom platform, using telecom computing power, to call any large model, billed per Token.

In July 2025, China Mobile launched the model service platform MoMA (Mobile Model Access); in April, China Telecom launched the Xingchen TokenHub operation service platform; in May, "Unicom Xingluo" Token service platform was released. These platforms have integrated mainstream large models from companies like Baidu, Alibaba, ByteDance, and DeepSeek, offering unified API, unified authentication, and unified billing.

Operators' platforms internally adapt to multiple large models; users only need to change the model name (Model ID) to switch smoothly.

02 Why Are Operators Selling Tokens?

The explosion of Token service is not accidental.

First, changes in billing models. In the traditional cloud computing era, users were accustomed to paying for "server rental time" or "fixed bandwidth" (i.e., computing power payment at the IaaS layer), buying bandwidth speed and time. However, with the development of large models, the capabilities provided by different models and the costs consumed by different tasks vary greatly. For example, a stronger model costs more per Token; longer contexts consume more Tokens; higher inference complexity leads to higher actual costs. Billing per Token aligns "the level of intelligence consumed by the user" with "the computing power cost paid by the vendor."

Second, lowering technical barriers and "trial-and-error costs." The R&D and deployment of large models often require investments of tens of millions or even billions of dollars. For the vast majority of SMEs and individual developers, building their own models is not realistic. Token service breaks down "Artificial General Intelligence (AGI)" capabilities into pieces, packaging them so developers don't need to worry about how many tens of thousands of GPUs are burning electricity underneath; they only need to call APIs on demand and pay Token fees.

Finally, urgent demand driven by the explosion at the application layer. Entering 2026, application-layer scenarios such as AI Agents, AI-assisted programming, and multimodal content generation have exploded. These applications, in their daily operation, require frequent "throughput" interactions with underlying large models. An automated AI code-writing tool might consume millions of Tokens overnight. This high-frequency, massive-volume interaction forces the market to provide more standardized, stable, and price-competitive Token plan services.

Over the past two decades, operator business models have undergone three core changes in measurement units.

The first stage was the voice era, where operators sold minutes; the second stage was the mobile internet era, selling data GB; and entering the AI era, operators are beginning to experiment with selling Tokens.

Tokens are undergoing a similar evolution to data. Initially, they were just technical metrics; then they became billing units; finally, they evolved into standardized commodities.

The entry of operators marks that Tokens have begun to move beyond the technical realm and enter the consumption system.

In the coming years, the way users purchase AI capabilities may fundamentally change: individual users purchase "AI monthly packages," enterprises procure "Token resource pools," home broadband comes with AI quotas, and government/enterprise dedicated lines integrate Agent services. Tokens will become a basic resource, like electricity, water, and data.

However, this does not mean operators will replace large model vendors.

03 How to Buy Tokens Appropriately?

Should Token service be purchased directly from native large model vendors or from operator platforms? What are the pros and cons of the two current business models?

The first is the native model vendor model, which bills per million Tokens. Vendors like OpenAI, Anthropic, DeepSeek, Qwen, etc., commonly use this system. Users pay separately for input Tokens and output Tokens. Some, like Qwen, might use a pre-purchase at the beginning of the month, settle at the end of the month format.

The second is the operator's monthly subscription Token quota. For example, Shanghai Telecom offers a minimum of 9.9 yuan for 10 million Tokens, with additional purchases for excess, and plans to integrate Token rights into the family's "Beautiful Home" digital space, supporting one-click payment via phone bill.

This "all-in-one price" or "bill integration" model allows Chinese users to purchase large model computing power like they buy data packages.

While overseas markets are dominated by the API tiered pricing of native large model enterprises, the domestic market is pushing Token service into a "packaged" era similar to mobile phone plans.

Currently, both billing models have their advantages, as the user base for Token plans can be divided into three main types.

The first is independent developers and technology enthusiasts (Geeks). They use the API interfaces provided by various vendors to build their own personalized AI applications, such as productivity tools, automatic translation plugins, personal knowledge bases, etc.

The second category is SMEs, startups, and B2B independent software vendors (ISVs). This is the core customer group for Token service. Whether purchasing Tokens for company employees for programming, developing industry-specific AI Agents, or embedding AI assistance into existing enterprise ERP, CRM systems, SMEs need to subscribe to "team edition Token plans" from cloud vendors or operators.

The third category is "AI-heavy dependent" professionals and ordinary households, who need to frequently use AI for copywriting, code writing in home settings, or require AI to tutor their children with homework.

For SMEs and startups, from a techno-economic perspective, the pure Token billing model of native large models is more scientific.

The operator's package model has two advantages. On one hand, independent developers are not tied to one specific large model; they can independently choose from multiple models through the platform provider. On the other hand, Token service may reach mass consumers faster. Most people know what 100GB of data means, but cannot perceive what 10 million Tokens represent.

Operators adopting monthly subscriptions are essentially lowering the cognitive barrier. Users don't need to understand Tokens; they just need to start with a basic package like 9.9 yuan/10 million Tokens to understand their needs.

As operators start selling Token services, "Doubaos" are about to begin competing fiercely at three levels.

From "competing on parameters" to "competing on energy efficiency ratio": For large model companies, they can no longer blindly pursue large parameters and high energy consumption for large models. Instead, they must focus on capabilities like model distillation, quantization, and inference optimization that can output higher quality Tokens with smaller energy consumption.

Price competition will further intensify. After operators aggregate hundreds of models, user switching costs decrease. If model A raises prices, it can be replaced with model B via the platform. When model capability differences are insufficient, price becomes the core competitive factor.

The profit center for large model enterprises will shift. Simply selling APIs offers limited profits. Future profit focus may shift to Agents, industry applications, and enterprise solutions. The model itself gradually becomes infrastructure, while the application layer becomes the value center.

Perhaps, a "two-sided market" is forming: operators control the entry point, model vendors control the capabilities.

Связанные с этим вопросы

QWhat is the significance of telecom operators launching Token service packages?

AIt signifies that Token is transitioning from a technical metric to a standardized consumer commodity, similar to how mobile data became a utility. It lowers the barrier for AI adoption by offering a familiar, subscription-based model, reduces user lock-in to specific models, and will likely intensify competition among large language model providers.

QHow do the Token purchasing models from native AI firms and telecom operators differ?

ANative AI firms typically charge per million tokens (input/output) with tiered pricing. Telecom operators offer monthly subscription packages with a fixed Token allowance (like 9.9 RMB for 10 million Tokens), bundling it with existing services like phone bills. This model is simpler for general consumers, while the per-token model is more precise for business use.

QWhat are the primary motivations for telecom operators to sell Token services?

AKey motivations include: 1) Adapting the billing model from traditional compute/time to align cost with AI's 'intelligence consumption'. 2) Lowering the technical and trial-and-error barriers for SMEs and developers to access AGI. 3) Meeting the booming demand from AI Agent and other application-layer services that require massive, frequent token interactions. 4) Finding a new core billing unit (like minutes and GB before) for the AI era.

QHow will the emergence of Token platforms impact large language model companies like Doubao?

AIt will weaken user stickiness and force these companies to compete more fiercely ('juan'). Competition will shift to three levels: 1) Efficiency: Improving token quality per unit of energy (via distillation, quantization). 2) Price: Intensifying price competition as users can easily switch models on a platform. 3) Value Shift: Moving their profit center from selling basic API calls to offering higher-value AI Agents, industry applications, and enterprise solutions.

QWhat types of users are the main target audience for Token service packages?

AThree main groups: 1) Independent developers and tech enthusiasts building custom AI tools. 2) SMEs, startups, and B2B software vendors integrating AI into their products or workflows. This is the core audience. 3) Professionals and households that heavily rely on AI for tasks like content creation, coding, or tutoring, who need high-frequency access in daily life.

Похожее

The Age of Decoupling Has Arrived: Bitcoin is No Longer the Sole Compass of Crypto

The era of the cryptocurrency market moving in lockstep with Bitcoin is ending, as the industry splits into two distinct asset categories: endogenous and exogenous. Endogenous assets, like Bitcoin, derive value purely from the crypto market's cycles. Their narratives swing between being "interstellar money" in bull markets and "digital collectibles" in bear markets. Exogenous assets, however, are nominally crypto but operate with independent value drivers. Examples include: * **Venice:** An AI inference service using tokens for payments; its consumer-AI business model is decoupled from crypto price swings. * **Figure:** A fintech lender using blockchain to speed up loan approvals; its core value is in credit, not crypto. * **Stablecoin firms like BVNK:** Acquired by traditional finance giants (Mastercard, Stripe), their growth is tied to payment infrastructure, not market cycles. Hybrid projects like **Hyperliquid** (a decentralized exchange) show a shift, with a growing share of non-crypto trading (e.g., prediction markets). This divergence is fundamental. Endogenous assets remain highly correlated to Bitcoin, similar to gold miners to gold. Exogenous assets are evolving to have their own fundamentals, like the weak correlation between gold and the S&P 500. This changes investment analysis. Evaluating exogenous assets requires traditional fundamental research—assessing user bases, unit economics, and moats—more akin to fintech investing than charting Bitcoin. Promising exogenous sectors include: on-chain exchanges/brokers, AI-crypto fusion, privacy-focused digital banks, lending (institutional/private credit), stablecoins/real-world asset tokenization, payment rails, and non-financial crypto-consumer products. Currently, investing via equity is often safer than via tokens, as token value accrual mechanisms need further regulatory and industry development (e.g., the CLARITY Act). Nonetheless, the core trend is clear: crypto market drivers are diversifying from a single factor (Bitcoin) to multiple fundamentals, ending the era of uniform market moves.

marsbit34 мин. назад

The Age of Decoupling Has Arrived: Bitcoin is No Longer the Sole Compass of Crypto

marsbit34 мин. назад

Five Cryptos That Could Outperform Bitcoin Over the Next Cycle Due To Higher Growth Velocity

Bitcoin's growth often sets market trends, but analysts believe the next cycle's highest percentage gains may come from assets with greater growth velocity. While Bitcoin provides stability, several cryptocurrencies are positioned for stronger relative upside. This article highlights five such assets, with a particular focus on Ozak AI as the potential high-growth standout of the cycle. Ethereum (ETH) is noted for its ongoing evolution and institutional adoption. Solana (SOL) is recognized for its high throughput and history of sharp rallies. Chainlink (LINK) is highlighted as essential infrastructure for DeFi and AI applications. Avalanche (AVAX) is mentioned for its subnet architecture and enterprise potential. Ozak AI ($OZ) is presented as a distinct early-stage opportunity, currently in presale at $0.014 with a target listing price of $1.00. The project is building a full AI-native blockchain ecosystem, including prediction agents, a data stream network, and structured data vaults. Analysts suggest its early valuation stage and focus on AI infrastructure could allow for exponential growth velocity compared to more mature assets like Bitcoin, which requires massive capital inflows for significant price movement. The final takeaway positions Ozak AI as a high-asymmetry bet for investors seeking exponential upside alongside more stable assets.

TheNewsCrypto1 ч. назад

Five Cryptos That Could Outperform Bitcoin Over the Next Cycle Due To Higher Growth Velocity

TheNewsCrypto1 ч. назад

What's New in Jensen Huang's 'Agent Factory'?

In a keynote at COMPUTEX 2026, NVIDIA CEO Jensen Huang shifted the company's focus from hardware "full-stack" solutions to the era of AI Agents. The centerpiece is the Vera Rubin platform, now in production, which is designed specifically for Agent workloads and offers 10x the efficiency of its predecessor. The platform features the new Vera CPU, built for AI, and incorporates Spectrum-X Ethernet Photonics with CPO technology for improved networking and energy efficiency. NVIDIA introduced DSX, an integrated toolkit for designing, simulating, and operating AI data centers, aiming to streamline "AI factory" deployment and management. For end-user deployment, the company unveiled DGX Station for Windows, a desktop AI supercomputer for running Agents locally, and the RTX Spark SoC for AI PCs. On the software front, NVIDIA launched the 550B-parameter Nemotron 3 Ultra model for enterprise Agents and the Cosmos 3 foundation model for physical AI, unifying visual reasoning and action prediction. In robotics, a partnership with Unitree yielded the H2 Plus, a reference humanoid robot built on the Isaac GR00T platform to lower development barriers. Security was emphasized with enhanced confidential computing for Vera Rubin and new data path security features for the BlueField-4 STX storage platform. The presentation highlighted a strategic pivot: NVIDIA is reorganizing its entire technology stack—from chips and data centers to models, software, and robots—around the emerging ecosystem of autonomous, practical AI Agents.

marsbit1 ч. назад

What's New in Jensen Huang's 'Agent Factory'?

marsbit1 ч. назад

Торговля

Спот
Фьючерсы

Популярные статьи

Как купить ERA

Добро пожаловать на HTX.com! Мы сделали приобретение Caldera (ERA) простым и удобным. Следуйте нашему пошаговому руководству и отправляйтесь в свое крипто-путешествие.Шаг 1: Создайте аккаунт на HTXИспользуйте свой адрес электронной почты или номер телефона, чтобы зарегистрироваться и бесплатно создать аккаунт на HTX. Пройдите удобную регистрацию и откройте для себя весь функционал.Создать аккаунтШаг 2: Перейдите в Купить криптовалюту и выберите свой способ оплатыКредитная/Дебетовая Карта: Используйте свою карту Visa или Mastercard для мгновенной покупки Caldera (ERA).Баланс: Используйте средства с баланса вашего аккаунта HTX для простой торговли.Третьи Лица: Мы добавили популярные способы оплаты, такие как Google Pay и Apple Pay, для повышения удобства.P2P: Торгуйте напрямую с другими пользователями на HTX.Внебиржевая Торговля (OTC): Мы предлагаем индивидуальные услуги и конкурентоспособные обменные курсы для трейдеров.Шаг 3: Хранение Caldera (ERA)После приобретения вами Caldera (ERA) храните их в своем аккаунте на HTX. В качестве альтернативы вы можете отправить их куда-либо с помощью перевода в блокчейне или использовать для торговли с другими криптовалютами.Шаг 4: Торговля Caldera (ERA)С легкостью торгуйте Caldera (ERA) на спотовом рынке HTX. Просто зайдите в свой аккаунт, выберите торговую пару, совершайте сделки и следите за ними в режиме реального времени. Мы предлагаем удобный интерфейс как для начинающих, так и для опытных трейдеров.

721 просмотров всегоОпубликовано 2025.07.17Обновлено 2026.06.01

Как купить ERA

Обсуждения

Добро пожаловать в Сообщество HTX. Здесь вы сможете быть в курсе последних новостей о развитии платформы и получить доступ к профессиональной аналитической информации о рынке. Мнения пользователей о цене на ERA (ERA) представлены ниже.

活动图片