Huawei Cloud Rejects Token Price War, Zhou Yuefeng Seeks a New Winning Formula for AI Cloud

marsbit | Published on 2026-06-06 | Last updated on 2026-06-06

Abstract

At the 2026 Huawei Cloud INSPIRE Creator Conference, CEO Zhou Yuefeng outlined Huawei Cloud's distinct strategy in the competitive AI cloud market. Instead of engaging in price wars based on token volume or MaaS revenue (a common focus for rivals like Alibaba Cloud and ByteDance's Volcano Engine), Huawei Cloud is shifting the competition towards real-world productivity gains. Zhou highlighted three core differentiators: a fully domestic computing stack (Ascend, Kunpeng), a focus on government and enterprise clients rather than consumer internet, and a deep commitment to open-source ecosystems. To this end, Huawei Cloud launched a suite of new products under the "Agentic Infra" paradigm, including the AICS Lingqu computing cluster, AMS memory storage, and the ModelArts Next platform. These aim to solve enterprise challenges in deploying AI agents, such as latency, memory, scheduling, and security. The strategy further involves creating specialized industry zones ("AI Dream Factories") for sectors like healthcare and embodied intelligence. For example, a smart medical zone developed with Shanghai Ruijin Hospital aims to democratize expert-level diagnostic capabilities. In essence, Huawei Cloud is positioning itself not as a commodity token provider, but as the foundational infrastructure for industrial AI, leveraging its domestic supply chain and hybrid cloud solutions to serve sectors where productivity, not just scale, is the ultimate measure of value.

"I don't care much about the total amount of Tokens, nor do I care much about the total revenue." At the Huawei Cloud INSPIRE Creator Conference held on June 5, 2026, Zhou Yuefeng, Director of Huawei and CEO of Huawei Cloud, gave his first media interview since taking office, clearly and unequivocally conveying the current strategic focus of Huawei Cloud.

This is a rare statement in the current Chinese AI cloud market.

Over the past six months, cloud vendors represented by Alibaba Cloud and Volcano Engine have continuously emphasized the narrative of AI cloud, using daily Token call volume and MaaS revenue scale as new growth anchors. Even large model vendors like MoonShot AI, DeepSeek, and Zhipu have repeatedly lowered inference prices. The industry's key focus has been model call volume and scale.

Huawei Cloud is choosing a different way to enter this crowded battlefield. In a single launch, it released its densest batch of AI-oriented products since last year: the AICS Lingqu Intelligent Computing Cluster, AMS Agentic Memory Storage, CCE Volcano Next (an integrated general-purpose and AI scheduling engine), the AgentSphere Secure Autonomous Operation Foundation, ModelArts Next, and the enterprise-grade intelligent agent platform AgentArts (open-source version: openJiuwen). Together, these were presented as a new "Agentic Infra" paradigm.

The KPI Zhou Yuefeng defined for Huawei Cloud is not Token count, but "whether each Token truly enhances productivity." In a window when domestic computing power supply is limited and business models are being reshaped, Huawei Cloud has pulled itself out of the race for "second place in the AI cloud."

Not Competing on Token Scale

At the press meeting, Zhou Yuefeng gave a rare direct answer on how Huawei Cloud differs from Alibaba Cloud and Volcano Engine. He cited three differences.

First, the computing power route is different. Huawei Cloud uses an entirely domestically developed computing power hardware and software system, including Ascend, Kunpeng, CANN, and Euler. This path is more challenging because Huawei cannot use others' computing power; it can only turn domestic solutions into an industry-grade answer.

Consequently, Huawei Cloud must build a second computing plane, offering another ecological choice alongside the globally dominant path formed by NVIDIA + mainstream public clouds. Huawei Cloud cannot and does not intend to use hardware from "all nations" to compete with peers on computing power scale. Zhou Yuefeng said, "I am not willing to compare revenue or scale rankings with other cloud companies, it's meaningless."

Second, the commercial focus is different. Internet-based cloud vendors naturally rely on consumer (C-end) traffic and developer ecosystems, whereas Huawei Cloud places its main focus on the government and enterprise sectors and on industries vital to the national economy and people's livelihood. For example, Huawei Hybrid Cloud has held the top market share in government, finance, and central and state-owned enterprises for multiple consecutive years, serving over 5,500 customers globally.

Zhou Yuefeng stated that the iteration speed of models and computing power is too fast; models could become outdated soon after deployment. Therefore, he advises that government and enterprise clients should not build their own 10,000-card clusters. Instead, they should combine local data with remote public cloud AI computing power/model services, leveraging technologies like confidential inference, confidential training, and confidential computing to balance data sovereignty and computing power sharing. Essentially, this delivers the iteration benefits of the public cloud to clients who cannot fully migrate to the public cloud.

Third, the ecosystem approach is different. Huawei Cloud has embraced open source quite thoroughly: Ascend CANN, the Euler operating system, CCE Volcano scheduling, and the ModelArts toolchain are all open-source. openJiuwen, the open-source version of the intelligent agent platform AgentArts, shares over 90% of its codebase with the commercial version.

At the conference, Huawei Cloud also jointly launched the "Hundred Models, Thousand Forms, Gather on Cloud for Win-Win" plan with over 20 leading model vendors, including Zhipu, DeepSeek, MiniMax, Kimi, StepFun, Baidu, Meituan LongCat, and iFLYTEK Spark.

When domestic computing power is still limited in capability and supply, expanding the ecosystem and increasing model choices is the way to solidify the second computing plane.

Agentic Infra: Shifting the Battlefield from Selling Tokens to Selling Productivity

If the computing power route determines what Huawei Cloud "does not fight," then Agentic Infra determines what it "wants to fight."

Zhou Yuefeng presented a judgment on the evolution of the AI industry: four years ago, doing AI meant buying computing power cards; three years ago, it was training large models; this year, it is using intelligent agents. Computing power and models are receding to the background, while intelligent agents are stepping to the forefront.

The competitive focus of AI cloud is shifting from Token throughput to whether intelligent agents can truly run effectively within enterprises.

Huawei Cloud's product matrix is also realigned according to this judgment. The "four components" of Agentic Infra — efficient Token factory, continuous learning, integrated general-purpose & AI scheduling, and secure autonomy — each address critical engineering challenges enterprises face when deploying intelligent agents.

AICS Lingqu reduces the Token latency for a 100,000-card cluster to under 10 milliseconds; AMS provides petabyte-level memory space via NPU-direct CMS, addressing the Agent's memory bottleneck on long-running tasks; CCE Volcano Next improves resource utilization by over 30% through shared training and inference pools; AgentSphere's lightweight sandbox starts up on the order of 100 milliseconds and supports batch creation of hundreds of thousands of instances per minute.

ModelArts Next restructures the MaaS playbook. Its model routing supports cost-priority, effect-priority, and balanced strategies; it already integrates over 15 SOTA models, achieves scheduling accuracy above 95%, and reduces average calling costs by 20%.
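To make the three routing strategies concrete, here is a minimal sketch of how a router might choose among candidate models. It is not Huawei Cloud's implementation or API: the `ModelOption` type, the `route` function, the model names, prices, and quality scores are all hypothetical, and the "balanced" rule (normalized quality minus normalized cost) is just one plausible way to trade the two off.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelOption:
    name: str           # hypothetical model identifier
    cost_per_1k: float  # hypothetical price per 1,000 tokens
    quality: float      # hypothetical benchmark score in [0, 1]


def _normalize(values):
    """Min-max scale values to [0, 1]; all zeros if every value is equal."""
    lo, hi = min(values), max(values)
    span = hi - lo
    return [0.0 if span == 0 else (v - lo) / span for v in values]


def route(models, strategy="balanced"):
    """Pick one model according to an assumed routing strategy."""
    if not models:
        raise ValueError("no models to route to")
    if strategy == "cost":
        # Cheapest model; ties broken by higher quality.
        return min(models, key=lambda m: (m.cost_per_1k, -m.quality))
    if strategy == "effect":
        # Highest-quality model; ties broken by lower cost.
        return max(models, key=lambda m: (m.quality, -m.cost_per_1k))
    if strategy == "balanced":
        # Assumed trade-off: normalized quality minus normalized cost.
        costs = _normalize([m.cost_per_1k for m in models])
        quals = _normalize([m.quality for m in models])
        scores = [q - c for q, c in zip(quals, costs)]
        best = max(range(len(models)), key=scores.__getitem__)
        return models[best]
    raise ValueError(f"unknown strategy: {strategy!r}")


# Illustrative catalog; none of these figures come from the article.
CATALOG = [
    ModelOption("small-model", cost_per_1k=0.2, quality=0.70),
    ModelOption("medium-model", cost_per_1k=0.5, quality=0.85),
    ModelOption("large-model", cost_per_1k=2.0, quality=0.95),
]

if __name__ == "__main__":
    for strategy in ("cost", "effect", "balanced"):
        print(strategy, "->", route(CATALOG, strategy).name)
```

With this made-up catalog, the cost strategy picks the cheapest model, the effect strategy picks the highest-scoring one, and the balanced strategy lands on the mid-tier model. A production router would presumably also weigh the request itself (task type, context length, latency budget), which this sketch leaves out.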

But Huawei Cloud's truly differentiated bet lies in the industry zones. At this conference, Huawei Cloud launched four "Industry AI Dream Factory" zones at once: Smart Healthcare, Embodied AI, Smart Manufacturing, and Scientific Computing.

The Smart Healthcare zone, co-developed with Shanghai Ruijin Hospital, features the RuiPath large model. Over 20 hospitals, ranging from top-tier to municipal and county-level and located in places including Handan, Rui'an, Qianxinan, and Wu'an, have joined. This marks the first time that capabilities like pathological diagnosis, which depend heavily on expert experience, are being delivered as a "cloud service" to county-level hospitals at scale.

The Embodied AI zone launched CloudRobo, described as the world's first full-process embodied AI development platform, aiming to meet the end-to-end toolchain demands of over 300 embodied AI startups in China.

Zhou Yuefeng stated that healthcare and finance are the most mature and data-rich industries in China's digitalization: "If AI cannot succeed in these industries, it will be even harder in others." In these fields, the yardstick for measuring AI value should not be daily active users or Token counts, but rather the share of financial risks prevented, the improvement in credit efficiency, and the probability of accurate diagnoses for remote patients.

Connecting these threads, the strategic outline of Huawei Cloud becomes clear: using a domestically developed computing power + open-source ecosystem as the foundation; covering government and enterprise sectors with hybrid cloud + confidential computing; and shifting the competition from "selling Tokens" to "selling productivity" through Agentic Infra + industry zones.

This path is much slower than chasing MaaS revenue, and it makes attractive year-on-year figures harder to show, but it sidesteps the current intense price war in the AI cloud. It bets on a market not yet priced: who can secure the underlying infrastructure position when intelligent agents truly enter industry.

On the AI cloud track, Huawei Cloud can only adopt a different solution. Zhou Yuefeng concluded, "I cannot build a silicon-based 'black land' made of hardware from all nations." While other cloud vendors compare whose Tokens offer higher cost-performance, Huawei Cloud is striving to see if this domestic computing power system can meet the real future needs of China's industrial AI.

(Author | Zhang Shuai, Editor | Yang Lin)

Related Questions

Q: According to the article, what is Huawei Cloud's strategic focus, and how does it differ from other major AI cloud providers in China?

A: Huawei Cloud's strategic focus is not on maximizing token volume or revenue, but on ensuring that 'every token truly enhances productivity.' This contrasts with other major Chinese AI cloud providers like Alibaba Cloud and Volcano Engine, who emphasize daily token calls and MaaS revenue scale as key growth metrics and engage in intense price competition on inference costs.

Q: What are the three key reasons CEO Zhou Yuefeng gives for Huawei Cloud's different approach compared to other cloud companies?

A: Zhou Yuefeng cites three reasons: 1. Different compute path: Huawei Cloud relies entirely on a domestic R&D system (Ascend, Kunpeng, CANN, Euler), creating a 'second compute plane' distinct from the dominant NVIDIA path. 2. Different commercial focus: Huawei Cloud heavily targets government and enterprise (G/ICT) and key national industries, rather than relying on consumer internet traffic. 3. Different ecosystem strategy: Huawei Cloud employs a thorough open-source approach for its core technologies and collaborates broadly with model vendors through initiatives like the 'Hundred Models, Thousand Forms' plan.

Q: What is 'Agentic Infra,' and what role does it play in Huawei Cloud's strategy for the AI cloud market?

A: 'Agentic Infra' (Agentic Infrastructure) is a new paradigm introduced by Huawei Cloud. It represents a shift in the AI cloud competition focus from token throughput to enabling agents to operate effectively within enterprises. It consists of a 'four-piece suite' addressing key engineering challenges: efficient token factories (AICS), continuous learning/memory (AMS), integrated scheduling for training and inference (CCE Volcano Next), and secure autonomous operation (AgentSphere). This moves the battlefield from 'selling tokens' to 'selling productivity.'

Q: What are 'Industry AI Dream Factory' zones, and can you name two specific examples mentioned in the article?

A: 'Industry AI Dream Factory' zones are specialized platforms launched by Huawei Cloud to provide AI solutions and tools tailored to specific vertical industries. Two examples mentioned are: 1. The Smart Healthcare Zone, which features the RuiPath large model co-developed with Shanghai Ruijin Hospital and serves over 20 hospitals. 2. The Embodied Intelligence Zone, which introduced CloudRobo, a full-process development platform aimed at serving over 300 embodied AI startups in China.

Q: How does Huawei Cloud propose to serve government and enterprise clients who cannot fully migrate to the public cloud, according to Zhou Yuefeng?

A: Zhou Yuefeng suggests a hybrid approach for such clients. He advises against them building large-scale AI clusters themselves due to the rapid iteration of models and compute. Instead, he recommends keeping data locally while leveraging remote public cloud AI compute/model services. This is combined with technologies like confidential inference, training, and computing to balance data sovereignty with access to shared compute power. This model essentially delivers the iteration benefits of the public cloud to clients with stricter data requirements.
