Xiaomi and MiniMax Unleash Major Upgrades Simultaneously, Officially Kicking Off the Agent Pricing War

marsbitPublished on 2026-03-20Last updated on 2026-03-20

Abstract

Chinese AI companies MiniMax and Xiaomi's MiMo have both launched major Agent-focused models, M2.7 and V2-Pro, respectively, within two days in March. Both models rank in the top tier globally on Agent benchmarks but are priced significantly lower than leading Western models—MiniMax at $1.2 per million tokens (1/21 of Claude Opus) and MiMo at $3 (1/8 of Claude Opus). The two represent divergent technical strategies. MiMo-V2-Pro adopts a scale-driven approach with over 1 trillion parameters and a hybrid attention mechanism optimized for long-context and multi-tool agent tasks. In contrast, MiniMax’s M2.7 uses a self-iterative optimization method, autonomously refining its architecture over 100+ cycles to improve performance without disclosing parameter count. Their release rhythms also differ: MiniMax iterates rapidly with four versions in five months, while Xiaomi releases fewer but more substantial upgrades. Notably, Xiaomi debuted V2-Pro anonymously on OpenRouter as "Hunter Alpha," topping the platform’s usage chart before revealing its identity—a first for a Chinese AI model gaining global developer traction through pure performance.

On March 18 and 19, two Chinese companies successively released their major models in the Agent direction. Domestic AI startup MiniMax launched M2.7, while Xiaomi's large model team MiMo introduced V2-Pro. Both models have entered the global top tier on the Agent benchmark, but their API output pricing is 1/21 and 1/8 of Claude Opus 4.6, respectively.

They played their cards in the same week, but with completely different hands. They represent two截然不同的 technical routes, betting on two different futures for the Agent era.

The Same Exam, 1/17 the Tuition

First, let's look at the most直观 comparison.

According to data from OpenRouter and the official pricing pages of various companies, based on API output price (per million tokens), MiniMax M2.7 is $1.2, and MiMo-V2-Pro is $3. As a reference, Claude Opus 4.6's output price is $25, GPT-5.2 is $14, and Claude Sonnet 4.6 is $15.

The price gap is by an order of magnitude, but the capability gap is not. On SWE-bench Verified (the current mainstream benchmark for measuring code engineering capabilities), MiMo-V2-Pro scored 78%, while Sonnet 4.6 scored 79.6%, a difference of less than two percentage points. M2.7's SWE-Pro score was 56.22%, on par with GPT-5.3-Codex. On VIBE-Pro (end-to-end project delivery capability), M2.7 scored 55.6%,接近 the level of Opus 4.6.

The key point of this chart is not who is higher or lower—the benchmark systems of various companies are not fully aligned, so direct comparisons should be made cautiously. The key point is that "price-performance剪刀差": domestic Agent models have already挤进 the same capability band but stand in completely different price ranges.

Trillion Parameters vs. Self-Evolution

Price is only the表象. The two companies have revealed two completely different底牌.

MiMo-V2-Pro follows the "more is better" route. According to Xiaomi's official announcement, V2-Pro has over 1 trillion total parameters, 42B activated parameters, and supports an ultra-long context of 1 million tokens. Its core innovation is the Hybrid Attention mechanism, adjusting the ratio of Sliding Window Attention (SWA) to Global Attention (GA) to 7:1—the previous generation V2-Flash was 5:1. This architecture makes the model more stable when handling long documents and multi-tool parallel calling Agent scenarios. On PinchBench (Agent tool calling capability evaluation), MiMo-V2-Pro scored 84%.

M2.7 takes a completely different path. According to the official technical blog released by MiniMax on March 18, M2.7's parameter count is not公开, but it demonstrates a "self-iterative evolution" mechanism: the model autonomously runs over 100 rounds of optimization cycles, including analyzing failure trajectories, planning modifications, modifying its own code architecture, running evaluations, and cycling again, ultimately achieving a 30% performance improvement on the internal evaluation set. On the MLE Bench Lite (machine learning competition difficulty evaluation) with 22 high-difficulty problems, M2.7 won 9 gold, 5 silver, and 1 bronze, with an average medal rate of 66.6%.

Looking from five dimensions, the锋芒 of the two routes朝向 completely different directions: MiMo-V2-Pro has obvious advantages in context length and code engineering dimensions, while M2.7 pulls ahead in office automation and self-iterative capabilities. According to the same MiniMax technical blog, M2.7 scored ELO 1495 on GDPval-AA (office document processing evaluation), ranking first among open-source models, and maintained a 97% skill adherence rate in the MM-Claw test covering over 40 complex skills.

Four Versions in Five Months

The two companies not only have different technical routes but also completely different iteration rhythms.

According to public release records, MiniMax iterated four major versions from the release of M2 in October 2025 to the release of M2.7 in March 2026—a new version every 49 days on average. The interval between M2.5 and M2.7 was only about 30 days.

Xiaomi MiMo's rhythm is different: MiMo-7B (a 7B parameter open-source inference model) was released in April 2025, V2-Flash (309B total parameters) in December 2025, and V2-Pro (1T total parameters) in March 2026. The parameter scale leap between each generation is larger, but the version intervals are also longer.

MiniMax chose small steps and quick runs, with small iteration amplitudes but extremely high frequency; M2.7's self-iterative mechanism is itself designed for "continuous evolution." Xiaomi chose蓄力一击, with each version representing a major leap in parameter scale and architecture.

Anonymous for 8 Days, Topping OpenRouter

Beyond the technical route, Xiaomi's release strategy also broke industry conventions.

According to a Reuters report, on March 11, an anonymous model named Hunter Alpha appeared on OpenRouter, the world's largest API aggregation platform. No brand endorsement, no launch event, no technical blog. Its API pricing was extremely low, yet its performance was surprisingly strong.

The community began speculating about its origin. According to Republic World and multiple tech media reports, the most mainstream guess was DeepSeek V4, as MiMo team leader Luo Fuli had previously conducted research at DeepSeek. Call volume surged rapidly, exceeding 1 trillion tokens during the anonymous period, topping the OpenRouter weekly chart.

In the early hours of March 19, Xiaomi revealed the answer: Hunter Alpha was MiMo-V2-Pro. According to the same Reuters report, Xiaomi's Hong Kong stock saw a gain of up to 5.8% after the reveal.

This was the first time a domestic large model proved itself on a global platform through pure blind testing. Relying not on brand or宣传, but letting developers vote with their feet over 8 days.

Not a Price Hike, but a Supply Cut? Oil Prices Have Crossed the Tipping Point

The global oil market has passed a critical point, shifting the focus from price increases to potential physical supply shortages. The core issue is a time mismatch: even if the Strait of Hormuz reopens, shipping disruptions have already caused significant delays, which will continue to deplete onshore crude inventories for weeks. Refinery behavior acts as an amplifier. Reduced runs in Asia and Europe don’t reflect weaker demand but instead shrink product inventories, raising fuel prices and refining margins, which in turn encourages higher runs—creating a self-reinforcing cycle. If the Strait remains closed beyond April, traditional pricing models may fail. The market could face an unprecedented physical shortfall of 11-13 million barrels per day—roughly four times historical disruption levels. In such a scenario, price becomes an inadequate balancing tool. The only way to rebalance the market would be policy-driven demand destruction, similar to COVID-era lockdowns. Current prices around $95/barrel are insufficient to balance the market. Key signals to watch include inventory levels, policy announcements, and the pace of involuntary demand contraction. Geopolitically, the situation appears likely to worsen before improving, with little room for compromise between the US and Iran.

marsbit1h ago

Not a Price Hike, but a Supply Cut? Oil Prices Have Crossed the Tipping Point

marsbit1h ago

Amazon Invests Additional $25 Billion in Anthropic, AI Infrastructure 'Arms Race' Escalates

Amazon announces an additional investment of up to $25 billion in Anthropic, with $5 billion delivered immediately and the remaining contingent on performance milestones. This follows a recent $50 billion investment in OpenAI, highlighting Amazon's strategy of backing leading AI labs. The deal includes a commitment from Anthropic to spend over $100 billion on AWS infrastructure over the next decade, securing up to 5 gigawatts of computing power to address growing demand and capacity constraints. Anthropic’s annualized revenue has surpassed $30 billion, but the company faces infrastructure strain due to rapid user growth. The investment will support scaling Claude’s capabilities using Amazon’s custom Trainium and Graviton chips. The move deepens integration between Anthropic and AWS, allowing Claude to be accessed natively within AWS services. Over 100,000 organizations already use Claude via Amazon Bedrock. This investment is part of a broader AI infrastructure race, with Amazon planning around $200 billion in capital expenditures this year, largely focused on expanding AI compute capacity.

marsbit1h ago

Amazon Invests Additional $25 Billion in Anthropic, AI Infrastructure 'Arms Race' Escalates

marsbit1h ago

A New CEO Who Has Worked Exclusively with Hardware for 25 Years Takes Over Apple, Valued at 4 Trillion

Apple, the world's most valuable tech firm, has appointed John Ternus as its new CEO, effective September 1, replacing Tim Cook who transitions to executive chairman. Ternus, a 51-year-old mechanical engineer with 25-year tenure at Apple, previously served as senior vice president of hardware engineering. Known for his low public profile and lack of social media presence, he played key roles in developing products like the iPad, AirPods, and led Apple’s transition from Intel to in-house silicon chips. Ternus takes over a company valued at $4 trillion, with Cook having multiplied Apple’s revenue and built a $100 billion services business. However, he inherits significant challenges: Apple’s AI efforts, including the delayed Siri revamp powered by Google’s Gemini, remain unproven, and the company is preparing to launch its first foldable iPhone amid supply constraints. While Ternus’s hardware expertise positions him well for product-driven innovation, his lack of software/AI experience raises questions about Apple’s competitiveness in the AI era. The company’s structure has been adjusted to support him, with Cook remaining in an advisory role. Ternus’s engineer-led approach—emphasizing humility and collaboration—may prove vital in navigating Apple’s next chapter.

marsbit1h ago

A New CEO Who Has Worked Exclusively with Hardware for 25 Years Takes Over Apple, Valued at 4 Trillion

marsbit1h ago

Atkins' First Year at the Helm of the SEC: A Comprehensive Shift in Crypto Regulation

Paul Atkins marked his one-year anniversary as Chair of the U.S. Securities and Exchange Commission (SEC) on April 21, 2025, overseeing a significant shift in the agency’s approach to cryptocurrency regulation. Under his leadership, the SEC dropped multiple enforcement actions against crypto firms, approved several crypto-linked ETFs, and issued guidance clarifying that most cryptocurrencies are not considered securities under federal law. The SEC also signed a memorandum with the CFTC to improve regulatory coordination. These actions reversed the aggressive enforcement stance of his predecessor, Gary Gensler, and aligned with Trump administration promises to support the crypto industry. However, Atkins has faced criticism from Democratic lawmakers, including Senator Elizabeth Warren, who raised concerns over potential conflicts of interest, particularly regarding dropped cases linked to Trump-affiliated companies. While regulatory clarity has improved, the SEC still awaits congressional action to formally define its jurisdiction over digital assets.

marsbit1h ago

Atkins' First Year at the Helm of the SEC: A Comprehensive Shift in Crypto Regulation

marsbit1h ago

a16z: 5 Ways Blockchain Can Help AI Agent Infrastructure

Blockchain technology provides critical infrastructure for AI agents by addressing five key challenges: 1) Non-human identity: AI agents lack standardized, portable identity systems. Blockchain enables verifiable, cross-platform agent identities (like "Know Your Agent" frameworks) through cryptographic credentials and on-chain registries. 2) AI governance: When AI systems execute decisions, blockchain ensures transparency and prevents centralized control by recording actions on-chain and enabling auditable execution logs. 3) Payments: Stablecoins and crypto payments (e.g., x402, MPP) serve as default settlement layers for agent-to-agent commerce, enabling frictionless, programmable transactions for "headless" AI-native businesses. 4) Trust and verification: As AI scales, blockchain provides cryptographic proof of origin and auditable histories, making verification—not intelligence—the scarce resource. 5) User control: Crypto-native tools (e.g., delegation toolkits, intent-based architectures) allow users to set boundaries and maintain oversight over autonomous agents, minimizing blind trust. Together, blockchain and AI can create an economic infrastructure built on transparency, accountability, and user sovereignty.

marsbit2h ago

a16z: 5 Ways Blockchain Can Help AI Agent Infrastructure

marsbit2h ago

Trading

Spot

Futures

Hot Articles

How to Buy WAR

Welcome to HTX.com! We've made purchasing WAR (WAR) simple and convenient. Follow our step-by-step guide to embark on your crypto journey.Step 1: Create Your HTX AccountUse your email or phone number to sign up for a free account on HTX. Experience a hassle-free registration journey and unlock all features.Get My AccountStep 2: Go to Buy Crypto and Choose Your Payment MethodCredit/Debit Card: Use your Visa or Mastercard to buy WAR (WAR) instantly.Balance: Use funds from your HTX account balance to trade seamlessly.Third Parties: We've added popular payment methods such as Google Pay and Apple Pay to enhance convenience.P2P: Trade directly with other users on HTX.Over-the-Counter (OTC): We offer tailor-made services and competitive exchange rates for traders.Step 3: Store Your WAR (WAR)After purchasing your WAR (WAR), store it in your HTX account. Alternatively, you can send it elsewhere via blockchain transfer or use it to trade other cryptocurrencies.Step 4: Trade WAR (WAR)Easily trade WAR (WAR) on HTX's spot market. Simply access your account, select your trading pair, execute your trades, and monitor in real-time. We offer a user-friendly experience for both beginners and seasoned traders.

1.6k Total ViewsPublished 2024.03.29Updated 2026.03.18

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of WAR (WAR) are presented below.

Xiaomi and MiniMax Unleash Major Upgrades Simultaneously, Officially Kicking Off the Agent Pricing War

Abstract

The Same Exam, 1/17 the Tuition

Trillion Parameters vs. Self-Evolution

Four Versions in Five Months

Anonymous for 8 Days, Topping OpenRouter

Related Questions

Related Reads

Not a Price Hike, but a Supply Cut? Oil Prices Have Crossed the Tipping Point

Amazon Invests Additional $25 Billion in Anthropic, AI Infrastructure 'Arms Race' Escalates

A New CEO Who Has Worked Exclusively with Hardware for 25 Years Takes Over Apple, Valued at 4 Trillion

Atkins' First Year at the Helm of the SEC: A Comprehensive Shift in Crypto Regulation

a16z: 5 Ways Blockchain Can Help AI Agent Infrastructure

Trading

Hot Articles

How to Buy WAR

Discussions

Top Questions

Hot Categories

Hot Tags