Xiaomi and MiniMax Unleash Major Upgrades Simultaneously, Officially Kicking Off the Agent Pricing War

marsbitPubblicato 2026-03-20Pubblicato ultima volta 2026-03-20

Introduzione

Chinese AI companies MiniMax and Xiaomi's MiMo have both launched major Agent-focused models, M2.7 and V2-Pro, respectively, within two days in March. Both models rank in the top tier globally on Agent benchmarks but are priced significantly lower than leading Western models—MiniMax at $1.2 per million tokens (1/21 of Claude Opus) and MiMo at $3 (1/8 of Claude Opus). The two represent divergent technical strategies. MiMo-V2-Pro adopts a scale-driven approach with over 1 trillion parameters and a hybrid attention mechanism optimized for long-context and multi-tool agent tasks. In contrast, MiniMax’s M2.7 uses a self-iterative optimization method, autonomously refining its architecture over 100+ cycles to improve performance without disclosing parameter count. Their release rhythms also differ: MiniMax iterates rapidly with four versions in five months, while Xiaomi releases fewer but more substantial upgrades. Notably, Xiaomi debuted V2-Pro anonymously on OpenRouter as "Hunter Alpha," topping the platform’s usage chart before revealing its identity—a first for a Chinese AI model gaining global developer traction through pure performance.

On March 18 and 19, two Chinese companies successively released their major models in the Agent direction. Domestic AI startup MiniMax launched M2.7, while Xiaomi's large model team MiMo introduced V2-Pro. Both models have entered the global top tier on the Agent benchmark, but their API output pricing is 1/21 and 1/8 of Claude Opus 4.6, respectively.

They played their cards in the same week, but with completely different hands. They represent two截然不同的 technical routes, betting on two different futures for the Agent era.

The Same Exam, 1/17 the Tuition

First, let's look at the most直观 comparison.

According to data from OpenRouter and the official pricing pages of various companies, based on API output price (per million tokens), MiniMax M2.7 is $1.2, and MiMo-V2-Pro is $3. As a reference, Claude Opus 4.6's output price is $25, GPT-5.2 is $14, and Claude Sonnet 4.6 is $15.

The price gap is by an order of magnitude, but the capability gap is not. On SWE-bench Verified (the current mainstream benchmark for measuring code engineering capabilities), MiMo-V2-Pro scored 78%, while Sonnet 4.6 scored 79.6%, a difference of less than two percentage points. M2.7's SWE-Pro score was 56.22%, on par with GPT-5.3-Codex. On VIBE-Pro (end-to-end project delivery capability), M2.7 scored 55.6%,接近 the level of Opus 4.6.

The key point of this chart is not who is higher or lower—the benchmark systems of various companies are not fully aligned, so direct comparisons should be made cautiously. The key point is that "price-performance剪刀差": domestic Agent models have already挤进 the same capability band but stand in completely different price ranges.

Trillion Parameters vs. Self-Evolution

Price is only the表象. The two companies have revealed two completely different底牌.

MiMo-V2-Pro follows the "more is better" route. According to Xiaomi's official announcement, V2-Pro has over 1 trillion total parameters, 42B activated parameters, and supports an ultra-long context of 1 million tokens. Its core innovation is the Hybrid Attention mechanism, adjusting the ratio of Sliding Window Attention (SWA) to Global Attention (GA) to 7:1—the previous generation V2-Flash was 5:1. This architecture makes the model more stable when handling long documents and multi-tool parallel calling Agent scenarios. On PinchBench (Agent tool calling capability evaluation), MiMo-V2-Pro scored 84%.

M2.7 takes a completely different path. According to the official technical blog released by MiniMax on March 18, M2.7's parameter count is not公开, but it demonstrates a "self-iterative evolution" mechanism: the model autonomously runs over 100 rounds of optimization cycles, including analyzing failure trajectories, planning modifications, modifying its own code architecture, running evaluations, and cycling again, ultimately achieving a 30% performance improvement on the internal evaluation set. On the MLE Bench Lite (machine learning competition difficulty evaluation) with 22 high-difficulty problems, M2.7 won 9 gold, 5 silver, and 1 bronze, with an average medal rate of 66.6%.

Looking from five dimensions, the锋芒 of the two routes朝向 completely different directions: MiMo-V2-Pro has obvious advantages in context length and code engineering dimensions, while M2.7 pulls ahead in office automation and self-iterative capabilities. According to the same MiniMax technical blog, M2.7 scored ELO 1495 on GDPval-AA (office document processing evaluation), ranking first among open-source models, and maintained a 97% skill adherence rate in the MM-Claw test covering over 40 complex skills.

Four Versions in Five Months

The two companies not only have different technical routes but also completely different iteration rhythms.

According to public release records, MiniMax iterated four major versions from the release of M2 in October 2025 to the release of M2.7 in March 2026—a new version every 49 days on average. The interval between M2.5 and M2.7 was only about 30 days.

Xiaomi MiMo's rhythm is different: MiMo-7B (a 7B parameter open-source inference model) was released in April 2025, V2-Flash (309B total parameters) in December 2025, and V2-Pro (1T total parameters) in March 2026. The parameter scale leap between each generation is larger, but the version intervals are also longer.

MiniMax chose small steps and quick runs, with small iteration amplitudes but extremely high frequency; M2.7's self-iterative mechanism is itself designed for "continuous evolution." Xiaomi chose蓄力一击, with each version representing a major leap in parameter scale and architecture.

Anonymous for 8 Days, Topping OpenRouter

Beyond the technical route, Xiaomi's release strategy also broke industry conventions.

According to a Reuters report, on March 11, an anonymous model named Hunter Alpha appeared on OpenRouter, the world's largest API aggregation platform. No brand endorsement, no launch event, no technical blog. Its API pricing was extremely low, yet its performance was surprisingly strong.

The community began speculating about its origin. According to Republic World and multiple tech media reports, the most mainstream guess was DeepSeek V4, as MiMo team leader Luo Fuli had previously conducted research at DeepSeek. Call volume surged rapidly, exceeding 1 trillion tokens during the anonymous period, topping the OpenRouter weekly chart.

In the early hours of March 19, Xiaomi revealed the answer: Hunter Alpha was MiMo-V2-Pro. According to the same Reuters report, Xiaomi's Hong Kong stock saw a gain of up to 5.8% after the reveal.

This was the first time a domestic large model proved itself on a global platform through pure blind testing. Relying not on brand or宣传, but letting developers vote with their feet over 8 days.

Domande pertinenti

QWhat are the two Chinese companies that recently released their Agent-oriented large models, and what are the model names?

AMiniMax released the M2.7 model, and Xiaomi's MiMo team released the V2-Pro model.

QHow does the API output pricing of MiniMax M2.7 and MiMo-V2-Pro compare to Claude Opus 4.6?

AThe API output price for MiniMax M2.7 is $1.2 per million tokens, which is 1/21 of Claude Opus 4.6's $25. MiMo-V2-Pro is $3 per million tokens, which is 1/8 of Claude Opus 4.6's price.

QWhat are the core technical approaches of MiMo-V2-Pro and MiniMax M2.7?

AMiMo-V2-Pro follows a 'scale-up' approach with over 1 trillion total parameters and a Hybrid Attention mechanism. MiniMax M2.7 uses a 'self-iterative evolution' mechanism where the model autonomously runs optimization cycles to improve its own performance.

QWhat was unique about Xiaomi's release strategy for the MiMo-V2-Pro model?

AXiaomi first released the model anonymously on OpenRouter under the name 'Hunter Alpha' for 8 days. It gained significant developer traction and topped the OpenRouter weekly chart before Xiaomi revealed it was their model.

QHow did the iteration rhythms of MiniMax and Xiaomi's MiMo team differ?

AMiniMax iterated rapidly, releasing four versions in five months (approx. every 49 days). Xiaomi's MiMo team had longer release intervals with larger parameter scale jumps between versions, such as from 7B parameters to 309B, and then to 1T.

Letture associate

iQiyi Is Too Impatient

The article "iQiyi Is Too Impatient" discusses the controversy surrounding the Chinese streaming platform IQiyi's recent announcement of an "AI Actor Library" during its 2026 World Conference. IQiyi claimed over 100 actors, including well-known names like Zhang Ruoyun and Yu Hewei, had joined the initiative. CEO Gong Yu suggested AI could enable actors to "star in 14 dramas a year instead of 4" and that "live-action filming might become a world cultural heritage." The announcement quickly sparked backlash. Multiple actors named in the list issued urgent statements denying they had signed any AI-related authorization agreements. This forced IQiyi to clarify that inclusion in the library only indicated a willingness to *consider* AI projects, with separate negotiations required for any specific role. The incident, which trended on social media with hashtags like "IQiyi is crazy," is presented as a sign of the company's growing desperation. Facing intense competition from short-video platforms like Douyin and Kuaishou, as well as Bilibili and Xiaohongshu, IQiyi's financial performance has weakened, with revenues declining for two consecutive years. The author argues that IQiyi is "too impatient" to tell a compelling AI story to reassure the market, especially as it pursues a listing on the Hong Kong stock exchange. The piece concludes by outlining three key "AI questions" IQiyi must answer: defining its role as a tool provider versus a content creator, balancing the "coldness" of AI with the human element audiences desire, and properly managing the interests of platforms, actors, and viewers. The core dilemma is that while AI can reduce costs and increase efficiency, it risks creating homogenized, formulaic content and devaluing human performers.

marsbit17 min fa

Jensen Huang Publicly Challenges Google and Amazon, Is the Chip Business Entirely Sustained by Anthropic?

In a candid interview, Nvidia CEO Jensen Huang challenged competitors like Google and Amazon, admitted past strategic errors, and criticized U.S. export controls on AI chips. He framed Nvidia’s role as turning “electricity in, tokens out,” emphasizing the complexity and value of AI inference. Huang dismissed rival custom chips like Google’s TPU and Amazon’s Trainium as inflexible and niche, claiming their growth relies heavily on clients like Anthropic. He also acknowledged missing early investment opportunities in OpenAI and Anthropic. On China, Huang warned that export restrictions risk pushing the country toward self-sufficiency and could cost U.S. leadership in AI. Finally, he explained Nvidia’s acquisition of Groq as a move to serve premium, low-latency token markets. Throughout, Huang emphasized ecosystem trust and Nvidia’s central role in global AI infrastructure.

marsbit22 min fa

Jensen Huang Publicly Challenges Google and Amazon, Is the Chip Business Entirely Sustained by Anthropic?

marsbit22 min fa

ARK's SpaceX IPO Investment Guide: $1.75 Trillion Valuation, 95x Price-to-Sales Ratio, Where Is the Money Going?

ARK Invest, a major investor in SpaceX, provides an analysis following SpaceX's confidential IPO filing on April 1, 2026, targeting a $1.75 trillion valuation and up to $75 billion in funding for a Nasdaq listing around June 2026. This would be the largest IPO in history. The valuation implies a price-to-sales ratio of ~95x based on ~$18.5B in projected 2025 revenue. ARK argues this reflects future potential, not current reality. Key drivers include Starlink, with over 10M users and $20B+ in 2026 revenue, launch services which have reduced costs by ~95% since 2008, and the strategic merger with xAI in February 2026, enabling vertically integrated AI and orbital computing. ARK’s investment thesis is based on SpaceX's proven execution of ambitious goals, like reusable rockets and global satellite internet. The firm notes that significant value creation often occurs pre-IPO, and its Venture Fund offers exposure to SpaceX’s earlier private stages. Post-IPO, the fund will manage the position per standard lock-up and rebalancing processes, maintaining flexibility to reinvest in other private innovations. The analysis is based on public estimates and ARK research; the S-1 filing is not yet public. This is not investment advice.

marsbit24 min fa

ARK's SpaceX IPO Investment Guide: $1.75 Trillion Valuation, 95x Price-to-Sales Ratio, Where Is the Money Going?

marsbit24 min fa

Stablecoin Hype Overblown? Moody’s Says Banks Aren’t In Danger

The article discusses the legislative stalemate in the US over the Digital Asset Market Clarity Act of 2025 (CLARITY Act), which aims to regulate crypto assets. A key point of contention is whether stablecoins should be allowed to pay interest, with crypto companies opposing a ban and banks supporting it. Moody's analyst Abhi Srivastava states that, for now, stablecoins do not pose a significant threat to traditional banks due to existing efficient payment systems and the current prohibition on yield-bearing stablecoins. This limits their appeal for pulling deposits from banks at scale. However, with the stablecoin market cap exceeding $300 billion and growth in tokenized assets, banks could face future pressure from deposit outflows. The crypto industry warns that failure to pass the bill could lead to stricter regulatory crackdowns. Negotiations continue with little progress, despite both sides desiring a resolution.

bitcoinist1 h fa

Stablecoin Hype Overblown? Moody’s Says Banks Aren’t In Danger

bitcoinist1 h fa

Not a Price Hike, but a Supply Cut? Oil Prices Have Crossed the Tipping Point

The global oil market has passed a critical point, shifting the focus from price increases to potential physical supply shortages. The core issue is a time mismatch: even if the Strait of Hormuz reopens, shipping disruptions have already caused significant delays, which will continue to deplete onshore crude inventories for weeks. Refinery behavior acts as an amplifier. Reduced runs in Asia and Europe don’t reflect weaker demand but instead shrink product inventories, raising fuel prices and refining margins, which in turn encourages higher runs—creating a self-reinforcing cycle. If the Strait remains closed beyond April, traditional pricing models may fail. The market could face an unprecedented physical shortfall of 11-13 million barrels per day—roughly four times historical disruption levels. In such a scenario, price becomes an inadequate balancing tool. The only way to rebalance the market would be policy-driven demand destruction, similar to COVID-era lockdowns. Current prices around $95/barrel are insufficient to balance the market. Key signals to watch include inventory levels, policy announcements, and the pace of involuntary demand contraction. Geopolitically, the situation appears likely to worsen before improving, with little room for compromise between the US and Iran.

marsbit2 h fa

Not a Price Hike, but a Supply Cut? Oil Prices Have Crossed the Tipping Point

marsbit2 h fa

Trading

Spot

Futures

Articoli Popolari

Come comprare WAR

Benvenuto in HTX.com! Abbiamo reso l'acquisto di WeStarter (WAR) semplice e conveniente. Segui la nostra guida passo passo per intraprendere il tuo viaggio nel mondo delle criptovalute.Step 1: Crea il tuo Account HTXUsa la tua email o numero di telefono per registrarti il tuo account gratuito su HTX. Vivi un'esperienza facile e sblocca tutte le funzionalità,Crea il mio accountStep 2: Vai in Acquista crypto e seleziona il tuo metodo di pagamentoCarta di credito/debito: utilizza la tua Visa o Mastercard per acquistare immediatamente WeStarterWAR.Bilancio: Usa i fondi dal bilancio del tuo account HTX per fare trading senza problemi.Terze parti: abbiamo aggiunto metodi di pagamento molto utilizzati come Google Pay e Apple Pay per maggiore comodità.P2P: Fai trading direttamente con altri utenti HTX.Over-the-Counter (OTC): Offriamo servizi su misura e tassi di cambio competitivi per i trader.Step 3: Conserva WeStarter (WAR)Dopo aver acquistato WeStarter (WAR), conserva nel tuo account HTX. In alternativa, puoi inviare tramite trasferimento blockchain o scambiare per altre criptovalute.Step 4: Scambia WeStarter (WAR)Scambia facilmente WeStarter (WAR) nel mercato spot di HTX. Accedi al tuo account, seleziona la tua coppia di trading, esegui le tue operazioni e monitora in tempo reale. Offriamo un'esperienza user-friendly sia per chi ha appena iniziato che per i trader più esperti.

129 Totale visualizzazioniPubblicato il 2024.12.11Aggiornato il 2025.02.23

Discussioni

Benvenuto nella Community HTX. Qui puoi rimanere informato sugli ultimi sviluppi della piattaforma e accedere ad approfondimenti esperti sul mercato. Le opinioni degli utenti sul prezzo di WAR WAR sono presentate come di seguito.