Xiaomi and MiniMax Unleash Major Upgrades Simultaneously, Officially Kicking Off the Agent Pricing War

marsbitPublicado a 2026-03-20Actualizado a 2026-03-20

Resumen

Chinese AI companies MiniMax and Xiaomi's MiMo have both launched major Agent-focused models, M2.7 and V2-Pro, respectively, within two days in March. Both models rank in the top tier globally on Agent benchmarks but are priced significantly lower than leading Western models—MiniMax at $1.2 per million tokens (1/21 of Claude Opus) and MiMo at $3 (1/8 of Claude Opus). The two represent divergent technical strategies. MiMo-V2-Pro adopts a scale-driven approach with over 1 trillion parameters and a hybrid attention mechanism optimized for long-context and multi-tool agent tasks. In contrast, MiniMax’s M2.7 uses a self-iterative optimization method, autonomously refining its architecture over 100+ cycles to improve performance without disclosing parameter count. Their release rhythms also differ: MiniMax iterates rapidly with four versions in five months, while Xiaomi releases fewer but more substantial upgrades. Notably, Xiaomi debuted V2-Pro anonymously on OpenRouter as "Hunter Alpha," topping the platform’s usage chart before revealing its identity—a first for a Chinese AI model gaining global developer traction through pure performance.

On March 18 and 19, two Chinese companies successively released their major models in the Agent direction. Domestic AI startup MiniMax launched M2.7, while Xiaomi's large model team MiMo introduced V2-Pro. Both models have entered the global top tier on the Agent benchmark, but their API output pricing is 1/21 and 1/8 of Claude Opus 4.6, respectively.

They played their cards in the same week, but with completely different hands. They represent two截然不同的 technical routes, betting on two different futures for the Agent era.

The Same Exam, 1/17 the Tuition

First, let's look at the most直观 comparison.

According to data from OpenRouter and the official pricing pages of various companies, based on API output price (per million tokens), MiniMax M2.7 is $1.2, and MiMo-V2-Pro is $3. As a reference, Claude Opus 4.6's output price is $25, GPT-5.2 is $14, and Claude Sonnet 4.6 is $15.

The price gap is by an order of magnitude, but the capability gap is not. On SWE-bench Verified (the current mainstream benchmark for measuring code engineering capabilities), MiMo-V2-Pro scored 78%, while Sonnet 4.6 scored 79.6%, a difference of less than two percentage points. M2.7's SWE-Pro score was 56.22%, on par with GPT-5.3-Codex. On VIBE-Pro (end-to-end project delivery capability), M2.7 scored 55.6%,接近 the level of Opus 4.6.

The key point of this chart is not who is higher or lower—the benchmark systems of various companies are not fully aligned, so direct comparisons should be made cautiously. The key point is that "price-performance剪刀差": domestic Agent models have already挤进 the same capability band but stand in completely different price ranges.

Trillion Parameters vs. Self-Evolution

Price is only the表象. The two companies have revealed two completely different底牌.

MiMo-V2-Pro follows the "more is better" route. According to Xiaomi's official announcement, V2-Pro has over 1 trillion total parameters, 42B activated parameters, and supports an ultra-long context of 1 million tokens. Its core innovation is the Hybrid Attention mechanism, adjusting the ratio of Sliding Window Attention (SWA) to Global Attention (GA) to 7:1—the previous generation V2-Flash was 5:1. This architecture makes the model more stable when handling long documents and multi-tool parallel calling Agent scenarios. On PinchBench (Agent tool calling capability evaluation), MiMo-V2-Pro scored 84%.

M2.7 takes a completely different path. According to the official technical blog released by MiniMax on March 18, M2.7's parameter count is not公开, but it demonstrates a "self-iterative evolution" mechanism: the model autonomously runs over 100 rounds of optimization cycles, including analyzing failure trajectories, planning modifications, modifying its own code architecture, running evaluations, and cycling again, ultimately achieving a 30% performance improvement on the internal evaluation set. On the MLE Bench Lite (machine learning competition difficulty evaluation) with 22 high-difficulty problems, M2.7 won 9 gold, 5 silver, and 1 bronze, with an average medal rate of 66.6%.

Looking from five dimensions, the锋芒 of the two routes朝向 completely different directions: MiMo-V2-Pro has obvious advantages in context length and code engineering dimensions, while M2.7 pulls ahead in office automation and self-iterative capabilities. According to the same MiniMax technical blog, M2.7 scored ELO 1495 on GDPval-AA (office document processing evaluation), ranking first among open-source models, and maintained a 97% skill adherence rate in the MM-Claw test covering over 40 complex skills.

Four Versions in Five Months

The two companies not only have different technical routes but also completely different iteration rhythms.

According to public release records, MiniMax iterated four major versions from the release of M2 in October 2025 to the release of M2.7 in March 2026—a new version every 49 days on average. The interval between M2.5 and M2.7 was only about 30 days.

Xiaomi MiMo's rhythm is different: MiMo-7B (a 7B parameter open-source inference model) was released in April 2025, V2-Flash (309B total parameters) in December 2025, and V2-Pro (1T total parameters) in March 2026. The parameter scale leap between each generation is larger, but the version intervals are also longer.

MiniMax chose small steps and quick runs, with small iteration amplitudes but extremely high frequency; M2.7's self-iterative mechanism is itself designed for "continuous evolution." Xiaomi chose蓄力一击, with each version representing a major leap in parameter scale and architecture.

Anonymous for 8 Days, Topping OpenRouter

Beyond the technical route, Xiaomi's release strategy also broke industry conventions.

According to a Reuters report, on March 11, an anonymous model named Hunter Alpha appeared on OpenRouter, the world's largest API aggregation platform. No brand endorsement, no launch event, no technical blog. Its API pricing was extremely low, yet its performance was surprisingly strong.

The community began speculating about its origin. According to Republic World and multiple tech media reports, the most mainstream guess was DeepSeek V4, as MiMo team leader Luo Fuli had previously conducted research at DeepSeek. Call volume surged rapidly, exceeding 1 trillion tokens during the anonymous period, topping the OpenRouter weekly chart.

In the early hours of March 19, Xiaomi revealed the answer: Hunter Alpha was MiMo-V2-Pro. According to the same Reuters report, Xiaomi's Hong Kong stock saw a gain of up to 5.8% after the reveal.

This was the first time a domestic large model proved itself on a global platform through pure blind testing. Relying not on brand or宣传, but letting developers vote with their feet over 8 days.

Criptos en tendencia

CitreaCTR

wrapped stUSDTWSTUSDT

Preguntas relacionadas

QWhat are the two Chinese companies that recently released their Agent-oriented large models, and what are the model names?

AMiniMax released the M2.7 model, and Xiaomi's MiMo team released the V2-Pro model.

QHow does the API output pricing of MiniMax M2.7 and MiMo-V2-Pro compare to Claude Opus 4.6?

AThe API output price for MiniMax M2.7 is $1.2 per million tokens, which is 1/21 of Claude Opus 4.6's $25. MiMo-V2-Pro is $3 per million tokens, which is 1/8 of Claude Opus 4.6's price.

QWhat are the core technical approaches of MiMo-V2-Pro and MiniMax M2.7?

AMiMo-V2-Pro follows a 'scale-up' approach with over 1 trillion total parameters and a Hybrid Attention mechanism. MiniMax M2.7 uses a 'self-iterative evolution' mechanism where the model autonomously runs optimization cycles to improve its own performance.

QWhat was unique about Xiaomi's release strategy for the MiMo-V2-Pro model?

AXiaomi first released the model anonymously on OpenRouter under the name 'Hunter Alpha' for 8 days. It gained significant developer traction and topped the OpenRouter weekly chart before Xiaomi revealed it was their model.

QHow did the iteration rhythms of MiniMax and Xiaomi's MiMo team differ?

AMiniMax iterated rapidly, releasing four versions in five months (approx. every 49 days). Xiaomi's MiMo team had longer release intervals with larger parameter scale jumps between versions, such as from 7B parameters to 309B, and then to 1T.

Lecturas Relacionadas

Analyzing the Impact of AI on Economic Growth and Productivity

**Title: Analyzing AI's Impact on Economic Growth and Productivity** This article examines three contrasting views on AI's influence on economic growth and productivity. **The Optimistic View** posits that AI, especially through automating R&D ("recursive self-improvement"), could dramatically accelerate growth, even triggering a technological "singularity" with explosive, potentially infinite, economic expansion. **The Moderate/Mainstream View** acknowledges AI's productivity benefits but emphasizes significant real-world constraints that could limit its impact. These include: limited cost savings per task, structural ceilings on which jobs and industries are "exposed" to AI, adoption bottlenecks (e.g., compute, energy, regulatory hurdles), and the "weak link" effect where non-automatable tasks cap overall gains. Consequently, the realized AI dividend may be far lower than optimistic projections, with estimates typically ranging from 0.1% to 1.3% annual productivity growth. **The Pessimistic View** stems from two strands. The first aligns with the moderate view but applies extremely conservative assumptions about task exposure and efficiency gains, yielding minimal projected impact. The second introduces a demand-side critique: if AI primarily replaces rather than augments labor, it could depress labor's share of income, weaken consumer demand, and create a "demand trap" that ultimately stifles growth, unless offset by redistribution policies. **The authors' assessment** is nuanced: * **Short-term (1-2 years):** AI will support growth primarily through investment spending, not significant productivity gains. * **Medium-term (3-5 years):** Three potential paths emerge based on AI demand and bottleneck severity: 1. **"Optimistic Path":** High demand, few bottlenecks. Rapid productivity gains but risk of major job displacement and social conflict without redistribution. 2. **"Moderate Path" (most likely):** High demand but significant, surmountable bottlenecks. Leads to moderate productivity gains, financial market volatility (K-shaped returns), and sectoral job losses. 3. **"Pessimistic Path":** Low demand or severe bottlenecks. Minimal productivity and growth impact, triggering financial market corrections but allowing a smoother societal transition with less labor disruption. * **Long-term:** AI holds potential for a major productivity revolution and prosperity. The conclusion stresses that no path is smooth. Technologically "optimistic" outcomes could be socially detrimental, while "pessimistic" technological diffusion might be more socially stable. Policymakers must monitor developments and prepare balanced responses to manage economic, financial, and social sustainability.

marsbitHace 6 min(s)

Analyzing the Impact of AI on Economic Growth and Productivity

marsbitHace 6 min(s)

IRS Fraud Alert: Fake Letters Target Crypto Holders in Data-Stealing Scam

The IRS Criminal Investigation unit has warned cryptocurrency holders of a sophisticated phishing scam involving fake paper letters that mimic official IRS correspondence. These letters, which include official formatting and a deadline, instruct recipients to enroll in a non-existent "Digital Asset Compliance Portal." A QR code inside the letter directs victims to a spoofed IRS website designed to steal personal information and, in some cases, trick them into directly transferring cryptocurrency to scammers. The scam exploits the fact that the IRS has legitimately sent compliance letters regarding digital assets since 2019, making such mail seem plausible. The agency clarifies it never uses QR codes in official letters, never asks for digital asset transfers as part of compliance, and does not operate a digital asset enrollment portal. Legitimate notices can be verified via IRS.gov. If you receive such a letter, do not scan the QR code, call any listed numbers, or enter personal information on any linked site. Verify your account status only through official IRS tools. This scam undermines trust in genuine IRS communications during a period of increased crypto enforcement.

TheNewsCryptoHace 10 min(s)

IRS Fraud Alert: Fake Letters Target Crypto Holders in Data-Stealing Scam

TheNewsCryptoHace 10 min(s)

The New Cold War is a Tech Stock War

The New Cold War is a Tech Stock War The article argues that the contemporary geopolitical and economic rivalry between the US and China represents a "New Cold War," but one fundamentally fought through technology and financial markets, not physical barriers or conventional trade. Historically, US dominance was secured through financial systems. The Soviet Union, reliant on the rigid "Transferable Ruble," was ultimately undermined by its dependency on the US dollar for oil trade. Later, Japan's semiconductor challenge was countered not just by tariffs (e.g., Plaza Accord, 301 investigations) but by binding it to US Treasury bonds. China presents a more complex, "embedded" challenger. While it holds vast dollar reserves and US debt like Japan, its industrial base is stronger and more diversified than the Soviet Union's. Surviving the initial 2018 trade war phase, the conflict has evolved into a "tech-financial war." The core battlefield is now the stock market. US tech stocks (AI, semiconductors) are treated as sovereign assets, buoyed by bipartisan national will. China is pushing to strengthen its own financial markets to convert industrial strength into financial power and fund its tech ambitions. Companies like ChangXin (semiconductors), Moonshot AI, and DJI compete not just for market share but as financial proxies for their respective systems. The new paradigm is moving from globally efficient monopolies (Apple, Google) towards companies that achieve monopolistic profits within their respective geopolitical spheres. This competition over "pricing power" and financial valuation in segmented markets defines the current era, making the stock market the primary arena for this tech-centric struggle.

marsbitHace 15 min(s)

RWA Weekly: Ten European Financial Institutions Establish Tokenized Asset Cooperative; Ondo Launches New Execution Network Ondo Network

RWA Weekly: European Banks Form Tokenized Asset Cooperative; Ondo Launches New Execution Network Ondo Network Covering July 24-31, 2026, the RWA sector saw a steady on-chain total value locked (TVL) of $36.8 billion, with holder count hitting a record high. However, stablecoin transfer volumes fell sharply (~30%), indicating low on-chain settlement demand. Key regulatory moves include South Korea advancing stablecoin legislation and a push to scrap crypto taxes, Kenya lowering capital requirements for stablecoin issuers, and Zimbabwe approving seven projects for its crypto sandbox. In project developments, BIS-led Project Agorá successfully tested cross-border payments with tokenized funds across six currencies. Ten major European financial institutions formed the RL1 blockchain cooperative to build tokenized asset infrastructure. Other notable updates: Aviva launched a tokenized dollar liquidity fund on XRPL, POSCO International tokenized commercial invoices on Injective, and a Brazilian farmer used tokenized cattle as collateral for a loan. Additional progress includes BNY Mellon migrating its core transfer agent operations to blockchain, Securitize gaining SEC investment advisor registration, and Tether’s compliant stablecoin USA₮ launching on Celo. Ondo Finance introduced Ondo Network, a new execution layer focused on speed and privacy, moving away from its initial chain plans. An analysis highlights that despite the growing scale of on-chain RWAs (~$32B), approximately 90% remain underutilized in DeFi, pointing to a critical challenge in unlocking liquidity and fostering real-world application beyond mere issuance.

marsbitHace 16 min(s)

RWA Weekly: Ten European Financial Institutions Establish Tokenized Asset Cooperative; Ondo Launches New Execution Network Ondo Network

marsbitHace 16 min(s)

South Korean Stock Market Sees Sharp Rebound After Forceful De-leveraging, SK Hynix Rises 30%

On July 31, South Korean stocks staged a historic rebound. The benchmark KOSPI index surged 18.27%, with chipmaker SK Hynix hitting a 30% gain limit. This followed a brutal, near-40% decline in the KOSPI over the previous month, driven largely by a deleveraging spiral involving leveraged ETFs. Analysts attributed the sharp sell-off to structural liquidity issues rather than deteriorating corporate fundamentals. The rally was triggered by a confluence of positive catalysts. Firstly, strong earnings from U.S. cloud giants Microsoft and Amazon alleviated fears of an "AI bubble burst," boosting global tech sentiment. Secondly, SK Group Chairman Chey Tae-won made a rare personal purchase of SK Hynix shares, seen as a strong vote of confidence. Thirdly, the South Korean government announced a 20 trillion won ($139 billion) AI investment fund. In response to the market turmoil, South Korean regulators are tightening controls on leveraged ETFs, admitting oversight shortcomings. Measures include raising minimum cash保证金 requirements for散户 investors and suspending new product launches. While the rebound signals eased liquidity pressure, analysts note deep structural issues remain. The market's future stability is seen as dependent on global tech capital expenditure trends and memory chip price cycles, with some viewing the surge as a technical correction rather than a definitive trend reversal.

marsbitHace 36 min(s)

South Korean Stock Market Sees Sharp Rebound After Forceful De-leveraging, SK Hynix Rises 30%

marsbitHace 36 min(s)

Trading

Spot

Artículos destacados

Cómo comprar WAR

¡Bienvenido a HTX.com! Hemos hecho que comprar WAR (WAR) sea simple y conveniente. Sigue nuestra guía paso a paso para iniciar tu viaje de criptos.Paso 1: crea tu cuenta HTXUtiliza tu correo electrónico o número de teléfono para registrarte y obtener una cuenta gratuita en HTX. Experimenta un proceso de registro sin complicaciones y desbloquea todas las funciones.Obtener mi cuentaPaso 2: ve a Comprar cripto y elige tu método de pagoTarjeta de crédito/débito: usa tu Visa o Mastercard para comprar WAR (WAR) al instante.Saldo: utiliza fondos del saldo de tu cuenta HTX para tradear sin problemas.Terceros: hemos agregado métodos de pago populares como Google Pay y Apple Pay para mejorar la comodidad.P2P: tradear directamente con otros usuarios en HTX.Over-the-Counter (OTC): ofrecemos servicios personalizados y tipos de cambio competitivos para los traders.Paso 3: guarda tu WAR (WAR)Después de comprar tu WAR (WAR), guárdalo en tu cuenta HTX. Alternativamente, puedes enviarlo a otro lugar mediante transferencia blockchain o utilizarlo para tradear otras criptomonedas.Paso 4: tradear WAR (WAR)Tradear fácilmente con WAR (WAR) en HTX's mercado spot. Simplemente accede a tu cuenta, selecciona tu par de trading, ejecuta tus trades y monitorea en tiempo real. Ofrecemos una experiencia fácil de usar tanto para principiantes como para traders experimentados.

228 Vistas totalesPublicado en 2024.12.11Actualizado en 2026.06.02

Discusiones

Bienvenido a la comunidad de HTX. Aquí puedes mantenerte informado sobre los últimos desarrollos de la plataforma y acceder a análisis profesionales del mercado. A continuación se presentan las opiniones de los usuarios sobre el precio de WAR (WAR).