Wang Tao's First Large Model Brings Meta Back to the Table

marsbit | Published 2026-04-09 | Updated 2026-04-09

Introduction

Meta has unveiled Muse Spark, the first model in its new Muse series and a significant step in its AI strategy under Alexandr Wang, who joined the company about ten months ago. Designed to be compact and fast, Spark is a natively multimodal reasoning model that integrates visual information into its core architecture, enabling advanced capabilities like visual chain-of-thought reasoning. It excels in areas such as medical reasoning, visual and chart understanding, and scientific reasoning, though it lags in coding and long-form tasks compared to rivals like GPT-5.4 and Gemini 3.1 Pro. The model will power Meta AI across platforms like WhatsApp, Instagram, and Facebook, with private API access for select partners. The release follows Meta's shift from its earlier Llama series and significant internal investment, including a restructured AI stack and expanded infrastructure. Spark's introduction was followed by a 6.5% rise in Meta's stock at the close, reflecting market optimism. Meta plans to develop larger models in the Muse series, potentially adopting a hybrid open/closed-source approach moving forward.

By Alphabet AI

It has been about ten months since Wang Tao (Alexandr Wang) joined Meta. The world is about to move from one summer to another, and Meta's "Avocado" is finally ripe.

On April 8 local time, Meta officially announced the release of Spark, the first model in the Muse series. This is also the first dish served by Meta after recruiting Wang Tao and establishing the "Meta Superintelligence Labs (MSL)".

Wang Tao posted several messages on X to introduce the new model, stating: "Nine months ago, we rebuilt the AI technology stack from scratch, including new infrastructure, architecture, and data pipelines. Muse Spark is the result of this work."

Even Yann LeCun, Meta's former chief scientist who was rumored to have disagreements with Wang Tao, came to congratulate him, creating a harmonious atmosphere.

Meta emphasized that Spark was designed to be "small and fast." Leading with such a model, rather than "holding back for a big move" and waiting to release a dominant one, shows that Meta knows time is of the essence.

This move seems to have worked: Meta's stock price rose by as much as about 9% during the day.

01 New Model Muse Spark

First, let's take a look at what model Meta has released.

The new model is called Muse Spark, with Muse being the name of the model series. The name is quite interesting: Muse refers to the Muses, the Greek goddesses of inspiration, and Spark suggests the first spark of an idea.

Meta stated that Muse Spark is Meta's most powerful model to date. It currently powers Meta AI applications and websites and will be rolled out to WhatsApp, Instagram, Facebook, Messenger, and AI glasses in the coming weeks. Meta will also offer a private preview of the model via API to select partners.

Clearly, Meta wants to fully leverage its platform advantages, explicitly stating that Muse Spark is specifically built for Meta's products.

It will provide smarter and faster support for Meta AI and unlock new features over time, such as referencing recommended content and information shared by users on Instagram, Facebook, and Threads.

"We are moving toward the goal of personal superintelligence: creating an intelligent assistant that can help anyone, anytime, with the things they care about most."

Muse Spark is designed to be small and fast yet capable of handling complex problems in science, mathematics, and health. Its core is a natively multimodal reasoning model.

Unlike previous versions that "stitched" vision and text together, Muse Spark was rebuilt from the ground up, integrating visual information into its internal logic. This architectural shift enables a "visual chain of thought," allowing the model to annotate dynamic environments—for example, identifying components of a complex coffee machine or correcting a user's yoga posture through side-by-side video analysis.

However, the most important technical leap is the new "Contemplating" mode.

Meta claims that this feature coordinates multiple sub-agents for parallel reasoning, enabling Meta to compete with extreme reasoning models like Google's Gemini Deep Think and OpenAI's GPT-5.4 Pro.

In terms of single-model test results:

- PhD-level scientific reasoning (GPQA Diamond): Muse Spark achieved an accuracy of 89.5%, which is strong but still behind Gemini 3.1 Pro (94.3%), GPT-5.4 (92.8%), and Claude Opus 4.6 (92.7%).

- Chart and visual understanding (CharXiv Reasoning, in Contemplating mode): Scored 86.4, ahead of GPT-5.4 (82.8), Gemini 3.1 Pro (80.2), and Claude Opus 4.6 (65.3). Visual understanding and chart reasoning are among Muse Spark's standout strengths.

- Hard medical reasoning (HealthBench Hard): Scored 42.8%, narrowly ahead of GPT-5.4 (40.1%) and far ahead of Gemini 3.1 Pro (20.6%) and Claude Opus 4.6 (14.8%). Meta attributed this to targeted training in collaboration with over 1,000 doctors. Medical capability is one of the model's highlights.

- Software engineering and coding (SWE-Bench Verified): Scored 77.4%, behind Claude Opus 4.6 (80.8%) and Gemini 3.1 Pro (80.6%). Meta itself admitted that there is still a gap in long-horizon, multi-step autonomous tasks (agentic tasks) and complex coding workflows, requiring continued investment.

- Multimodal multidisciplinary understanding (MMMU Pro): Scored approximately 80.4–80.5%, second only to Gemini 3.1 Pro (83.9%).

Overall, Muse Spark excels in visual multimodal reasoning, medical fields, and efficient reasoning, making it particularly suitable for Meta's own social, content, and health ecosystems. However, it still has room for improvement in pure coding and long-chain autonomous tasks.
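The comparison above can be checked with a few lines of arithmetic. The following is a minimal sketch, not an official evaluation script: it uses only the scores quoted in this article, leaves out competitors whose scores were not reported for a given benchmark, and skips MMMU Pro because only an approximate range was given for Muse Spark. The names `SCORES` and `summarize` are hypothetical helpers introduced for illustration.

```python
# Scores exactly as quoted in the article; a model missing from a
# benchmark simply was not reported there.
SCORES = {
    "GPQA Diamond": {
        "Muse Spark": 89.5, "Gemini 3.1 Pro": 94.3,
        "GPT-5.4": 92.8, "Claude Opus 4.6": 92.7,
    },
    "CharXiv Reasoning": {
        "Muse Spark": 86.4, "Gemini 3.1 Pro": 80.2,
        "GPT-5.4": 82.8, "Claude Opus 4.6": 65.3,
    },
    "HealthBench Hard": {
        "Muse Spark": 42.8, "Gemini 3.1 Pro": 20.6,
        "GPT-5.4": 40.1, "Claude Opus 4.6": 14.8,
    },
    "SWE-Bench Verified": {
        "Muse Spark": 77.4, "Gemini 3.1 Pro": 80.6,
        "Claude Opus 4.6": 80.8,
    },
}


def summarize(scores, model="Muse Spark"):
    """Return (benchmark, rank, models compared, gap to best competitor)."""
    rows = []
    for bench, results in scores.items():
        # Rank models from highest to lowest score on this benchmark.
        ranked = sorted(results, key=results.get, reverse=True)
        rank = ranked.index(model) + 1
        # Positive gap: the model leads; negative gap: it trails.
        best_other = max(v for k, v in results.items() if k != model)
        gap = round(results[model] - best_other, 1)
        rows.append((bench, rank, len(ranked), gap))
    return rows


for bench, rank, total, gap in summarize(SCORES):
    print(f"{bench}: rank {rank} of {total}, gap to best competitor {gap:+.1f}")
```

On these figures Muse Spark leads on CharXiv Reasoning and HealthBench Hard, and trails every listed competitor on GPQA Diamond and SWE-Bench Verified, which matches the summary above.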

02 The "Avocado" Delayed Multiple Times

An interesting side note occurred on X.

As Meta's current AI leader, Wang Tao (Alexandr Wang) posted several messages on X promoting the new model.

At this point, someone pointed out that the benchmark chart provided was too misleading, "almost criminal." In this chart, Muse Spark's scores were placed in the first column and all highlighted in a prominent color. At first glance, it seemed to be leading across the board, but upon closer inspection, some scores were actually lower.

Playing tricks with charts is not new; OpenAI has been criticized for this multiple times before.

Interestingly, Wang Tao chose to immediately "apologize" in response to the criticism:

"Sorry, we did not mean to imply that our scores are the highest. On the contrary, most evaluation results show that our model still has much room for improvement. We will not make the same mistake again."

It is not hard to see that Meta does not intend for Muse Spark to achieve complete dominance but rather to return to the competition in AI.

From various signs, the Muse series is likely the project internally code-named "Avocado."

Avocado has been delayed for too long, and Meta has now adopted a "small first, then big" strategy. Meta emphasized in its official blog post that Spark focuses on being fast and small, and this is just the beginning:

"Our models are developing as expected. Muse Spark is an early data point in our development journey, and we are working on larger-scale models."

This differs from the AI industry's habit, especially among the top players, of trying to "make a splash" or "shock" with each release, but Meta really doesn't have time to take it slow.

Early last year, after Meta released the Llama 4 series, the model's performance did not meet expectations (especially the insufficient performance of the Behemoth large model), and further open-source development of the Llama series was paused.

By last summer, Meta invested $14.3 billion in Scale AI (acquiring a 49% stake) and directly recruited Scale AI founder and CEO, 28-year-old Wang Tao (Alexandr Wang), as Chief AI Officer, formally establishing the Meta Superintelligence Labs (MSL).

At the same time, Meta went on an aggressive poaching spree, recruiting dozens of top researchers from OpenAI, Google, and other companies with high salaries, with some offers reaching millions to hundreds of millions of dollars.

In terms of costs, Meta's full-year AI-related capital expenditure in 2025 reached $72.22 billion; the financial guidance in January 2026 indicated that this number would significantly increase to $115–135 billion, almost doubling, mainly for MSL's model training and data center expansion.
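As a quick check on the "almost doubling" description, the guidance range can be divided by the 2025 figure. This is a minimal sketch using only the numbers quoted in the paragraph above (in billions of US dollars); the constant and function names are hypothetical.

```python
# Figures quoted in the article, in billions of US dollars.
CAPEX_2025 = 72.22
GUIDANCE_2026 = (115.0, 135.0)  # low and high end of the January 2026 guidance


def growth_multiples(base, low, high):
    """Return the low and high guidance as multiples of the base year."""
    return low / base, high / base


low_x, high_x = growth_multiples(CAPEX_2025, *GUIDANCE_2026)
print(f"2026 guidance is {low_x:.2f}x to {high_x:.2f}x the 2025 figure")
print(f"Implied increase: {(low_x - 1) * 100:.0f}% to {(high_x - 1) * 100:.0f}%")
```

On these numbers the guidance works out to roughly 1.6 to 1.9 times 2025 spending, so "almost doubling" describes the upper end of the range.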

Over the past ten months, Meta, Zuckerberg, and Meta's AI head Wang Tao have all been under tremendous pressure. People were eager to see what dish would be served after Wang Tao joined and Meta reorganized.

At least judging by the market's initial reaction, Meta's strategy of serving a small dish first instead of "holding back for a big move" is working. Meta's stock surged nearly 9% at one point during the day and closed up 6.5%, its largest single-day gain since January this year.

One noteworthy detail is that outside observers had long believed "Avocado" would shift entirely to closed source, but Meta did not close the door this time. In the future, Meta may adopt a hybrid strategy of running open source and closed source in parallel, keeping flagship models and exclusive technologies internal while continuing to make new models available as open source to the broad developer community.

Meta has finally served "Avocado" on the table, but this is far from the end. For Wang Tao and Zuckerberg, Muse Spark is more like a starting gun. How the future unfolds depends on whether the promise of "getting stronger and stronger" can be fulfilled.

Related Questions

Q: What is the name of the first model in the Muse series released by Meta, and what is its key design philosophy?

A: The first model in the Muse series is called Muse Spark. Its key design philosophy is to be "small and fast."

Q: Which specific areas does the Muse Spark model excel in, according to the benchmark results mentioned in the article?

A: According to the benchmarks, Muse Spark excels in visual and multimodal reasoning (e.g., CharXiv Reasoning), hard medical reasoning (HealthBench Hard), and efficient inference.

Q: How did the market react to the announcement of the Muse Spark model?

A: The market reacted positively. Meta's stock price rose by as much as 9% during the day and closed up 6.5%, marking its largest single-day gain since January of that year.

Q: Who is Alexandr Wang, and what role does he play at Meta in relation to this new model?

A: Alexandr Wang is the Chief AI Officer at Meta. He leads the Meta Superintelligence Labs (MSL) and was instrumental in the development of the Muse Spark model, which is the first major release since he joined the company.

Q: What was the internal code name for the Muse project, and what strategy did Meta adopt for its release?

A: The internal code name for the project was "Avocado." Meta adopted a "small first, then large" strategy, releasing the smaller, faster Muse Spark first instead of waiting to release a much larger model.
