After 10 Years, Altman Finally Has the Person He Wanted

marsbitPublished on 2026-06-18Last updated on 2026-06-18

Abstract

After a decade of waiting, OpenAI CEO Sam Altman has finally secured his desired collaborator: Noam Shazeer, a legendary AI researcher and co-author of the seminal "Attention Is All You Need" paper that introduced the Transformer architecture. Shazeer has announced his departure from Google to join OpenAI as Head of Architectural Research. Shazeer, a crucial early Google employee who returned to Google DeepMind in a high-profile $2.7 billion deal two years ago, confirmed his move on social media platform X. Altman expressed his long-standing desire to work with Shazeer, stating the 10-year wait would be worth it. OpenAI's research lead, Mark Chen, welcomed Shazeer, highlighting his foundational work on Transformer, Mixture-of-Experts (MoE) models, and efficient decoding, which have profoundly shaped modern AI. His departure is seen as a significant blow to Google's Gemini project, where he served as a technical co-lead. Industry observers note this move represents a major win for OpenAI in the ongoing AI talent war, with some quipping that OpenAI acquired his expertise "for free" after Google's massive investment.

Noam Shazeer, the legendary AI figure whom Google brought back two years ago with a $2.7 billion deal, announced his departure from Google to join OpenAI.

Just now, Noam Shazeer confirmed this news on X. He stated that he is joining OpenAI and looks forward to working with the excellent team there. He also expressed that leaving Google was not an easy decision; he is immensely proud of the Google team and everything they have built together, and it has been an honor and a pleasure to work alongside these colleagues.

Subsequently, Sam Altman said, "From the very beginning of OpenAI's founding, Noam has been one of the people I most wanted to work with. It just took 10 years to finally get the chance. I believe the wait will be worth it!"

OpenAI research lead Mark Chen and others also responded on X: "A huge welcome to Noam Shazeer joining OpenAI as Head of Architecture Research! His research contributions in Transformer, MoE (Mixture of Experts), and efficient decoding have profoundly shaped the development of modern AI.

He not only holds a strong belief in achieving AGI but also has deeply considered insights on how to ensure its smooth development. Welcome, Noam!"

Noam Shazeer

Noam Shazeer is one of Google's most important early employees. He joined Google at the end of 2000, serving as a Principal Software Engineer responsible for early advertising systems.

Noam Shazeer is no ordinary researcher. He is one of the co-authors of the seminal 2017 paper "Attention Is All You Need." The Transformer architecture proposed in this paper later became the most crucial technical foundation of the large language model era. From GPT and Gemini to Claude, and nearly all mainstream large models today, they all rely on the Transformer technical lineage.

More importantly, Shazeer's contributions extend beyond Transformer itself.

Long before large models truly entered the phase of scale competition, he had been focused on model scaling, sparse computation, and training of massive models. He co-proposed Sparsely-Gated Mixture-of-Experts, an important early foundation for the later MoE approach; Switch Transformer further pushed sparse expert models to the trillion-parameter scale. Today, MoE has become one of the key approaches for frontier models to increase parameter scale while controlling inference costs.

In 2021, Noam Shazeer left Google, disappointed by the bureaucracy at the search giant, and co-founded Character.AI with Daniel De Freitas. That company once became one of the most watched AI startups.

In 2024, Google reached a technology licensing deal with Character.AI and brought Shazeer and others back to Google DeepMind. Subsequently, he was appointed as co-head of Gemini technology, involved in Google's core large model project.

Now, in less than two years, he has turned to join OpenAI. This is truly harsh news for Gemini's development.

Many netizens believe that during the critical stage where Gemini still needs to continuously strengthen model capabilities and engineering systems, losing a figure like Shazeer will be a significant talent drain.

Others were more blunt in their jokes: Gemini wasn't that great to begin with; now with a core figure leaving, it's completely finished.

Another netizen joked: Google paid $2.7 billion for Shazeer's intellectual property. And OpenAI got these patents for free. This is the most favorable acquisition price in tech history.

In this AI talent war, Shazeer's addition is undoubtedly a significant victory for OpenAI.

This article is from the WeChat public account "Machine Heart" (ID: almosthuman2014), author: Focus on AI Big Shots.

Trending Cryptos

Related Questions

QWho is Noam Shazeer and what are his key contributions to AI?

ANoam Shazeer is a prominent AI researcher and a co-author of the seminal 2017 paper "Attention Is All You Need," which introduced the Transformer architecture. He is also known for his foundational work on Sparsely-Gated Mixture-of-Experts (MoE) and efficient decoding, technologies that are critical to modern large language models.

QWhy did Sam Altman say it took 10 years to get Noam Shazeer to join OpenAI?

ASam Altman stated that Noam Shazeer had been one of the people he most wanted to work with since OpenAI's founding. The wait of 10 years refers to the period from OpenAI's establishment to finally securing Shazeer's collaboration.

QWhat position did Noam Shazeer hold at Google before joining OpenAI, and why is his departure significant?

ABefore joining OpenAI, Noam Shazeer was a co-technical lead for the Gemini project at Google DeepMind. His departure is seen as a significant talent loss for Google, potentially impacting the Gemini project's development during a critical phase of strengthening its capabilities.

QWhat was the reported financial deal between Google and Character.AI involving Noam Shazeer?

AAccording to the article, Google reportedly struck a $2.7 billion deal for a technology license with Character.AI, the company co-founded by Shazeer, which also led to his return to Google in 2024.

QHow did the AI community react to Noam Shazeer's move to OpenAI?

AReactions were mixed. Some viewed it as a major win for OpenAI in the AI talent war. Others expressed concern for Google's Gemini project, with some joking that OpenAI got Shazeer's expertise 'for free' while Google had paid billions, and some even suggesting it was a devastating blow for Gemini.

Related Reads

NVIDIA CPU Advances, China's RISC-V Responds: Semiconductor Deep Dive - Part Four

NVIDIA is set to launch its new Vera AI data center CPU in China as early as August, with high pricing. While this move offers a new option, it highlights China's continued dependence on foreign-controlled Arm architecture. In response, the Chinese semiconductor industry is increasingly turning to RISC-V as a strategic alternative for achieving high-performance computing autonomy. The article explores the concept of the "impossible triangle" in CPU development—balancing prosperity, control, and autonomy—and posits that RISC-V's open-source, modular nature offers a unique path to achieving all three. While RISC-V is already dominant in embedded systems, the focus is now shifting to data centers and AI workloads. China has become a global hotspot for RISC-V development, driven by AI-driven compute demand, supply chain concerns from export controls, cost benefits of open-source, and strong policy support. Multiple Chinese companies have reportedly crossed the key performance threshold of 15 SPECint per GHz, a benchmark for entering the high-performance CPU club. Progress extends beyond single-core benchmarks. Companies are developing complete computing subsystems, including commercial-grade coherent network-on-chip (NoC) technology and server processors with up to 40 cores that strictly adhere to the RVA23 standard to ensure software compatibility. Real-world applications are emerging in areas like video transcoding and edge AI. However, significant challenges remain. The RISC-V ecosystem faces fragmentation, immature toolchains and verification processes, and gaps in single-core performance and energy efficiency compared to mature x86 and Arm architectures. The formidable software moat, epitomized by NVIDIA's CUDA, is a long-term hurdle. In conclusion, while RISC-V cannot immediately replace offerings like NVIDIA's Vera, it represents a viable long-term path for China to develop a self-sufficient, high-performance CPU ecosystem. The journey is acknowledged to be long and arduous, requiring sustained effort to overcome technical and ecosystem challenges.

marsbit5h ago

NVIDIA CPU Advances, China's RISC-V Responds: Semiconductor Deep Dive - Part Four

marsbit5h ago

My Coding Betting Dashboard is Profiting, but Polymarket is Truly Not a Good Place for 'Arbitrage'

The author built a custom monitoring dashboard for Polymarket, a prediction market platform, and tested it with $1,600, achieving over 30% returns. However, the core argument is that Polymarket is not a good venue for traditional arbitrage. The dashboard has two main sections: a "Portfolio Dashboard" for tracking active positions with key metrics like total capital, P&L, and a risk-control module using a tier system (T1, T2, T3), and an "Opportunity Watchlist" for monitoring markets. The article details a critical structural trap in binary markets: a bet with a high perceived probability of success still carries a 100% loss risk if wrong. The author's T1/T2/T3 system is designed to manage this by limiting position sizes based on conviction and time horizon, emphasizing that high confidence should not equal high concentration. A key insight is the danger of "pseudo-diversification"—betting on different markets driven by the same underlying variable. The author concludes that Polymarket offers few true low-risk, arbitrage opportunities. It is instead a high-risk environment where wins can create a false sense of mastery, leading to large losses. The platform is better viewed as a training ground for honing judgment through disciplined, framework-driven betting rather than a reliable income source. The tools help transform intuition into structured, rule-based decisions to mitigate the risk of catastrophic errors.

marsbit8h ago

My Coding Betting Dashboard is Profiting, but Polymarket is Truly Not a Good Place for 'Arbitrage'

marsbit8h ago

WeChat AI Card Hands-On Guide: Has the AI Shopping Era Arrived?

**"WeChat AI Card" Practical Test Guide: Has the Era of AI Shopping Arrived?** WeChat has officially launched the "AI Exclusive Card," a feature integrated into its Workbuddy AI assistant. This card is designed to handle payments for AI-initiated purchases. Our hands-on test reveals it's not yet a tool for fully autonomous AI shopping, but rather a controlled payment layer for AI agents. The AI Card functions as an isolated sub-wallet within WeChat Pay. Users must bind the card and transfer funds into it from their main wallet. Crucially, every transaction requires explicit user confirmation via smartphone scan; AI cannot spend autonomously. Currently accessible through the Workbuddy agent, the card targets specific digital consumption scenarios: purchasing paid content (reports, data), calling paid APIs/tools, and subscribing to services. Its design prioritizes security and control by separating funds and mandating approval for each payment. We tested a real-world scenario: ordering bubble tea via Workbuddy using a "Meituan Life Assistant" skill. The process encountered multiple hurdles: high "skill" usage costs (exceeding daily free credits), and most importantly, while a payment was successfully initiated, the AI purchased an incorrect product (a mismatched group-buy coupon instead of the desired drink). This highlights the current limitation: the **AI Card only solves the payment step**. The broader challenge lies in the **AI agent's execution chain**—accurately understanding intent, navigating third-party platforms, selecting the right product, and ensuring proper fulfillment. The payment succeeded, but the purchase failed to meet the user's need. In conclusion, the WeChat AI Exclusive Card is a cautious, early-step experiment in AI commerce. It provides a secure, user-controlled payment method for agent interactions but is not yet capable of reliable, end-to-end complex purchases. For now, it's best used for low-value, low-risk digital services with careful user verification at each step. The vision of AI handling complete shopping tasks remains a work in progress.

marsbit10h ago

WeChat AI Card Hands-On Guide: Has the AI Shopping Era Arrived?

marsbit10h ago

Trading

Spot
Futures

Hot Articles

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of AI (AI) are presented below.

活动图片