Breaking: Google Gemini Co-Head Poached by OpenAI

marsbit發佈於 2026-06-18更新於 2026-06-18

文章摘要

In a significant industry move, Noam Shazeer, a former DeepMind researcher and co-lead of Google's Gemini project, has officially joined OpenAI as the Lead for Architecture Research. Shazeer is a legendary figure in AI, best known as a core author of the seminal 2017 paper "Attention Is All You Need," which introduced the Transformer architecture foundational to modern models like GPT and Gemini. Shazeer announced his move on social media, expressing excitement to join OpenAI's team while also thanking his former colleagues at Google. His career spans over 18 years at Google, where he contributed to key technologies including the Mixture of Experts (MoE) architecture, Mesh TensorFlow, and the LaMDA model. He left Google in 2021 to co-found the conversational AI company Character.AI, which achieved a multi-billion dollar valuation. In 2024, he returned to Google's DeepMind via a major technology licensing deal to help lead the Gemini project. His departure is viewed as a substantial loss for Google and a major talent acquisition for OpenAI, which is intensifying its competition with rivals like Anthropic. At OpenAI, Shazeer will focus on exploring next-generation AI model architectures and the evolution beyond the current Transformer paradigm. The move underscores the fierce competition for top AI research talent as companies race to develop advanced AI systems.

June 19, ZHIDONGSHE News, Today, Google DeepMind researcher and Gemini co-head Noam Shazeer has officially joined OpenAI, where he will serve as the Lead for Architecture Research. He will be responsible for exploring next-generation AI model architectures and driving the further evolution of the Transformer architecture.

Shazeer himself announced his move to OpenAI in a post on the overseas social platform X, stating: "I'm thrilled to join OpenAI and look forward to working with the exceptional team here."

Simultaneously, he expressed gratitude to the Google team: "Leaving was a difficult decision. I'm immensely proud of the Google team and everything we've accomplished together. It has been a privilege to work with you all." A Google spokesperson, in a response to Reuters, stated that the company is grateful for Shazeer's significant contributions over the years and wishes him all the best in his future endeavors.

Noam Shazeer is one of the legendary figures in the AI field. He is one of the core authors of the seminal 2017 paper 'Attention Is All You Need', which first proposed the Transformer architecture, directly establishing the technical foundation for modern large models like the GPT series, Gemini, and Claude.

'Attention Is All You Need'

Before joining OpenAI, Shazeer had left Google in 2021 to found Character.AI. In fact, Shazeer is best known as the Co-founder and CEO of Character.AI.

This company bet on the "AI companionship"赛道 even earlier than the ChatGPT explosion, allowing users to engage in long-term conversations with various AI characters. It once became one of the fastest-growing consumer AI applications globally. By 2023, Character.AI's valuation had exceeded $1 billion (approximately RMB 6.765 billion).

In 2024, Google reached a technology licensing agreement with Character.AI valued at approximately $2.7 billion (about RMB 18.266 billion), bringing Noam Shazeer and part of the core team back into DeepMind. Noam Shazeer was appointed as a Gemini co-head, involved in the pre-training R&D of the next-generation Gemini models.

For OpenAI, which is fiercely competing with Anthropic, this is viewed externally as one of the most significant top-tier talent acquisitions in recent years. Following the announcement, OpenAI's senior leadership and several renowned researchers immediately left welcoming messages on X.

OpenAI's Chief Research Officer, Mark Chen, posted: "Very excited to welcome Noam Shazeer to OpenAI as our Lead for Architecture Research. His work on Transformers, MoE, and efficient decoding has shaped modern AI."

Subsequently, numerous AI researchers also offered congratulations in the comments section, including Google DeepMind researcher and Chinese member of the Gemini Thinking and Coding team, Yuchen Zhuang; OpenAI researcher and core contributor to the o-series reasoning models, Noam Brown; and former Microsoft AI VP, now OpenAI researcher, Sebastien Bubeck, among others.

Google DeepMind Chinese researcher Yuchen Zhuang

OpenAI o-series reasoning model core contributor Noam Brown

OpenAI researcher Sebastien Bubeck

Meanwhile, netizens lamented: "Losing a Transformer author and Gemini co-head is undoubtedly a heavy blow for Google."

01. Renowned Transformer Author, Worked at Google for Nearly 18 Years

From the perspective of the history of generative AI, Noam Shazeer has participated in nearly every key milestone. He joined Google in 2000, serving as a Software Engineer and later Principal Software Engineer, accumulating over 18 years of work experience.

Noam Shazeer's Work Experience and Education Background (Source: LinkedIn)

In 2017, he, along with Ashish Vaswani, Jakob Uszkoreit, and six other Google researchers, co-authored the seminal paper 'Attention Is All You Need', proposing the Transformer architecture. Compared to the then-dominant RNN and LSTM models, the Transformer could process long text more efficiently and possessed greater scalability.

Over the past few years, models such as OpenAI's GPT series, Google's Gemini, Anthropic's Claude, as well as DeepSeek and Llama, are almost all built upon the Transformer architecture at their core. In other words, today's global wave of large models largely rests on the technical foundation laid by this paper.

However, the Transformer is just one of Noam Shazeer's many notable contributions.

During his tenure at Google, he also contributed to the advancement of numerous influential large model technologies. In 2017, as the first author, he proposed the Sparse Gated Mixture of Experts (MoE) architecture, providing crucial technical ideas for later models like GPT-4, Gemini, and DeepSeek-V3; in 2018, he participated in developing Mesh TensorFlow, providing foundational tools for super-large-scale Transformer training; subsequently, he was also involved in key project R&D such as the T5 model and Google's dialogue model LaMDA.

02. Left Google to Start a Company, Then Bought Back for $18.2B

In 2021, Shazeer left Google and co-founded Character.AI with Daniel De Freitas.

At that time, large language models had not yet experienced their "ChatGPT moment," but Character.AI was already pioneering efforts to bring chatbot products to the mass market, rapidly accumulating a large user base.

In 2024, Google reincorporated Shazeer and his core team into the DeepMind ecosystem through a cooperation deal worth approximately $2.7 billion (about RMB 18.266 billion). He subsequently became one of the key leaders of the Gemini project, participating in the pre-training of the next-generation Gemini models.

Shazeer's return coincided with a period of immense pressure on Google's AI business. ChatGPT had exploded in popularity, and Gemini was still in a catch-up phase. After returning to DeepMind, Shazeer participated in model R&D and eventually assumed the role of Gemini co-head, becoming one of the leading figures in Google's AI technology.

Following this, the Gemini 3 series models ranked at the forefront in multiple benchmark tests such as coding and reasoning, becoming a significant asset in Google's competition with OpenAI and Anthropic.

From Google researcher, to entrepreneur, to Gemini co-head, Shazeer has witnessed nearly every major turning point in Google's AI development over the past decade. Therefore, his departure from Google to join OpenAI is regarded by many industry insiders as one of the most significant talent loss events for Google in recent years.

03. OpenAI's Talent Battle Continues to Escalate

Shazeer's joining unfolds against the backdrop of increasingly fierce talent competition in the AI industry.

Over the past year, competition between OpenAI and Anthropic has continued to intensify. The two are not only competing over model capabilities but are also constantly vying for top researchers and core engineers.

The UK's Financial Times reported that OpenAI internally views Shazeer's addition as a major reinforcement. In the future, he will focus on researching new architectural directions beyond the Transformer, and on further enhancing model capabilities.

It's worth noting that the Transformer has dominated the AI field for nearly a decade. With the development of reasoning models, multi-agent systems, and world models, more and more researchers are beginning to ponder: Will the Transformer undergo its next major architectural upgrade?

And Shazeer is precisely one of the most qualified individuals to answer that question.

For Google, this means losing a Transformer author, a Gemini co-head, and one of its most seasoned AI architecture designers. For OpenAI, its laboratory welcomes someone who has personally shaped the modern AI technology stack.

04. Conclusion: A Significant Talent Move in the Transformer Era

As technology gradually approaches the frontier, top researchers themselves have become one of the scarcest resources. The trajectory of Noam Shazeer's career almost connects four key nodes: Transformer, Character.AI, Gemini, and OpenAI.

Now, this Transformer author's departure from Google and joining of OpenAI is not only a talent movement but also reflects the increasingly fierce competitive landscape among current AI giants.

Especially at a time when OpenAI and Anthropic are engaged in a white-hot competition over next-generation models, the enterprise market, and top research talent, and Google is also catching up through Gemini. Losing a Transformer founder and Gemini co-head at this juncture is undoubtedly a significant loss for Google.

For the entire industry, a subsequent question worth following is: When a Transformer author personally researches "architectures beyond the Transformer," where will the next generation of AI models head.

This article is from the WeChat public account "Zhixidongxi" (ID: zhidxcom), author: Jiang Yu, editor: Li Shuiqing

相關問答

QWho is Noam Shazeer and what is his significance in the AI field?

ANoam Shazeer is a renowned AI researcher, a core author of the seminal 2017 paper 'Attention Is All You Need' which introduced the Transformer architecture, and the former co-lead of Google's Gemini project. His work laid the technical foundation for modern large language models like GPT, Gemini, and Claude.

QWhat new role did Noam Shazeer take at OpenAI and what will he be responsible for?

ANoam Shazeer joined OpenAI as the Lead for Architecture Research. He will be responsible for exploring next-generation AI model architectures and driving the further evolution of the Transformer architecture.

QWhat was the nature of the significant deal between Google and Character.AI involving Shazeer in 2024?

AIn 2024, Google reached a technology licensing deal with Character.AI valued at approximately $2.7 billion (RMB 18.266 billion). This deal brought Noam Shazeer and part of the core team back into Google DeepMind, where he was appointed co-lead of the Gemini project.

QWhy is Shazeer's move to OpenAI considered a significant loss for Google?

AShazeer's departure is considered a significant loss for Google because he is a Transformer author, a former Gemini co-lead, and one of its most senior AI architecture designers. Losing such a key figure in the midst of intense competition with OpenAI and Anthropic is a substantial setback.

QWhat broader industry trend does Shazeer's career move highlight according to the article?

AShazeer's move highlights the increasingly fierce competition for top-tier AI research talent among major tech companies like OpenAI, Google, and Anthropic. As technology pushes the frontier, these researchers themselves become among the most scarce and valuable resources.

你可能也喜歡

EF史诗级重组:裁员20%、预算砍半,以太坊要轻装上阵?

以太坊基金会(EF)近日宣布进行大规模组织架构重组,将内部划分为协议层、接入层、用户层、社区层和机构层等多个职能集群。同时,基金会裁员约20%(54名员工),并计划在未来几年削减约40%的预算,年度支出率将从约15%逐步降至2030年后的约5%,向捐赠基金驱动的运营模式转型。 此次改革被视为EF对长期以来角色定位模糊、面临社区诸多质疑(如执行力不足、战略不清晰、持续出售ETH影响市场信心等)的系统性回应。EF明确将工作重心回归到协议研发、公共物品支持和生态协调等核心职能,主动收缩边界,将更多具体的生态建设工作交由市场与独立团队承担。 改革伴随着一些具体调整,例如PSE(隐私与扩容探索)团队将逐步退出,Devcon将转向更小规模模式,对以太坊外大型项目的投入也会减少。与此同时,以太坊生态内正在涌现新的独立组织(如由前EF研究员创立的Ethlabs),以及上市财库公司等力量,共同填补EF收缩后留下的空间,推动生态向更去中心化、协作式的结构演进。 Solana联合创始人toly对此评论表示看好,认为更精简的EF将能更专注、更快速地行动。此次重组标志着EF承认其能力有限,不再试图包办一切,而是推动以太坊生态进入一个由多方力量共同驱动发展的新阶段。

Odaily星球日报20 分鐘前

EF史诗级重组:裁员20%、预算砍半,以太坊要轻装上阵?

Odaily星球日报20 分鐘前

Dragonfly 合伙人 Haseeb: 未来增长最快的公司或都将卡在 149 人

Dragonfly合伙人Haseeb在文章中提出,未来增长最快的公司可能都会刻意将员工规模控制在149人以下,其根本原因在于当前大型AI模型公司(如Anthropic)的企业定价策略。 文章指出,AI公司为小规模团队(通常指少于150人)提供类似“团队订阅”的打包定价模式,用户每月支付固定费用即可获得大量代币额度。在这种模式下,额外使用代币的边际成本为零,极大地鼓励了初创公司和小团队进行大胆的AI实验和应用,最大化利用资源,几乎是在享受一种“创新补贴”。 然而,一旦公司规模超过150人,就必须切换到“企业版”定价。该模式按实际代币使用量收费,且据分析其毛利率高达75%左右。这意味着大型企业每多使用一个代币,都需要支付高昂的附加费用,形成了一种实质上的“AI劳动力税”。 这种定价差异造成了两种截然不同的激励机制: 1. **对小公司(<150人)**:零边际成本激励它们疯狂探索AI自动化,用尽订阅额度,力求以最少的人力创造最大产出。 2. **对大公司(>150人)**:高昂的边际成本抑制了其进行探索性、实验性的AI应用,只愿意将AI用于最成熟、批量化的任务,从而在边际上倾向于保留更多人类员工。 Haseeb将此比作一种由企业制定的“税收政策”,其核心断点就在150人。这可能导致一个奇特的经济现象:最具颠覆性的AI原生公司会想尽办法(如广泛使用AI智能体、频繁外包、严格控编)将正式员工数维持在149人以下,以保持成本优势。而大公司的劳动力替代可能不会以内部“AI裁员”的形式直接出现,而是因其在效率上输给这些灵活的小型竞争对手,导致市场收缩和间接裁员。 最终,文章认为,这种非官方的“定价税”可能在未来十年成为塑造企业形态和市场竞争格局的关键无形之手,促使一批高增长公司集体卡在149人的规模上。

marsbit31 分鐘前

Dragonfly 合伙人 Haseeb: 未来增长最快的公司或都将卡在 149 人

marsbit31 分鐘前

Dragonfly 合伙人 Haseeb: 为何未来增长最快的公司,或都将卡在 149 人

Dragonfly 合伙人 Haseeb 近日撰文分析了以 Anthropic 为代表的大模型公司定价策略可能引发的深远影响。文章指出,这类公司对小型企业(团队规模150人以下)采用类似健身房会员的“团队订阅”模式,用户只需支付固定月费即可使用大量代币,边际成本几乎为零;但对150人以上的大型企业,则强制切换到“企业版”,需按代币用量支付高昂的 API 费用,其毛利率可能高达75%。 Haseeb 将这种定价差异类比为一种“税收政策”:对初创公司补贴创新(边际税率为0),而对大企业则征收高额“AI劳动力税”。这导致了截然不同的激励效果: * **初创公司**:有极强动机最大化利用订阅额度(tokenmaxxing),疯狂探索和自动化,力求成为让大模型公司在订阅上“亏损”最多的用户。 * **大型企业**:由于每个额外代token都成本不菲,它们只会自动化最明显、批量大的任务,而抑制了边际性、实验性的自动化探索,倾向于保留更多人类岗位。 这种结构意味着,AI对劳动力的替代可能不会以“大公司直接用AI裁员”的形式大规模出现,而是表现为:**获得补贴的、高度自动化的AI原生初创公司,在市场竞争中击败背负高额“AI税”的大企业。** 大企业的裁员可能源于业务衰退(并美其名为“AI增效”),而获胜的初创公司永远不会重建那些旧岗位。 文章进一步指出,150人的定价“断点”可能像法国50人劳工法规一样,成为一个关键的“监管断点”(Notch),**强烈激励公司把规模控制在149人以内**,以维持低廉的AI使用成本。这或将催生一种全新的“AI优先”管理哲学:公司极度痴迷于用智能体替代人力,保持极小团队规模,以实现效率最大化。 Haseeb 总结道,代币定价策略虽非有意设计,但其影响堪比税收政策,可能在未来十年深刻塑造经济形态,决定哪些公司能够崛起以及它们如何组织自身。未来增长最快的公司,或许都将“卡在149人”的规模上。

链捕手40 分鐘前

Dragonfly 合伙人 Haseeb: 为何未来增长最快的公司,或都将卡在 149 人

链捕手40 分鐘前

交易

現貨
合約
活动图片