Breaking: Google Gemini Co-Head Poached by OpenAI

marsbit · Published 2026-06-18 · Updated 2026-06-18

Article Summary

In a significant industry move, Noam Shazeer, a former DeepMind researcher and co-lead of Google's Gemini project, has officially joined OpenAI as the Lead for Architecture Research. Shazeer is a legendary figure in AI, best known as a core author of the seminal 2017 paper "Attention Is All You Need," which introduced the Transformer architecture foundational to modern models like GPT and Gemini. Shazeer announced his move on social media, expressing excitement to join OpenAI's team while also thanking his former colleagues at Google. His career spans over 18 years at Google, where he contributed to key technologies including the Mixture of Experts (MoE) architecture, Mesh TensorFlow, and the LaMDA model. He left Google in 2021 to co-found the conversational AI company Character.AI, which achieved a multi-billion dollar valuation. In 2024, he returned to Google's DeepMind via a major technology licensing deal to help lead the Gemini project. His departure is viewed as a substantial loss for Google and a major talent acquisition for OpenAI, which is intensifying its competition with rivals like Anthropic. At OpenAI, Shazeer will focus on exploring next-generation AI model architectures and the evolution beyond the current Transformer paradigm. The move underscores the fierce competition for top AI research talent as companies race to develop advanced AI systems.

ZHIDONGSHE News, June 19: Google DeepMind researcher and Gemini co-head Noam Shazeer officially joined OpenAI today as Lead for Architecture Research. He will be responsible for exploring next-generation AI model architectures and driving the further evolution of the Transformer architecture.

Shazeer himself announced his move to OpenAI in a post on the overseas social platform X, stating: "I'm thrilled to join OpenAI and look forward to working with the exceptional team here."

He also thanked the Google team: "Leaving was a difficult decision. I'm immensely proud of the Google team and everything we've accomplished together. It has been a privilege to work with you all." A Google spokesperson told Reuters that the company is grateful for Shazeer's significant contributions over the years and wishes him all the best in his future endeavors.

Noam Shazeer is one of the legendary figures in the AI field. He is one of the core authors of the seminal 2017 paper 'Attention Is All You Need', which first proposed the Transformer architecture, directly establishing the technical foundation for modern large models like the GPT series, Gemini, and Claude.

'Attention Is All You Need'

Before joining OpenAI, Shazeer had left Google in 2021 to found Character.AI, and he is also widely known as its co-founder and CEO.

The company bet on the "AI companionship" market even before ChatGPT took off, allowing users to hold long-running conversations with a variety of AI characters. It was at one point among the fastest-growing consumer AI applications in the world. By 2023, Character.AI's valuation had exceeded $1 billion (approximately RMB 6.765 billion).

In 2024, Google reached a technology licensing agreement with Character.AI valued at approximately $2.7 billion (about RMB 18.266 billion), bringing Noam Shazeer and part of the core team back into DeepMind. Noam Shazeer was appointed as a Gemini co-head, involved in the pre-training R&D of the next-generation Gemini models.

For OpenAI, which is fiercely competing with Anthropic, this is viewed externally as one of the most significant top-tier talent acquisitions in recent years. Following the announcement, OpenAI's senior leadership and several renowned researchers immediately left welcoming messages on X.

OpenAI's Chief Research Officer, Mark Chen, posted: "Very excited to welcome Noam Shazeer to OpenAI as our Lead for Architecture Research. His work on Transformers, MoE, and efficient decoding has shaped modern AI."

Subsequently, numerous AI researchers also offered congratulations in the comments section, including Google DeepMind researcher and Chinese member of the Gemini Thinking and Coding team, Yuchen Zhuang; OpenAI researcher and core contributor to the o-series reasoning models, Noam Brown; and former Microsoft AI VP, now OpenAI researcher, Sebastien Bubeck, among others.

Google DeepMind Chinese researcher Yuchen Zhuang

OpenAI o-series reasoning model core contributor Noam Brown

OpenAI researcher Sebastien Bubeck

Meanwhile, netizens lamented: "Losing a Transformer author and Gemini co-head is undoubtedly a heavy blow for Google."

01. Renowned Transformer Author, Worked at Google for Over 18 Years

From the perspective of the history of generative AI, Noam Shazeer has participated in nearly every key milestone. He joined Google in 2000, serving as a Software Engineer and later Principal Software Engineer, accumulating over 18 years of work experience.

Noam Shazeer's Work Experience and Education Background (Source: LinkedIn)

In 2017, he, along with Ashish Vaswani, Jakob Uszkoreit, and six other Google researchers, co-authored the seminal paper 'Attention Is All You Need', proposing the Transformer architecture. Compared to the then-dominant RNN and LSTM models, the Transformer could process long text more efficiently and possessed greater scalability.

Over the past few years, models such as OpenAI's GPT series, Google's Gemini, Anthropic's Claude, DeepSeek, and Llama have almost all been built on the Transformer architecture at their core. In other words, today's global wave of large models largely rests on the technical foundation laid by this paper.

However, the Transformer is just one of Noam Shazeer's many notable contributions.

During his tenure at Google, he also contributed to numerous influential large-model technologies. In 2017, as first author, he proposed the Sparsely-Gated Mixture-of-Experts (MoE) architecture, which provided crucial technical ideas for later models such as GPT-4, Gemini, and DeepSeek-V3. In 2018, he helped develop Mesh TensorFlow, a foundational tool for training very large Transformers. He was later involved in key projects such as the T5 model and Google's dialogue model LaMDA.

02. Left Google to Start a Company, Then Bought Back for $2.7B

In 2021, Shazeer left Google and co-founded Character.AI with Daniel De Freitas.

At that time, large language models had not yet experienced their "ChatGPT moment," but Character.AI was already pioneering efforts to bring chatbot products to the mass market, rapidly accumulating a large user base.

In 2024, Google brought Shazeer and his core team back into the DeepMind ecosystem through a cooperation deal worth approximately $2.7 billion (about RMB 18.266 billion). He subsequently became one of the key leaders of the Gemini project, participating in the pre-training of the next-generation Gemini models.

Shazeer's return coincided with a period of immense pressure on Google's AI business. ChatGPT had exploded in popularity, and Gemini was still in a catch-up phase. After returning to DeepMind, Shazeer participated in model R&D and eventually assumed the role of Gemini co-head, becoming one of the leading figures in Google's AI technology.

Following this, the Gemini 3 series models ranked at the forefront in multiple benchmark tests such as coding and reasoning, becoming a significant asset in Google's competition with OpenAI and Anthropic.

From Google researcher, to entrepreneur, to Gemini co-head, Shazeer has witnessed nearly every major turning point in Google's AI development over the past decade. Many industry insiders therefore regard his departure for OpenAI as one of Google's most significant talent losses in recent years.

03. OpenAI's Talent Battle Continues to Escalate

Shazeer's move comes against the backdrop of increasingly fierce talent competition in the AI industry.

Over the past year, competition between OpenAI and Anthropic has continued to intensify. The two are not only competing over model capabilities but are also constantly vying for top researchers and core engineers.

The UK's Financial Times reported that OpenAI internally views Shazeer's addition as a major reinforcement. In the future, he will focus on researching new architectural directions beyond the Transformer, and on further enhancing model capabilities.

It's worth noting that the Transformer has dominated the AI field for nearly a decade. With the development of reasoning models, multi-agent systems, and world models, more and more researchers are beginning to ponder: Will the Transformer undergo its next major architectural upgrade?

And Shazeer is precisely one of the most qualified individuals to answer that question.

For Google, this means losing a Transformer author, a Gemini co-head, and one of its most seasoned AI architecture designers. For OpenAI, its laboratory welcomes someone who has personally shaped the modern AI technology stack.

04. Conclusion: A Significant Talent Move in the Transformer Era

As technology gradually approaches the frontier, top researchers themselves have become one of the scarcest resources. The trajectory of Noam Shazeer's career almost connects four key nodes: Transformer, Character.AI, Gemini, and OpenAI.

Now, this Transformer author's move from Google to OpenAI is more than a change of employer: it also reflects the increasingly fierce competition among today's AI giants.

OpenAI and Anthropic are engaged in white-hot competition over next-generation models, the enterprise market, and top research talent, while Google is catching up through Gemini. Losing a Transformer author and Gemini co-head at this juncture is undoubtedly a significant loss for Google.

For the industry as a whole, one question is worth following: when a Transformer author personally researches "architectures beyond the Transformer," where will the next generation of AI models head?

This article is from the WeChat public account "Zhixidongxi" (ID: zhidxcom), author: Jiang Yu, editor: Li Shuiqing

Related Q&A

Q: Who is Noam Shazeer and what is his significance in the AI field?

A: Noam Shazeer is a renowned AI researcher, a core author of the seminal 2017 paper 'Attention Is All You Need' which introduced the Transformer architecture, and the former co-lead of Google's Gemini project. His work laid the technical foundation for modern large language models like GPT, Gemini, and Claude.

Q: What new role did Noam Shazeer take at OpenAI and what will he be responsible for?

A: Noam Shazeer joined OpenAI as the Lead for Architecture Research. He will be responsible for exploring next-generation AI model architectures and driving the further evolution of the Transformer architecture.

Q: What was the nature of the significant deal between Google and Character.AI involving Shazeer in 2024?

A: In 2024, Google reached a technology licensing deal with Character.AI valued at approximately $2.7 billion (RMB 18.266 billion). This deal brought Noam Shazeer and part of the core team back into Google DeepMind, where he was appointed co-lead of the Gemini project.

Q: Why is Shazeer's move to OpenAI considered a significant loss for Google?

A: Shazeer's departure is considered a significant loss for Google because he is a Transformer author, a former Gemini co-lead, and one of its most senior AI architecture designers. Losing such a key figure in the midst of intense competition with OpenAI and Anthropic is a substantial setback.

Q: What broader industry trend does Shazeer's career move highlight according to the article?

A: Shazeer's move highlights the increasingly fierce competition for top-tier AI research talent among major tech companies like OpenAI, Google, and Anthropic. As technology pushes the frontier, these researchers themselves become among the most scarce and valuable resources.
