AGI Countdown: OpenAI's Chief Research Officer Makes Major Statement — The Window for Humanity is 'Very Small'

marsbit · Published 2026-06-30 · Updated 2026-06-30

Article Summary

The countdown to AGI has begun, according to OpenAI's Chief Research Officer Mark Chen, who states the window for human-centric progress is "very small." Chen argues that AI is reaching a point where models can perform "self-sustaining research," autonomously driving innovation in fields from mathematics to programming. He points to the proliferation of AI's "superhuman" insights (akin to AlphaGo's legendary "Move 37") across disciplines as evidence of this shift. Chen firmly dismisses claims that scaling laws are plateauing or that pre-training is dead, asserting the field remains on an exponential curve. He cites OpenAI's successful bet on reasoning models like o1 as proof that fundamental breakthroughs are still possible. The future of research, he suggests, lies with "Vibe Researchers": humans who provide high-level direction and "taste" while AI handles execution and orchestration of complex, long-horizon tasks. However, significant hurdles remain. Chen highlights a "benchmarking crisis," where models can overfit to existing tests without gaining true generalization. He also notes the "jagged frontier" of AI capabilities, where systems excel at advanced reasoning but struggle with contextual, continual learning from everyday experiences. Despite these challenges, he expresses confidence that these gaps will be closed. In a personal reflection, Chen shares that post-AGI, his wish is to open a noodle shop, a metaphor emphasizing that when AI masters knowledge and innovation, uniqu...

Artificial General Intelligence (AGI) is coming soon.

Just now, OpenAI's Chief Research Officer, Mark Chen, declared boldly:

"In a sense, I hope you can feel that AGI (Artificial General Intelligence) is coming...

We are getting closer and closer to a world where models can autonomously propose more innovations, where they can conduct self-sustaining research."

This is not just an improvement in efficiency; the very process of 'evolution' is being outsourced to silicon-based life.

As Mark Chen skillfully chopped mushrooms and onions in front of the camera, he was talking not just about a bowl of soup, but about the last bastion of human civilization.

If AI can research itself, then on the eve of AGI's arrival, what role exactly should humanity play?

Every Field is Experiencing Its Own 'Move 37'

To understand the weight of this statement, we must go back to the moment Mark entered this field.

2016. AlphaGo vs. Lee Sedol.

In the second game, there was a move—'Move 37'—that the entire world of human players collectively failed to comprehend the moment it was made.

Only later was it understood as a move conceived by the machine, one that humans could never have imagined. That moment inspired countless people and pulled Mark Chen into this field.

And now?

"The craziest thing," Mark says, "is that you can now see a 'Move 37' in almost every field."

In mathematics. In computer science. In programming.

He describes a subtle tipping point: around the beginning of this year, many people "woke up one day" and suddenly realized that AI agents in their own line of work can actually do real work.

Not toys. Not demos. They can complete meaningful, long-horizon, real-world work for you.

This means the idea of "models doing research on their own" is no longer a trope from science fiction.

It's the next step, naturally extrapolated from a series of 'Move 37s' that have already occurred.

Follow this line to its end, and there stands the model that conducts its own research.

Scaling Continues, Pretraining is Not Dead

But what underpins such optimism?

A belief: the scaling curve has not yet ended.

In recent years, claims like "pretraining is dead" or "language models won't lead to AGI" surface every so often.

Mark Chen "vehemently disagrees" with these pessimistic views.

He points out the pattern.

"Pretraining is dead" sounds fresh, but it's actually an old, worn-out script that has been replayed repeatedly over the years.

Each time, someone points at a bottleneck and says, "It's peaked, it can't go further"; each time, OpenAI somehow manages to pull out a new engineering trick, or a new research insight, to break through that wall.

Mark Chen firmly believes, "We are on an exponential curve. It has already held across nearly 10 orders of magnitude. There's no reason it shouldn't continue to hold."

And the most convincing evidence is that OpenAI itself has bet and won once.

The bet was on reasoning.

When o1 was first proposed, even within OpenAI, some didn't believe in it.

The paradigm of "pretraining + post-training" was so dominant at the time that people would naturally ask: The machine is working fine as is, why bother with something else?

It was through the strong push of a few people with conviction and judgment, like Jakub Pachocki and Ilya Sutskever, that it slowly became a fundamental bet for the entire company.

A year later, o1 emerged, and the reasoning paradigm ignited the entire industry.

The curve has not ended, and the biggest breakthroughs often come from bets that nobody initially believed in. Together, these two points are the foundation of Mark Chen's confidence in saying "models conducting self-sustaining research is not far off."

When a model starts thinking on tasks that span weeks or even months, the innovations it generates may already fall within the cognitive blind spots of human experts.

This is precisely the foundation of "self-sustaining research": if it can derive mathematical formulas humans have never seen, it can certainly write algorithm architectures superior to human designs.

Vibe Researcher: When Execution Becomes Cheap

We already have vibe coders—speak, and the AI writes the code.

Research is also sliding in this direction.

During the interview, a highly controversial concept was repeatedly mentioned: Vibe Researcher.

This is a somewhat self-deprecating yet deeply considered career prediction.

Mark believes that the top researchers of the future will no longer be the ones writing every line of PyTorch code, but rather those who "get the vibe right."

Whether at OpenAI or other labs, you're beginning to see that a massive amount of work is shifting towards being primarily about 'orchestration'.

In plain language: humans are responsible for the ideas, and models are responsible for doing all the work.

The researcher uses their brain to conceive ideas; the rest—implementation, execution, scheduling—the model handles itself.

OpenAI's three-year roadmap clearly states the endpoint: enabling models to conduct end-to-end research, from idea generation to producing results, entirely on their own.

But This Road is Full of Unfilled Potholes

As AI becomes capable of autonomously executing and orchestrating tasks, human work will be compressed to the extreme ends:

1. Proposing the truly important questions.

2. Judging whether the answers AI provides have 'soul'.

This is the so-called 'Taste'.

Because machines don't have 'life', they lack 'common sense', and thus cannot develop 'taste'.

But stepping back, Mark Chen knows better than anyone that this road is far from smooth.

The first pothole: Evaluation is broken.

He used an internal term: 'Benchmaxxing'—finding a bunch of problems that look almost identical to the test set, training on them relentlessly, achieving impressive scores but gaining zero improvement in generalization ability.

What's worse, there are too few widely accepted gold-standard benchmarks.

"We are truly in an evaluation crisis," he says. Classic tests like the SAT are all saturated for today's models.

In fact, once a benchmark is released to the world, it's no longer a good benchmark—like an exam paper that becomes invalid the moment it's printed.
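
To make the 'Benchmaxxing' problem concrete, here is a minimal sketch of one way near-copies of test items could be spotted in training data: flag any training problem that shares most of its word n-grams with a test problem. This is not a method described in the interview. The function names (`ngrams`, `overlap_ratio`, `flag_near_duplicates`), the n-gram size, the 0.5 threshold, and the toy data are all hypothetical choices made for this illustration.

```python
def ngrams(text, n=3):
    """Return the set of word n-grams in a lowercased, whitespace-split text."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def overlap_ratio(train_item, test_item, n=3):
    """Fraction of the test item's n-grams that also appear in the training item."""
    test_grams = ngrams(test_item, n)
    if not test_grams:
        return 0.0
    return len(test_grams & ngrams(train_item, n)) / len(test_grams)


def flag_near_duplicates(train_set, test_set, n=3, threshold=0.5):
    """Return (train_index, test_index) pairs whose overlap meets the threshold."""
    return [
        (i, j)
        for i, train_item in enumerate(train_set)
        for j, test_item in enumerate(test_set)
        if overlap_ratio(train_item, test_item, n) >= threshold
    ]


# Toy data, invented for illustration only.
test_set = ["what is the sum of the first ten positive integers"]
train_set = [
    "what is the sum of the first ten positive even integers",
    "name the capital city of france",
]
flagged = flag_near_duplicates(train_set, test_set)
```

A surface check like this only catches near-verbatim copies. Paraphrased problems would pass it untouched, which is one reason a public benchmark is so hard to keep clean.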

Two strategies to address this issue:

1. Separate the evaluation creation team from the model optimization team, creating an adversarial incentive structure.

2. Deploy models at scale and observe failure modes in real-world applications.

He also pointed out that the emergence of every new capability brings with it a corresponding need for evaluation, and steering the direction of evaluation is a significant part of his work.

The second pothole: The jagged frontier.

A model can solve Olympiad-level problems in math or informatics but might fail at trivial tasks humans do without thinking—a genius that can mentally calculate calculus but can't tie its own shoelaces.

Where's the gap? It's in 'context', in continual learning—applying the lessons learned from one task to the next.

This is so natural for humans, but for models, it's a hard nut the entire industry is trying to crack.

When asked if reaching AGI still requires two or three fundamental breakthroughs, Mark didn't take the bait.

He said that abilities like continual learning are "essential foundational capabilities that must be unlocked." As for whether that counts as a 'breakthrough', he wasn't sure, but "many shots are already aimed at the goal, and I'm pretty sure they'll go in."

That's his attitude: the potholes are real, but work is already underway to fill every single one, and he's betting they can be filled.

The Soup Metaphor: Opening a Noodle Shop After AGI

The most heartwarming moment in the interview was the story about 'soup'.

It is said that Mark Zuckerberg once tried to poach OpenAI researchers with homemade soup, and Mark Chen's response was to bring the soup directly to the office and share it with everyone.

When asked about his ultimate wish after AGI is achieved, this man in charge of the world's most powerful AI brains answered:

"I want to open a noodle shop. That might be my post-AGI hobby."

There is profound meaning hidden in this answer.

When AI can perform all "self-sustaining research", when all knowledge and innovation can be generated at the speed of light, the most scarce resource for humanity will no longer be intelligence, but 'experience'.

A machine can calculate the optimal saltiness for a bowl of soup, but it can never imbue that soup with 'warmth' and 'story'.

References:

https://www.youtube.com/watch?v=fpAthTtha8c

https://finance.biggo.com/podcast/1241bc21164ccc75

This article is from WeChat public account "Xin Zhi Yuan", author: ASI Revelation.


Related Q&A

Q: According to Mark Chen, what is the most 'insane' development currently happening across various fields?

A: Mark Chen states that the most 'insane' thing is that you can now see a 'Move 37', a pivotal, game-changing insight, happening in almost every field, including mathematics, computer science, and programming.

Q: What does Mark Chen believe is the foundation for his confidence that 'self-sustaining research by models' is not far away?

A: His confidence rests on two key beliefs: 1) The scaling curve has not ended and has already held across nearly 10 orders of magnitude. 2) Major breakthroughs, like the o1 project, which focused on reasoning, often come from bold bets that few initially believe in.

Q: What is a 'Vibe Researcher' as described in the article, and what is their proposed future role?

A: A 'Vibe Researcher' is a predicted future role where the top researcher is not the one who writes the code, but the one who 'gets the vibe' or has the 'taste' to pose truly important questions and judge the 'soul' of the answers AI provides. Their role shifts to high-level idea generation and orchestration, while AI handles implementation.

Q: What are the two major technical 'potholes' or challenges highlighted by Mark Chen on the road to AGI?

A: The two major challenges are: 1) The evaluation crisis ('Benchmaxxing'), where current benchmarks are easily gamed and become obsolete once public, making true model capability hard to measure. 2) The 'jagged frontier', where models excel at complex, specialized tasks (like math Olympiads) but fail at trivial, context-dependent tasks that require continual learning and common sense.

Q: What does Mark Chen metaphorically suggest might be the most valuable human role after AGI is achieved, and what personal aspiration does he link to this idea?

A: He suggests that after AGI, when AI handles all self-sustaining research and knowledge generation, the most valuable human resource will be 'experience'. Machines can calculate optimal outcomes, but they cannot create 'warmth' and 'story'. He personally aspires to open a noodle shop as a post-AGI pursuit, symbolizing this human-centric value of creating meaningful experiences.
