AGI Countdown: OpenAI's Chief Research Officer Makes Major Statement — The Window for Humanity is 'Very Small'

marsbit发布于2026-06-30更新于2026-06-30

文章摘要

The countdown to AGI has begun, according to OpenAI's Chief Scientist Mark Chen, who states the window for human-centric progress is "very small." Chen argues that AI is reaching a point where models can perform "self-sustaining research," autonomously driving innovation in fields from mathematics to programming. He points to the proliferation of AI's "superhuman" insights—akin to AlphaGo's legendary "Move 37"—across disciplines as evidence of this shift. Chen firmly dismisses claims that scaling laws are plateauing or that pre-training is dead, asserting the field remains on an exponential curve. He cites OpenAI's successful bet on reasoning models like o1 as proof that fundamental breakthroughs are still possible. The future of research, he suggests, lies with "Vibe Researchers"—humans who provide high-level direction and "taste" while AI handles execution and orchestration of complex, long-horizon tasks. However, significant hurdles remain. Chen highlights a "benchmarking crisis," where models can overfit to existing tests without gaining true generalization. He also notes the "jagged frontier" of AI capabilities, where systems excel at advanced reasoning but struggle with contextual, continual learning from everyday experiences. Despite these challenges, he expresses confidence that these gaps will be closed. In a personal reflection, Chen shares that post-AGI, his wish is to open a noodle shop—a metaphor emphasizing that when AI masters knowledge and innovation, uniqu...

Artificial General Intelligence (AGI) is coming soon.

Just now, OpenAI's Chief Research Officer, Mark Chen, declared boldly:

In a sense, it's like I hope you can feel that AGI (Artificial General Intelligence) is coming...

We are getting closer and closer to a world where models can autonomously propose more innovations—they can conduct self-sustaining research.

This is not just an improvement in efficiency; the very process of 'evolution' is being outsourced to silicon-based life.

As Mark Chen skillfully chopped mushrooms and onions in front of the camera, he was talking not just about a bowl of soup, but about the last bastion of human civilization.

If AI can research itself, then on the eve of AGI's arrival, what role exactly should humanity play?

Every Field is Experiencing Its Own 'Move 37'

To understand the weight of this statement, we must go back to the moment Mark entered this field.

2016. AlphaGo vs. Lee Sedol.

In the second game, there was a move—'Move 37'—that the entire world of human players collectively failed to comprehend the moment it was made.

It was only later understood that it was a move conceived by the machine, one that humans could never have imagined. That moment ignited countless people and pulled Mark Chen into this field.

And now?

"The craziest thing," Mark says, "is that you can now see a 'Move 37' in almost every field."

In mathematics. In computer science. In programming.

He describes a very subtle tipping point: many people, around the beginning of this year, "woke up one day" and suddenly realized: AI agents in my line of work, they can actually do real work.

Not toys. Not demos. They can complete meaningful, long-cycle, real-world work (long-horizon work) for you.

This means the idea of "models doing research on their own" is no longer a trope from science fiction.

It's the next step, naturally extrapolated from a series of already-occurring 'Move 37s'.

Look down this line, and standing at the end is that model that will conduct its own research.

Scaling Continues, Pretraining is Not Dead

But what underpins such optimism?

A belief: the scaling curve has not yet ended.

In recent years, claims like "pretraining is dead" or "language models won't lead to AGI" surface every so often.

Mark Chen "vehemently disagrees" with these pessimistic views.

He points out the pattern.

"Pretraining is dead" sounds fresh, but it's actually an old, worn-out script that has been replayed repeatedly over the years.

Each time, someone points at a bottleneck and says, "It's peaked, it can't go further"; each time, OpenAI somehow manages to pull out a new engineering trick, or a new research insight, to break through that wall.

Mark Chen firmly believes, "We are on an exponential curve. It has already sustained through nearly 10 orders of magnitude. There's no reason it shouldn't continue to sustain."

And the most convincing evidence is that OpenAI itself has bet and won once.

The bet was on reasoning.

When o1 was first proposed, even within OpenAI, some didn't believe in it.

The paradigm of "pretraining + post-training" was so dominant at the time that people would naturally ask: The machine is working fine as is, why bother with something else?

It was through the strong push of a few people with conviction and judgment, like Jakub Pachocki and Ilya Sutskever, that it slowly became a fundamental bet for the entire company.

A year later, o1 emerged, and the reasoning paradigm ignited the entire industry.

The curve hasn't ended, coupled with the fact that the biggest breakthroughs often come from bets that nobody initially believed in. These two points together are the foundation of Mark Chen's confidence in saying "models conducting self-sustaining research is not far off."

When a model starts thinking on tasks that span weeks or even months, the innovations it generates may already be beyond the cognitive blind spots of human experts.

This is precisely the foundation of "self-sustaining research": if it can derive mathematical formulas humans have never seen, it can certainly write algorithm architectures superior to human designs.

Vibe Researcher: When Execution Becomes Cheap

We already have vibe coders—speak, and the AI writes the code.

Research is also sliding in this direction.

During the interview, a highly controversial concept was repeatedly mentioned: Vibe Researcher.

This is a somewhat self-deprecating yet deeply considered career prediction.

Mark believes that the top researchers of the future will no longer be the ones writing every line of PyTorch code, but rather those who "get the vibe right."

Whether at OpenAI or other labs, you're beginning to see that a massive amount of work is shifting towards being primarily about 'orchestration'.

In plain language: humans are responsible for the ideas, and models are responsible for doing all the work.

The researcher uses their brain to conceive ideas; the rest—implementation, execution, scheduling—the model handles itself.

OpenAI's three-year roadmap clearly states the endpoint: enabling models to conduct end-to-end research, from idea generation to producing results, entirely on their own.

But This Road is Full of Unfilled Potholes

As AI becomes capable of autonomously executing and orchestrating tasks, human work will be compressed to the extreme ends:

1. Proposing the truly important questions.

2. Judging whether the answers AI provides have 'soul'.

This is the so-called 'Taste'.

Because machines don't have 'life', they lack 'common sense', and thus cannot develop 'taste'.

But stepping back, Mark Chen knows better than anyone that this road is far from smooth.

The first pothole: Evaluation is broken.

He used an internal term: 'Benchmaxxing'—finding a bunch of problems that look almost identical to the test set, training on them relentlessly, achieving impressive scores but gaining zero improvement in generalization ability.

What's worse, there are too few widely accepted gold-standard benchmarks.

"We are truly in an evaluation crisis," he says. Classic tests like the SAT are all saturated for today's models.

In fact, once a benchmark is released to the world, it's no longer a good benchmark—like an exam paper that becomes invalid the moment it's printed.

Two strategies to address this issue:

1. Separate the evaluation creation team from the model optimization team, creating an adversarial incentive structure.

2. Deploy models at scale and observe failure modes in real-world applications.

He also pointed out that the emergence of every new capability brings with it a corresponding need for evaluation, and steering the direction of evaluation is a significant part of his work.

The second pothole: The jagged frontier.

A model can solve Olympiad-level problems in math or informatics but might fail at trivial tasks humans do without thinking—a genius that can mentally calculate calculus but can't tie its own shoelaces.

Where's the gap? It's in 'context', in continual learning—applying the lessons learned from one task to the next.

This is so natural for humans, but for models, it's a hard nut the entire industry is trying to crack.

When asked if reaching AGI still requires two or three fundamental breakthroughs, Mark didn't take the bait.

He said that abilities like continual learning are "essential foundational capabilities that must be unlocked." As for whether that counts as a 'breakthrough', he wasn't sure, but "many shots are already aimed at the goal, and I'm pretty sure they'll go in."

That's his attitude: the potholes are real, but work is already underway to fill every single one, and he's betting they can be filled.

The Soup Metaphor: Opening a Noodle Shop After AGI

The most heartwarming moment in the interview was the story about 'soup'.

It is said that Mark Zuckerberg once tried to poach OpenAI researchers with homemade soup, and Mark Chen's response was to bring the soup directly to the office and share it with everyone.

When asked about his ultimate wish after AGI is achieved, this man in charge of the world's most powerful AI brains answered:

"I want to open a noodle shop. That might be my post-AGI hobby."

There is profound meaning hidden in this answer.

When AI can perform all "self-sustaining research", when all knowledge and innovation can be generated at the speed of light, the most scarce resource for humanity will no longer be intelligence, but 'experience'.

A machine can calculate the optimal saltiness for a bowl of soup, but it can never imbue that soup with 'warmth' and 'story'.

References:

https://www.youtube.com/watch?v=fpAthTtha8c

https://finance.biggo.com/podcast/1241bc21164ccc75

This article is from WeChat public account "Xin Zhi Yuan", author: ASI Revelation.

你可能也喜欢

XRP活跃地址激增72%，衍生品市场杠杆率下降

XRP（瑞波币）的链上信号近日显得更为清晰。数据显示，在短短两周内，XRP的每日活跃地址数激增约72%，与此同时，衍生品市场的未平仓合约和投机性杠杆有所降温。这种活跃地址增长与杠杆减少的组合值得关注，因为它可能意味着市场价格走势对高杠杆头寸的依赖降低，而更多反映了真实的网络使用需求。活跃地址数增加可能意味着更多用户与网络交互、链上交易增多，或是沉寂的钱包重新活跃。这对于XRP尤为重要，因为它常受法律、机构、支付等多重叙事影响，链上数据为此提供了更具体的衡量依据。不过，分析师也谨慎指出，地址数激增也可能包含钱包维护、交易所内部转账等非需求驱动的活动，不一定直接转化为持续的购买力。关键点在于，此次地址增长伴随着杠杆下降，这与活动与杠杆同时飙升的情况不同。它表明市场可能去除了一些泡沫，让交易者能更专注于评估网络实际活跃度。然而，两周的数据增长虽具建设性，但并非决定性信号。市场仍需观察活跃地址的上升趋势能否持续，交易量是否会跟进，以及现货需求能否在不依赖高杠杆的情况下改善。目前，市场结构向好但尚未定论。多头希望看到链上活动持续，空头则认为除非地址增长能转化为更强的价格走势和流动性，否则意义有限。接下来的市场表现对XRP的短期走向至关重要。

bitcoinist37分钟前

bitcoinist37分钟前

Solana网络活动激增如何推动SOL突破82美元关口

Solana链上活动近期急剧加速，网络参与度达到数月来的最强水平。每日活跃钱包数量攀升至451万的历史新高，且这一峰值持续时间自2月以来最为持久。活动增长主要归因于代币化股权的快速发展、xStocks活动激增以及DeFi活动的复苏。随着用户回归，SOL收复了重要的技术点位。这表明Solana网络正通过实际使用增长来支撑价格，而非仅靠价格上涨。然而，网络的持续采用将取决于涨势消退后新用户是否会继续使用该平台。网络活动的复兴日益得到现实世界金融应用扩展的支持，而不仅仅是投机交易。Solana上的代币化股权用户正在增加。稳定币供应量保持高位，跨链桥净流入、总锁定价值（TVL）和去中心化交易所（DEX）交易量的持续增长表明，流入Solana生态的资金正在留存，而非快速流出。 SOL在6月29日上涨7.48%，从69.74美元升至时段高点76.49美元，随后回落至73美元附近。此次反弹使Solana有望在连续九个月收跌后首次录得月度阳线，买家信心有所改善。但78-82美元的阻力区仍是市场面临的最大考验，此前已数次阻止上涨。若能突破该区间，可能为涨向92美元打开道路。然而，多头必须守住72美元支撑位，以维护正在形成的“高点更高、低点更高”结构。若再次遇阻回落，则可能表明更广泛的复苏仍然缺乏持久信念。

ambcrypto56分钟前

ambcrypto56分钟前

SUI、ENA与EIGEN引领本周价值7300万美元的代币解锁潮

本周，SUI、ENA和EIGEN将引领一轮价值7300万美元的代币解锁浪潮。SUI（约1372万枚，价值约940万美元）和EIGEN（约3682万枚，价值约870万美元）的解锁定于2026年7月1日，而ENA（约312万美元）的解锁则在2026年7月3日。文章强调，解锁事件并非直接的抛售信号，做市商通常会提前对冲这些供应事件。对于交易者而言，此类事件会影响短期头寸、对冲策略和流动性分布，但其影响应置于更广泛的市场背景中评估，包括ETF资金流、杠杆情况和山寨币流动性轮动。当前关键在于观察数据是否持续印证这一趋势，以判断这是短期波动还是结构性转变。报告提醒避免将单一数据点过度解读为市场全貌，而应结合整体流动性、宏观条件和衍生品状况进行综合分析。

bitcoinist1小时前

bitcoinist1小时前

Kaspa价格预测：利好消息公布后卖出反应抹去近期涨幅

Kaspa（KAS）网络的Toccata硬分叉已于6月30日成功完成，此次升级引入了智能合约功能并支持KRC-20代币。受此事件预期推动，KAS价格在硬分叉前一度从6月25日的低点0.0266美元上涨20.38%，至撰稿时的0.032美元。然而，硬分叉完成后10小时内，价格迅速回落7.8%，呈现典型的“利好出尽是利空”行情。从技术分析看，KAS长期趋势依然看跌。日线图显示，自2026年初从0.0532美元跌至0.0249美元后，价格虽两度尝试挑战0.04美元阻力区均告失败，近期更在市场普跌中逼近0.025美元前低。MACD指标位于零轴下方，资金流量指标（CMF）为-0.17，表明市场动能疲软且资金持续外流。短期来看，价格在硬分叉前短暂突破0.03美元局部阻力位，但随后快速抛售证实了这仅是一次卖空机会。清算热图也显示，在0.03美元上方积聚了大量空头清算关卡，价格在清扫该区域后已开始反转。综合来看，文章预测KAS在接下来几周有可能进一步下探0.0249美元，甚至跌向0.020美元水平。长期下行趋势未改，交易者需为此做好准备。

ambcrypto1小时前

ambcrypto1小时前

Upbit上线Gensyn，AI加密叙事在韩国市场获得新推力

韩国加密货币交易所Upbit宣布上线去中心化AI GPU计算项目Gensyn的代币GEN，为其开通韩元、比特币和USDT交易对。此举显著提升了Gensyn的知名度，并使其接入全球最活跃的零售市场之一——韩国市场。韩国韩元交易对的增加通常能为代币带来大量新流动性和交易者关注。当前，人工智能（AI）相关加密资产持续吸引资金，而Gensyn所代表的去中心化计算叙事契合这一热点。其核心理念是利用区块链网络协调开放市场，为AI模型提供庞大的计算资源。然而，文章也指出交易所上线带来的“上市溢价”具有双刃剑效应。短期内可能因新流动性和零售关注度激增而引发价格剧烈波动，但随后市场将更关注项目的实质：真实的计算活动、用户采用、代币激励的可持续性以及AI叙事能否持续吸引资金。上线虽能扩大项目受众和流动性，但长期价值最终取决于项目能否将关注转化为实际使用。简而言之，Upbit的上线为GEN和AI加密赛道提供了新的催化剂，使其更难以被山寨币交易者忽视，但项目的成功关键在于其实际执行与发展，而非仅仅依赖市场热度。

bitcoinist2小时前

bitcoinist2小时前

交易

现货

AGI Countdown: OpenAI's Chief Research Officer Makes Major Statement — The Window for Humanity is 'Very Small'

文章摘要

Every Field is Experiencing Its Own 'Move 37'

Scaling Continues, Pretraining is Not Dead

Vibe Researcher: When Execution Becomes Cheap

But This Road is Full of Unfilled Potholes

The Soup Metaphor: Opening a Noodle Shop After AGI

热门币种推荐

相关问答

你可能也喜欢

XRP活跃地址激增72%，衍生品市场杠杆率下降

Solana网络活动激增如何推动SOL突破82美元关口

SUI、ENA与EIGEN引领本周价值7300万美元的代币解锁潮

Kaspa价格预测：利好消息公布后卖出反应抹去近期涨幅

Upbit上线Gensyn，AI加密叙事在韩国市场获得新推力

交易

热门文章

如何购买S

Sonic：Andre Cronje主导升级，逆势上涨的Layer1新星

成长学院：学习“ Sonic“ ，瓜分价值 1000 USDT

相关讨论

热门问答

热门分类

热门标签