Anthropic全球警告，OpenAI已跨“可靠性阈值”：AI自我加速启动

marsbitPublished on 2026-06-06Last updated on 2026-06-06

Abstract

AI领域出现重要警告与发展洞察。Anthropic发出全球警告，指出AI递归自我改进进程加速，已接近“自己造自己”的临界点，呼吁减缓研究。与此同时，OpenAI后训练团队负责人Yann Dubois在访谈中揭示了关键内部视角： 1. **能力提升是连续线性，但实用性感知是跳跃的**。AI能力在达到“可靠性阈值”前如同玩具，跨越后则成为可托付工作的可靠工具。OpenAI被认为在去年12月左右跨过了此阈值。 2. **AI正进入自我加速循环**。模型能力提升后，本身已成为研发的有力工具（如辅助编程），从而加速下一代模型的开发，形成越转越快的正反馈回路。 3. **AI构建更像“手艺”而非纯科学**。在硬核领域，经验、直觉和反复试错（类似“炼金术”）目前扮演关键角色，科学解释常事后补足。 4. **垂直应用（Harness）价值巨大，甚至能触及AGI体验**。Dubois认为，若冻结现有模型，仅通过精心打磨针对特定领域的编排系统，即可在许多场景中实现类似通用人工智能（AGI）的效果。当前瓶颈常在于“最后一公里”——权限、数据连接与业务流程集成，而非模型智能本身。 5. **持续学习仍是核心挑战**。模型难以像人类一样在特定环境中持续学习和优化，其学习曲线容易趋于平缓，这是亟待解决的重要问题。综上，AI发展已迈过关键可靠性门槛，进入自我加速阶段，同时为垂直领域的深度应用与集成创造了巨大机遇与挑战。

AI圈平地一声雷!

Anthropic向全人类发出警告:停止研究AI!

Anthropic内部数据显示,AI正在加速AI的发展,通往递归自我改进的路径或已浮现。

换句话说,AI正在接近「自己造自己」的临界点。

这一进程比Anthropic预想的更快,所以Anthropic呼吁减缓或暂停AI研究。

而与此同时,OpenAI 后训练团队负责人 Yann Dubois 在最新访谈中,给出了一个更微观但同样令人深思的视角:

AI进化不是突然开挂,而是刚过合格线!

在最新访谈中,他揭秘了多项内部视角:

AI能力的增长是线性的、连续的,但用户体验到的「有用性」却是离散的、跳跃的。

因为在达到某个「可靠性阈值」之前,AI只是一个会变戏法的玩具;一旦跨过那个点,它就成了能托付工作的员工,会自我加速。

这个阈值,OpenAI在去年12月左右才跨过了这道坎。

此外,Yann Dubois提出了反直觉的论断:AI构建更像「手艺(Craft)」而非「科学」。

这个洞察极具张力:在这个最强调硬核算力的领域,最终胜出的竟然是类似炼金术的「flare(直觉/灵感)」。

他还提出「最后一公里的AI红利」。

如果现在冻结所有模型,只靠做垂直应用(Harness),我们其实已经能实现AGI了。

瓶颈不在模型大脑,而在「权限、连接和数据」。这直接给正在观望的开发者泼了一盆冷水,同时也指明了金矿所在。

可靠性阈值已过,AI自我加速

过去几周,AI圈热闹不断:GPT5.5发布、Claude Mythos也出来了。

尤其是在网络安全、AI智能体写代码这块,给人的感觉真是日新月异,AI的进展让人感觉像突然「跳了一大格」。

Dubois的说法很不客气:能力提升其实一直挺连续,大家之所以觉得像坐了火箭,是因为中间隔着一道「可靠性门槛」。

门槛没跨过去之前,AI像个聪明但爱闯祸的实习生:能写、能算、能出主意,但你不敢把活彻底交给它。

门槛跨过去之后,你才敢让它「真上岗」。

他判断OpenAI大概是在「去年12月」跨过了这条线,于是才出现了外界感受到的「阶梯式跃迁」。

更刺激的是第二层原因:当模型足够好,它会反过来加速研发本身。

这就是Anthropic最担心的事。

Dubois提到,尤其在编程场景里,研究人员自己天天写代码,模型一变强,等于全员多了个不睡觉的搭档——既能帮研究者把工具链搭起来,也能在训练下一代模型时「以AI养AI」。

加速回路一旦转起来,就会越转越快,最近几个月「越来越猛」并不奇怪。

这也发生在Anthropic内部,到2026年第二季度,人均每个季度贡献的代码,已经是2024年第一季度的8倍。

第三个推动力,来自强化学习(RL)的「转型升级」。

早期像o1这类推理模型,主要在「可验证奖励」的任务上猛刷分——数学题、编程竞赛,因为对错清清楚楚,奖励好定义。

但过去一年,他们把在竞赛里练出来的那套工具,往更真实、更模糊的工作场景迁移:不再只优化「有标准答案的题」,而是优化「用户觉得真有用的东西」。

一句话:从刷题选手,开始往职场打工人进化。

AI工程师不是科学家,AI是「养」出来的

可一旦走进真实世界,麻烦就来了:可靠性怎么提?

Dubois给了个非常直白的「概率模型」:

既然现在很多都是AI智能体(agentic)系统,你可以粗暴地想成「每两分钟就有一定概率犯错」;运行时间越长,最终答案翻车的概率就越高。

所以所谓「提升可靠性」,本质就是不断把这个「每两分钟出错率」压下去。

这是AI智能体固有的硬骨头。

这也解释了为什么Dubois说,AI的构建更像「手艺活」,不像课本里的「科学实验」。

现实流程往往是:先靠经验、直觉、反复试错把东西做出来,甚至带点「炼金术」味道;等它真能跑、真能用,再回头补更科学的解释与方法论。

他还提到一个颇具反转的小插曲——

当年ChatGPT公开说用了RL,他第一反应是「太复杂了吧,用监督微调(SFT)就够了」,这也正是他在Stanford做Alpaca时想验证的思路。

但后来事实显示,一旦模型规模跨过某个水平,RL真的会「突然开始好用」,只是代价不低——采样很多答案、判断哪些对、哪些错,算力和系统工程都很烧钱。

垂直领域Harness已达AGI

说到「把AI拉进现实」,绕不开最近创业圈最爱挂在嘴边的词:Harness(编排系统)。

有人把它当成AI智能体的「外挂骨架」,也有人怀疑它迟早被模型「吃掉」。

Dubois的态度很现实:

短期内,垂直场景的Harness很值,能把可靠性从80%推到85%。

但前提是你得接受:模型在持续变强,Harness也得反复重调。

想做一个长期稳定、放之四海而皆准的「通用Harness」,他认为基本走不通。

甚至他还抛出一句很「挑衅」的判断:如果今天把现有模型「冻住」,只认真打磨Harness并围绕它训练,很多领域的人可能会「明显感到通用人工智能(AGI)的味道」。

最后一公里

但真正让Dubois既兴奋又皱眉的,是「持续学习(continual learning)」这道老大难。

三年前ChatGPT刚火时,他和朋友甚至认真讨论过要不要创业做个性化记忆与持续学习。

当时他们觉得「6个月内OpenAI就会搞定」,于是没做;三年后他人都进OpenAI了,却发现这事仍没真正解决。

现在模型的尴尬在于:第一天空降到公司,可能比多数新员工还顶用(起点高);但之后基本「保持原样」,因为它不会在特定环境里越干越懂你、越干越高效。

人类学习曲线是往上爬的,AI这条线却容易变平。

把AI的曲线从「平的」掰成「持续上升」的,Dubois认为会是接下来最重要的问题之一。

所以,创业公司还有没有空间做垂直应用?

Dubois给的答案很干脆:不但有,而且很大。

因为真正的瓶颈往往不是「模型够不够聪明」,而是最后一公里——权限怎么给、数据怎么接、连接器怎么打通、怎么嵌进具体业务流程。

大模型在天上飞得再高,不落地也只能当烟花;而把它拽到地面、让它拿到该拿的钥匙、打开该开的门,反而是最有含金量的苦活累活。

参考资料:

https://x.com/Potatoloogs/status/2062494654885749126

https://www.youtube.com/watch?v=DhD1zZ8w8Mw&t=3s

本文来自微信公众号“新智元”,作者:ASI启示录

Trending Cryptos

CitreaCTR

wrapped stUSDTWSTUSDT

Catholic And Law Enforcement Groups Warn CLARITY Act Could Weaken Crypto Crime Safeguards

A coalition of Catholic leaders, law enforcement groups, and anti-trafficking advocates warns that the CLARITY Act could weaken safeguards against crypto-enabled crime. Their criticism focuses on provisions that would protect developers of non-custodial software (like wallets and DeFi protocols) from being regulated as money transmitters. This highlights a core regulatory dilemma: distinguishing neutral software from financial intermediation. Crypto advocates argue such developers should not be treated like exchanges, while critics fear broad exemptions could hinder the tracking of illicit finance, including human trafficking and sanctions evasion. The bill aims to create clearer market-structure rules, but this opposition shows the debate extends beyond investor protection to law enforcement concerns. The pushback means supporters may need to address these issues, potentially leading to amendments or stricter requirements. For the crypto industry, clearer rules could spur U.S. investment, but the bill faces a tougher political path if framed as weakening crime-fighting tools.

bitcoinist3h ago

Catholic And Law Enforcement Groups Warn CLARITY Act Could Weaken Crypto Crime Safeguards

bitcoinist3h ago

Bitcoin Falls About 5% to $59,023.10, Hitting a New Low Since October 2024. Retail Investor Retreat Exposes Market's New Vulnerabilities.

Bitcoin fell approximately 5% to $59,023.10, marking its lowest level since October 2024. This decline highlights new vulnerabilities in the market as retail investors withdraw.

华尔街日报4h ago

Bitcoin Falls About 5% to $59,023.10, Hitting a New Low Since October 2024. Retail Investor Retreat Exposes Market's New Vulnerabilities.

华尔街日报4h ago

The Bitcoin Vector #61

Executive Summary: The article "The Bitcoin Vector #61" discusses recent developments and analysis in the Bitcoin ecosystem. It notes significant price volatility, major institutional investment moves, and key regulatory updates. The piece explores the impact of macroeconomic factors on cryptocurrency markets and provides technical analysis of Bitcoin's current market position.

insights.glassnode6h ago

The Full Story of How Crypto Unicorn Blockstream Is Mired in Serious Fraud Allegations

This article details serious allegations of fraud against Bitcoin infrastructure company Blockstream, founded by Bitcoin pioneer Adam Back. In June 2024, investigative account NatInfoSec published a report accusing Blockstream's mining note (BMN) program of potentially operating a multi-billion dollar scheme with Ponzi-like characteristics. The core allegations focus on Blockstream Mining Notes (BMNs), which offer investors fixed annual yields up to approximately 20% from Bitcoin mining. NatInfoSec's investigation raises several key issues: 1. **Suspicious Hashrate & Payout Capacity**: The analysis suggests Blockstream would need 20-45 EH/s of mining power to cover its BMN obligations, but its public dashboard shows only around 15 EH/s. Furthermore, no verifiable public evidence (e.g., grid connection records, import data) was found to support the massive mining operation required. 2. **Questionable Payout Source**: The BMN contract allows Blockstream to use Bitcoin from *any source* (Substitute Performance BTC) to fulfill investor payouts, raising concerns that payouts may not come from actual mining revenue. 3. **High-Risk, Fixed Returns**: Offering ~20% fixed yields in the volatile, cyclical Bitcoin mining industry is viewed as highly unusual and requires clear explanation. 4. **Undisclosed Criminal Record of Key Figure**: Christopher William Cook, a key figure in Blockstream's mining operations and CEO of spin-off Exacore, was found to have a federal felony conviction for mail fraud in 2008, a fact not disclosed in BMN offering documents. His background was also allegedly embellished. 5. **Potential Contagion to BSTR SPAC**: Questions were raised about whether these liabilities and Cook's record should have been disclosed in the SEC filings for Bitcoin Standard Treasury Company (BSTR), a separate Adam Back-associated firm planning a SPAC merger. The crypto community is divided. BitMEX Research validated Cook's criminal record and expressed concern over the high yields but found other evidence lacking or misleading, noting the legal separation between BMN, Blockstream, and BSTR. Blockstream defenders, like Samson Mow, argue the mining is real. Critics, however, emphasize the lack of independent, verifiable proof of the mining operation's scale and the true source of investor payouts. The article concludes that BMN remains shrouded in key unanswered questions regarding its actual size, the verifiability of its underlying mining assets and payouts, the source of its high yields, and the full role and disclosure concerning Chris Cook. Blockstream had not issued a comprehensive response at the time of writing.

marsbit7h ago

The Full Story of How Crypto Unicorn Blockstream Is Mired in Serious Fraud Allegations

marsbit7h ago

South Korea Targets 40 Unregistered Crypto Operators in Regulatory Crackdown

South Korea's Financial Intelligence Unit has referred around 40 unregistered virtual asset service providers to the police as part of a regulatory crackdown. Under local law, all crypto exchanges must obtain ISMS certification and FIU registration before operating, yet only 28 entities are currently registered. The FIU identified methods used by offshore platforms to attract Korean users, including advertising on local messaging apps and using private money changers to convert stablecoins. Officials stressed that unregistered platforms are not protected by Korean law. This action aligns with South Korea's push for stricter global crypto compliance through FATF measures. FIU Director Lee Hyung-joo recently urged FATF members to remove transaction thresholds for the "Travel Rule," as South Korea plans to mandate identity checks for all crypto transactions starting in August, expanding from the current 1 million won (≈$730) minimum. Regulatory enforcement has intensified in 2026, with authorities pursuing criminal cases against market manipulation schemes and enhancing cooperation with financial institutions to curb cross-border illegal transactions.

TheNewsCrypto8h ago

South Korea Targets 40 Unregistered Crypto Operators in Regulatory Crackdown

TheNewsCrypto8h ago

Trading

Spot

Futures

Hot Articles

Audiera: The AI Agent Network Powering the Web4 Entertainment Economy

Audiera is a dual-platform Web4 entertainment ecosystem combining a mobile rhythm experience and a lightweight Telegram mini-game, powered by AI interaction and an on-chain creator economy.

40.3k Total ViewsPublished 2026.03.11Updated 2026.03.11

Audiera: The AI Agent Network Powering the Web4 Entertainment Economy

The Cornerstone of the Autonomous AI Economy: How Talus is Reshaping On-Chain Intelligent Agents

Talus is a decentralized AI Agent framework built on the Sui, designed to solve the structural problems of current AI systems: centralization, opacity, and a lack of native economic identity.

43.0k Total ViewsPublished 2026.03.18Updated 2026.03.18

The Cornerstone of the Autonomous AI Economy: How Talus is Reshaping On-Chain Intelligent Agents

In-depth Analysis of AI and Crypto: The Era of Symbiosis between Algorithms and Ledgers

By 2026, the integration of artificial intelligence and cryptocurrency has advanced from proof-of-concept to a new stage of "system-level integration".

2.2k Total ViewsPublished 2026.03.26Updated 2026.03.26

In-depth Analysis of AI and Crypto: The Era of Symbiosis between Algorithms and Ledgers

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of AI (AI) are presented below.