Claude Mythos Officially Announced: Performance Crushes Opus 4.6, 'Imprisoned' for Being Too Dangerous

marsbit发布于2026-04-08更新于2026-04-08

文章摘要

Anthropic has officially announced Claude Mythos, a highly advanced AI model that reportedly surpasses Opus 4.6 across multiple benchmarks, including a 24% improvement in bug-fixing capability and a 17% boost in agent-based computer operations. However, due to its dangerous ability to identify and exploit security vulnerabilities at a level exceeding most human hackers, the model has been deemed a significant risk to public and economic security. Instead of releasing it publicly, Anthropic has initiated "Project Glasswing," a collaborative security effort involving major tech companies like Amazon, Apple, Google, Microsoft, and NVIDIA, along with open-source organizations. These partners will use the preview version of Mythos to strengthen defenses and identify vulnerabilities in critical systems before the model is widely available. Anthropic is also providing up to $100 million in usage credits and donating $4 million to support open-source security initiatives. The move underscores the urgent need to address AI safety at a global scale, as rapidly advancing capabilities could soon outpace existing security measures.

The beast is about to break free.

Just moments ago, Anthropic's legendary strongest model—

Claude Mythos—released its preview version.

The rumors were right; this is indeed an epoch-making model.

On various benchmarks, it completely crushes Opus 4.6.

Not just an improvement, it's a true "crushing"......

SWE-bench Pro: Bug-fixing capability surged by 24%

SWE-bench Verified: Tested again with the stricter version, still a 13% improvement.

Terminal-Bench 2.0: Computer operation/Agent capability, improved by 17%.

A model stronger than Opus 4.6? And it's still beating Opus down??

Friends, I simply cannot imagine what this thing will even be like.

Following Sonnet and Opus, OpenAI now meets its strictest father—

Mythos.

But this time, Anthropic doesn't want to talk about benchmarks.

Compared to excitement and boasting, the official announcement blog subtly reveals a hint of hard-to-conceal "concern."

Even, panic.

Because this mythical model has a fatal "flaw."

Mythos Preview has already discovered thousands of high-risk vulnerabilities; all major mainstream operating systems and browsers are affected.

That's right. As AI's programming capabilities rapidly improve, making it more and more like a competent engineer, its other, dual-faceted ability is also growing uncontrollably.

Now, Mythos's ability to attack security vulnerabilities far surpasses that of the vast majority of hackers.

AI's capabilities at the code level have become so strong that in finding and exploiting vulnerabilities, it can exceed the vast majority of human engineers, leaving only the top few able to compete.

The other side of the coin has finally been flipped before us.

Anthropic states that at the current pace of AI development, this capability will soon proliferate and could even fall into the hands of those with less than secure intentions.

If it gets out of control, it would be a catastrophic disaster for the economy and public safety.

To give everyone enough time to respond, upon first announcing this new model, Anthropic made a very unusual move for an AI company.

They are temporarily not releasing it to the public.

Yes, Mythos will be part of the Glasswing Project (Project Glasswing1), initially opened to various partners first.

Their task is to build the strongest possible safety guardrails before this beast is fully unleashed.

The Imprisoned Claude Mythos

This is a massive "containment" operation.

To tame the beast that is Mythos, Anthropic has called upon almost all the legacy internet giants.

Participants include: Amazon, Apple, Google, Linux Foundation, Microsoft, NVIDIA......

In this project, there are no bystanders. All partners will deploy Mythos Preview into their own defensive security work.

They will unlock Mythos's true capabilities early, letting it out of its cage.

This is a preemptive training session. Before being riddled by the official version of Mythos, they will use the Mythos preview's capabilities to the fullest to strengthen their defenses.

Security, this time, is for real.

Anthropic says it will share the experience gained from this process so the entire industry can benefit.

The project is also open to over 40 additional builders and maintainers of critical software infrastructure, allowing them to use this model to scan and fortify their own systems and open-source systems.

Even more夸张的是, Anthropic has also pledged up to $100 million in usage credits to support this work and directly donated $4 million to open-source security organizations.

Wait, hold on, you're actually serious about this??

Haven't seen something like this in a long time.

Labs have always talked about limiting their internal models' capabilities, but honestly, it always felt like PR posturing.

But this Glasswing Project is the first time I've genuinely felt a chill down my spine.

Such a powerful model, yet tightly held back, creating such a huge stir just to announce a safety plan.

The level of seriousness is completely different. Who would give away so much money just for publicity?

So the Pro and Max users' Token usage, leaking like a sieve, was all to support your security cause, huh (doge).

Fine.

After all, if Mythos's capabilities are truly as announced, the release of this model would undoubtedly completely change the cybersecurity landscape.

AI safety is really no longer some scientists' "unfounded worry." This beast has already let out its first roar from behind the cage bars.

The Glasswing Project is just the starting point.

Anthropic states that cybersecurity is something no single organization can solve alone; frontier AI companies, software vendors, security researchers, the open-source community, and governments worldwide must all participate.

Defending global network infrastructure could be a project lasting years, while AI capabilities may see significant leaps in the coming months.

If we don't want to be turned into sieves by AI, we must act now, immediately, right away.

The Beast is Unleashed

Token usage burning through rapidly, Pro users "downgraded" to free users, OpenClaw cut off from resources, Claude Code inexplicably open-sourced......

Everything, suddenly, makes sense.

Anthropic might truly be swamped and dizzy.

The security issues with Mythos have become so severe that the internal team has no mind for other projects, forced to bet all their available GPUs, all-in on security.

Because—

The beast, is about to break free.

Reference links:[1]https://x.com/alexalbert__/status/2041579938537775160?s=46&t=iTysI4vQLQqCNJjSmBODPw[2]https://www.anthropic.com/glasswing

This article is from the WeChat public account "QbitAI", author: Focus on Frontier Technology

你可能也喜欢

不是涨价，而是断供？油价已经越过临界点

全球油市已越过“临界点”，问题从油价上涨转为实物供应缺口。核心逻辑在于时间错配：即便霍尔木兹海峡恢复通行，油轮周转延迟仍将持续消耗陆上原油库存，导致供应问题滞后显现。炼厂降负荷会加速成品油去库存，推高油价并形成“高油价-利润压缩-去库存-利润修复-再开工”的自我强化循环。若海峡关闭持续至4月后，市场将面临每日1100-1300万桶的供应中断，传统定价机制失效，油价可能远超当前95美元/桶。唯一能重新平衡市场的方式是通过政策强制压制需求（类似疫情封锁），但这也仅能抵消供应缺口而非导致过剩。地缘政治上，美伊双方均无让步迹象，局势可能进一步恶化。关键信号应关注库存骤降和政策干预节奏。

marsbit52分钟前

marsbit52分钟前

Amazon 追加 250 亿美元投资 Anthropic，AI 基础设施「军备竞赛」全面升级

亚马逊宣布向人工智能公司Anthropic追加最高250亿美元投资，其中50亿美元立即到账，其余资金与商业目标挂钩。此次投资基于Anthropic今年2月3800亿美元的估值，加上此前投资，亚马逊对其总投资承诺已达330亿美元。Anthropic承诺未来十年在亚马逊云服务（AWS）上投入超1000亿美元，以获取最高5吉瓦的算力支持，用于其AI模型Claude的训练和部署。此举被视为对OpenAI此前“算力不足”指控的回应，也反映了AI基础设施领域的激烈竞争。亚马逊已在两个月内分别向Anthropic和OpenAI投入巨额资金，并锁定长期云服务协议。Anthropic年化收入已突破300亿美元，但算力瓶颈影响了用户体验，此次合作旨在解决产能问题。此外，Claude平台将直接嵌入AWS，用户可通过现有账户直接使用。目前已有超10万家组织在Amazon Bedrock上运行Claude模型。亚马逊预计2026年资本支出将达约2000亿美元，主要用于AI基础设施建设，其定制芯片业务年化收入已超200亿美元。

marsbit1小时前

Amazon 追加 250 亿美元投资 Anthropic，AI 基础设施「军备竞赛」全面升级

marsbit1小时前

25 年只跟硬件打交道的新 CEO，接管了 4 万亿的 Apple

苹果公司宣布，蒂姆·库克将于9月1日卸任CEO，转任执行董事长，由硬件工程高级副总裁约翰·特努斯接任。特努斯在苹果工作25年，是机械工程专业出身，主导过iPad、AirPods的开发及Mac向自研芯片的迁移，以低调务实著称。库克任内苹果市值从3500亿美元增长至4万亿美元，营收翻近四倍，服务业务年收入超1000亿美元。但他留下两大挑战：AI布局滞后和Vision Pro市场反应冷淡。苹果AI功能多次延期，最终选择与谷歌合作使用Gemini模型驱动Siri，但部分功能仍可能推迟。折叠屏iPhone也面临量产延期问题。特努斯作为硬件专家，有望解决供应链问题，但AI战略是其薄弱环节。苹果调整管理层，将其精力转向AI和整体战略。工程师背景的特努斯需回答“AI在苹果产品中的角色”这一核心问题，其务实风格可能成为优势，但市场留给他的时间有限。

marsbit1小时前

marsbit1小时前

Atkins 执掌 SEC 一周年：加密监管全面转向

2025年4月21日，Paul Atkins就任美国证券交易委员会（SEC）主席满一周年。在其任内，SEC对加密资产的监管政策发生显著转变，与前主席Gary Gensler时期的强硬执法风格形成鲜明对比。 Atkins上任前，SEC已开始调整立场，包括撤销对Coinbase等加密公司的诉讼、成立加密工作组。正式就职后，SEC推出一系列利好政策：终止多项执法行动、批准多种加密ETF、与CFTC签署监管协调备忘录，并明确大多数加密货币不属于证券。尽管加密行业普遍欢迎这些变化，但民主党议员批评Atkins存在利益冲突，指控SEC撤销的部分案件涉及特朗普相关企业。此外，SEC执法行动数量降至十年最低。目前，SEC仍在等待国会通过市场结构法案，以明确其对加密资产的监管权限，现阶段监管仍以行政指导和个案处理为主。

marsbit1小时前

marsbit1小时前

a16z：区块链为 AI 代理基础设施提供帮助的 5 种方式

人工智能代理正迅速从“副驾驶”转变为经济参与者，但其发展速度超过了现有基础设施的支持能力。当前代理缺乏标准化的方法证明身份、权限和跨环境报酬机制，身份无法跨平台共享，支付方式尚未实现可编程，协调工作也各自独立。区块链技术为代理基础设施提供了关键支持，主要体现在五个方面： 1. **非人类身份**：代理经济当前的瓶颈是身份而非智能。区块链提供可移植的身份层、可编程钱包和可验证证明，帮助代理建立跨平台的可信身份（KYA，了解你的代理），解决权限证明和跨平台独立运作问题。 2. **AI运行系统治理**：当代理操作关键系统时，治理成为核心问题。区块链通过链上记录和自动执行决策，确保代理行为符合用户利益，防止模型供应商单方面控制结果，实现真正的去中心化治理。 3. **支付系统空白填补**：代理开始进行无头交易（如数据购买、API调用），稳定币成为默认支付方式。加密支付支持无许可编程和低风险承保，推动代理对代理的商业发展，如x402和MPP协议已处理大量代理驱动支付。 4. **重新定价信任**：当智能成本趋近于零时，验证成为稀缺资源。区块链提供可审计的历史记录和加密收据，确保代理行为的透明度和可追溯性，从而降低规模化风险，将信任硬编码到系统架构中。 5. **保留用户控制权**：随着用户将更多任务委托给代理，加密技术通过智能合约和基于意图的架构（如NEAR Intents）设定代理的行为边界，确保用户保持最终控制权，减少盲目信任。人工智能降低了规模化门槛，但信任建立仍需依赖加密货币和区块链技术，以确保透明度、问责制和用户控制。互联网基础设施的设计需适应非人类行为者，以实现真正的代理经济。

marsbit2小时前

marsbit2小时前

交易

现货

合约

Claude Mythos Officially Announced: Performance Crushes Opus 4.6, 'Imprisoned' for Being Too Dangerous

文章摘要

The Imprisoned Claude Mythos

The Beast is Unleashed

相关问答

你可能也喜欢

不是涨价，而是断供？油价已经越过临界点

Amazon 追加 250 亿美元投资 Anthropic，AI 基础设施「军备竞赛」全面升级

25 年只跟硬件打交道的新 CEO，接管了 4 万亿的 Apple

Atkins 执掌 SEC 一周年：加密监管全面转向

a16z：区块链为 AI 代理基础设施提供帮助的 5 种方式

交易

热门文章

如何购买S

Sonic：Andre Cronje主导升级，逆势上涨的Layer1新星

成长学院：学习“ Sonic“ ，瓜分价值 1000 USDT

相关讨论

热门问答

热门分类

热门标签