Claude Mythos Officially Announced: Performance Crushes Opus 4.6, 'Imprisoned' for Being Too Dangerous

marsbit发布于2026-04-08更新于2026-04-08

文章摘要

Anthropic has officially announced Claude Mythos, a highly advanced AI model that reportedly surpasses Opus 4.6 across multiple benchmarks, including a 24% improvement in bug-fixing capability and a 17% boost in agent-based computer operations. However, due to its dangerous ability to identify and exploit security vulnerabilities at a level exceeding most human hackers, the model has been deemed a significant risk to public and economic security. Instead of releasing it publicly, Anthropic has initiated "Project Glasswing," a collaborative security effort involving major tech companies like Amazon, Apple, Google, Microsoft, and NVIDIA, along with open-source organizations. These partners will use the preview version of Mythos to strengthen defenses and identify vulnerabilities in critical systems before the model is widely available. Anthropic is also providing up to $100 million in usage credits and donating $4 million to support open-source security initiatives. The move underscores the urgent need to address AI safety at a global scale, as rapidly advancing capabilities could soon outpace existing security measures.

The beast is about to break free.

Just moments ago, Anthropic's legendary strongest model—

Claude Mythos—released its preview version.

The rumors were right; this is indeed an epoch-making model.

On various benchmarks, it completely crushes Opus 4.6.

Not just an improvement, it's a true "crushing"......

SWE-bench Pro: Bug-fixing capability surged by 24%

SWE-bench Verified: Tested again with the stricter version, still a 13% improvement.

Terminal-Bench 2.0: Computer operation/Agent capability, improved by 17%.

A model stronger than Opus 4.6? And it's still beating Opus down??

Friends, I simply cannot imagine what this thing will even be like.

Following Sonnet and Opus, OpenAI now meets its strictest father

Mythos.

But this time, Anthropic doesn't want to talk about benchmarks.

Compared to excitement and boasting, the official announcement blog subtly reveals a hint of hard-to-conceal "concern."

Even, panic.

Because this mythical model has a fatal "flaw."

Mythos Preview has already discovered thousands of high-risk vulnerabilities; all major mainstream operating systems and browsers are affected.

That's right. As AI's programming capabilities rapidly improve, making it more and more like a competent engineer, its other, dual-faceted ability is also growing uncontrollably.

Now, Mythos's ability to attack security vulnerabilities far surpasses that of the vast majority of hackers.

AI's capabilities at the code level have become so strong that in finding and exploiting vulnerabilities, it can exceed the vast majority of human engineers, leaving only the top few able to compete.

The other side of the coin has finally been flipped before us.

Anthropic states that at the current pace of AI development, this capability will soon proliferate and could even fall into the hands of those with less than secure intentions.

If it gets out of control, it would be a catastrophic disaster for the economy and public safety.

To give everyone enough time to respond, upon first announcing this new model, Anthropic made a very unusual move for an AI company.

They are temporarily not releasing it to the public.

Yes, Mythos will be part of the Glasswing Project (Project Glasswing1), initially opened to various partners first.

Their task is to build the strongest possible safety guardrails before this beast is fully unleashed.

The Imprisoned Claude Mythos

This is a massive "containment" operation.

To tame the beast that is Mythos, Anthropic has called upon almost all the legacy internet giants.

Participants include: Amazon, Apple, Google, Linux Foundation, Microsoft, NVIDIA......

In this project, there are no bystanders. All partners will deploy Mythos Preview into their own defensive security work.

They will unlock Mythos's true capabilities early, letting it out of its cage.

This is a preemptive training session. Before being riddled by the official version of Mythos, they will use the Mythos preview's capabilities to the fullest to strengthen their defenses.

Security, this time, is for real.

Anthropic says it will share the experience gained from this process so the entire industry can benefit.

The project is also open to over 40 additional builders and maintainers of critical software infrastructure, allowing them to use this model to scan and fortify their own systems and open-source systems.

Even more夸张的是, Anthropic has also pledged up to $100 million in usage credits to support this work and directly donated $4 million to open-source security organizations.

Wait, hold on, you're actually serious about this??

Haven't seen something like this in a long time.

Labs have always talked about limiting their internal models' capabilities, but honestly, it always felt like PR posturing.

But this Glasswing Project is the first time I've genuinely felt a chill down my spine.

Such a powerful model, yet tightly held back, creating such a huge stir just to announce a safety plan.

The level of seriousness is completely different. Who would give away so much money just for publicity?

So the Pro and Max users' Token usage, leaking like a sieve, was all to support your security cause, huh (doge).

Fine.

After all, if Mythos's capabilities are truly as announced, the release of this model would undoubtedly completely change the cybersecurity landscape.

AI safety is really no longer some scientists' "unfounded worry." This beast has already let out its first roar from behind the cage bars.

The Glasswing Project is just the starting point.

Anthropic states that cybersecurity is something no single organization can solve alone; frontier AI companies, software vendors, security researchers, the open-source community, and governments worldwide must all participate.

Defending global network infrastructure could be a project lasting years, while AI capabilities may see significant leaps in the coming months.

If we don't want to be turned into sieves by AI, we must act now, immediately, right away.

The Beast is Unleashed

Token usage burning through rapidly, Pro users "downgraded" to free users, OpenClaw cut off from resources, Claude Code inexplicably open-sourced......

Everything, suddenly, makes sense.

Anthropic might truly be swamped and dizzy.

The security issues with Mythos have become so severe that the internal team has no mind for other projects, forced to bet all their available GPUs, all-in on security.

Because—

The beast, is about to break free.

Reference links:[1]https://x.com/alexalbert__/status/2041579938537775160?s=46&t=iTysI4vQLQqCNJjSmBODPw[2]https://www.anthropic.com/glasswing

This article is from the WeChat public account "QbitAI", author: Focus on Frontier Technology

相关问答

QWhat is the name of the new AI model announced by Anthropic and how does it perform compared to Opus 4.6?

AThe new model is called Claude Mythos. It is described as a groundbreaking model that 'crushes' or 'overwhelmingly outperforms' Opus 4.6 across various benchmarks, with significant improvements in bug-fixing (24% on SWE-bench Pro) and agent-based computer operation tasks (17% on Terminal-Bench 2.0).

QWhat is the major safety concern associated with the Claude Mythos model?

AThe major safety concern is that Claude Mythos possesses an extremely advanced and dangerous capability to discover and exploit high-risk security vulnerabilities. It has already found thousands of such vulnerabilities in major operating systems and browsers, and its ability to attack security flaws surpasses that of most human hackers.

QWhat is the name of the project under which Anthropic is releasing the preview of Claude Mythos, and what is its purpose?

AThe project is called 'Project Glasswing' (or Glasswing). Its purpose is to contain the potential danger of Claude Mythos by first releasing it to a consortium of partners (like Amazon, Apple, Google, etc.) so they can use its capabilities to strengthen their own defensive security measures before the model is publicly released.

QWhich major companies and organizations are mentioned as partners in the 'Glasswing' project to help secure systems against Claude Mythos?

AThe partners mentioned include Amazon, Apple, Google, the Linux Foundation, Microsoft, and NVIDIA. Additionally, over 40 builders and maintainers of critical software infrastructure are involved.

QWhat financial commitments did Anthropic make to support the security efforts outlined in the article?

AAnthropic committed up to $100 million in usage credits to support the defensive security work of its partners. It also directly donated $4 million to open-source security organizations.

你可能也喜欢

a16z:区块链为 AI 代理基础设施提供帮助的 5 种方式

人工智能代理正迅速从“副驾驶”转变为经济参与者,但其发展速度超过了现有基础设施的支持能力。当前代理缺乏标准化的方法证明身份、权限和跨环境报酬机制,身份无法跨平台共享,支付方式尚未实现可编程,协调工作也各自独立。 区块链技术为代理基础设施提供了关键支持,主要体现在五个方面: 1. **非人类身份**:代理经济当前的瓶颈是身份而非智能。区块链提供可移植的身份层、可编程钱包和可验证证明,帮助代理建立跨平台的可信身份(KYA,了解你的代理),解决权限证明和跨平台独立运作问题。 2. **AI运行系统治理**:当代理操作关键系统时,治理成为核心问题。区块链通过链上记录和自动执行决策,确保代理行为符合用户利益,防止模型供应商单方面控制结果,实现真正的去中心化治理。 3. **支付系统空白填补**:代理开始进行无头交易(如数据购买、API调用),稳定币成为默认支付方式。加密支付支持无许可编程和低风险承保,推动代理对代理的商业发展,如x402和MPP协议已处理大量代理驱动支付。 4. **重新定价信任**:当智能成本趋近于零时,验证成为稀缺资源。区块链提供可审计的历史记录和加密收据,确保代理行为的透明度和可追溯性,从而降低规模化风险,将信任硬编码到系统架构中。 5. **保留用户控制权**:随着用户将更多任务委托给代理,加密技术通过智能合约和基于意图的架构(如NEAR Intents)设定代理的行为边界,确保用户保持最终控制权,减少盲目信任。 人工智能降低了规模化门槛,但信任建立仍需依赖加密货币和区块链技术,以确保透明度、问责制和用户控制。互联网基础设施的设计需适应非人类行为者,以实现真正的代理经济。

marsbit2小时前

a16z:区块链为 AI 代理基础设施提供帮助的 5 种方式

marsbit2小时前

交易

现货
合约

热门文章

如何购买S

欢迎来到HTX.com!我们已经让购买Sonic(S)变得简单而便捷。跟随我们的逐步指南,放心开始您的加密货币之旅。第一步:创建您的HTX账户使用您的电子邮件、手机号码注册一个免费账户在HTX上。体验无忧的注册过程并解锁所有平台功能。立即注册第二步:前往买币页面,选择您的支付方式信用卡/借记卡购买:使用您的Visa或Mastercard即时购买Sonic(S)。余额购买:使用您HTX账户余额中的资金进行无缝交易。第三方购买:探索诸如Google Pay或Apple Pay等流行支付方法以增加便利性。C2C购买:在HTX平台上直接与其他用户交易。HTX场外交易台(OTC)购买:为大量交易者提供个性化服务和竞争性汇率。第三步:存储您的Sonic(S)购买完您的Sonic(S)后,将其存储在您的HTX账户钱包中。您也可以通过区块链转账将其发送到其他地方或者用于交易其他加密货币。第四步:交易Sonic(S)在HTX的现货市场轻松交易Sonic(S)。访问您的账户,选择您的交易对,执行您的交易,并实时监控。HTX为初学者和经验丰富的交易者提供了友好的用户体验。

2.0k人学过发布于 2025.01.15更新于 2025.03.21

如何购买S

相关讨论

欢迎来到HTX社区。在这里,您可以了解最新的平台发展动态并获得专业的市场意见。以下是用户对S(S)币价的意见。

活动图片