Claude Mythos Officially Announced: Performance Crushes Opus 4.6, 'Imprisoned' for Being Too Dangerous
Anthropic has officially announced Claude Mythos, an advanced AI model that reportedly surpasses Opus 4.6 across multiple benchmarks, including a 24% improvement in bug-fixing capability and a 17% improvement in agent-based computer operation. However, because the model can identify and exploit security vulnerabilities at a level exceeding most human hackers, it has been deemed a significant risk to public and economic security.
Instead of releasing it publicly, Anthropic has launched "Project Glasswing," a collaborative security effort involving major tech companies such as Amazon, Apple, Google, Microsoft, and NVIDIA, along with open-source organizations. These partners will use the preview version of Mythos to identify vulnerabilities in critical systems and strengthen defenses before the model becomes widely available. Anthropic is also providing up to $100 million in usage credits and donating $4 million to support open-source security initiatives.
The move underscores the urgent need to address AI safety at a global scale, as rapidly advancing capabilities could soon outpace existing security measures.
marsbit, 04/08 00:26