U.S. Government Bans Foreign Nationals from Using Fable 5, Anthropic Issues Rebuttal

链捕手Опубликовано 2026-06-13Обновлено 2026-06-13

Введение

U.S. Government Bans Foreign Access to Fable 5, Anthropic Issues Rebuttal On June 12th, the U.S. government ordered AI company Anthropic to immediately suspend all foreign access—including foreign nationals within the U.S. and Anthropic's own foreign employees—to its newly released Fable 5 and Mythos 5 AI models, citing national security concerns. This forced Anthropic to temporarily disable access to both models for all users globally, as it cannot technically differentiate user nationality at scale. The models, released just three days prior, represent Anthropic's highest public capability tier. Fable 5 is the first publicly available model from the advanced "Mythos" family, while Mythos 5 is a less-restricted version for approved cybersecurity and critical infrastructure partners. The government's directive was reportedly triggered by claims from another company that it could "jailbreak" Mythos 5, raising alarm within the Trump administration. Anthropic, in a detailed public statement, strongly challenged this rationale. The company argues the demonstrated "jailbreak" is a narrow, non-generalized technique that merely involves identifying minor, known software vulnerabilities—a capability common to other publicly available models like OpenAI's GPT-5.5 and routinely used by cybersecurity defenders. Anthropic stated it has complied with the order but disagrees with the government's standard, warning that applying it industry-wide would halt all new frontier model deploym...

Author: Xiong Lei

Editor: Xu Qingyang

On June 12th, U.S. time, a rare direct confrontation between government and industry in the history of AI regulation erupted abruptly this Friday.

The U.S. government, citing national security concerns, issued an export control directive to Anthropic, demanding the immediate suspension of all access to the Fable 5 and Mythos 5 AI models by foreign entities. The directive's scope is extremely broad, applying not only to foreign users outside the United States but also to foreign citizens within U.S. territory, even including foreign employees within Anthropic itself.

As a compliance measure, Anthropic had to shut down access to these two models for all users—the only feasible solution currently available to ensure compliance, as the company lacks the technical capability to precisely distinguish between "foreign entity" and "U.S. citizen" user groups. Access to the company's other models remains unaffected, with users automatically falling back to Claude Opus 4.8.

This emergency suspension came as a sudden shock. Fable 5 and Mythos 5 were officially released on June 9th, only three days prior. The abrupt removal of these two models has caused widespread reverberations within the tech industry and AI community.

01 What Exactly Are These Two Models?

To understand the core tension of this incident, one must first understand what kind of models Fable 5 and Mythos 5 are, and why they have been under the regulatory spotlight from the beginning.

Mythos is a new model family from Anthropic that sits at a higher capability tier than the Opus series, representing the highest capability level currently deployable by Anthropic for public use. The first Mythos-class model, Claude Mythos Preview, was released in April this year through the "Project Glasswing" initiative, with access strictly limited to a select few partners, citing its overly powerful capabilities in cybersecurity as unsuitable for broad release.

Fable 5 is the first Mythos-level model officially released to the general public, surpassing all previously available models from Anthropic in capabilities, achieving top-tier industry benchmarks in nearly all tests, including software engineering, knowledge work, visual understanding, and scientific research.

To enable public release, Anthropic equipped Fable 5 with dedicated safety guardrails—in high-risk areas such as cybersecurity, biology, and chemistry, the model automatically blocks responses and falls back to Claude Opus 4.8 for processing.

Mythos 5 is a version built on the same underlying model but with fewer safety guardrails, available only to approved institutions that previously had access to Project Glasswing. It is positioned as a professional tool for cybersecurity defenders and critical infrastructure operators. Both models are priced identically at $10 per million input tokens and $50 per million output tokens.

02 The Trigger for the Directive

According to reports, U.S. Secretary of Commerce Howard Lutnick sent a letter to Anthropic CEO Dario Amodei (Dario Amodei) on June 12th, announcing that Mythos 5 and Fable 5 would be subject to export controls. The immediate trigger for this decision was a claim from another company that it could "jailbreak" Mythos, alerting the Trump administration to potential national security risks.

It is reported that the Trump administration had previously attempted to prevent Anthropic from releasing these two models but was unsuccessful—prompting the subsequent, more forceful measure of export controls.

Faced with this sudden directive, Anthropic, while complying with its execution, issued an unusually strongly worded, lengthy statement, systematically rebutting the government's rationale.

Anthropic argues that the government's evidence of a "jailbreak" pertains only to a very narrow, non-generalizable attack method. Essentially, it involves instructing the model to read a specific code repository and fix software vulnerabilities within it—a capability that also exists in other publicly available models, including OpenAI's GPT-5.5, and is used daily by cybersecurity defenders for normal system maintenance.

Anthropic explicitly stated in its declaration that if "the existence of a limited-scope potential jailbreak" were to become the standard for recalling already-deployed commercial models, it would effectively bring all new deployments of frontier models across the entire industry to a halt. The company also emphasized that this government action did not follow the transparent, fair, and technically fact-based legal procedures it had previously publicly called for.

Anthropic stated it would comply with the government's lawful directive while actively engaging in communication with the government to work towards restoring access as soon as possible, promising to release more technical details within the next 24 hours. The company believes this incident stems from a misunderstanding and deeply apologizes for the inconvenience caused to users.

The following is the full text of the statement released by Anthropic, titled "Statement Regarding the U.S. Government's Directive to Suspend Access to Fable 5 and Mythos 5":

The U.S. government, citing national security authority, issued an export control directive requiring the suspension of access to Fable 5 and Mythos 5 for all foreign nationals, regardless of whether they are inside or outside the United States, including foreign employees of Anthropic. The practical effect of this directive is: we must immediately shut down Fable 5 and Mythos 5 for all users to ensure compliance. Other Anthropic models are unaffected.

We received this directive at 5:21 PM Eastern Time today. The letter did not specify the specific national security concerns. To our understanding, the government believes it possesses a method to bypass Fable 5's safety guardrails, referred to as a "jailbreak" technique. We reviewed a demonstration of this technique and found it only capable of identifying a small number of previously known, low-impact vulnerabilities. These vulnerabilities are all relatively simple; other publicly available models can identify the same issues without requiring a jailbreak.

In our Fable launch blog post, Anthropic articulated our position regarding its safety guardrails, as follows:

We have established robust safety guardrails that significantly reduce the likelihood of Fable being misused for cybersecurity-related tasks, among others. In fact, some users have reported that our guardrails are overly strict.

For weeks prior to Fable's launch, Anthropic collaborated with the U.S. government, the UK's AI Safety Institute, multiple third-party organizations, and internal teams, conducting cumulative thousands of hours of red-teaming on Fable's safety guardrails. The results showed that Fable's safety guardrails were markedly more effective than those of any previously deployed model.

To date, no testers have been able to find a "generalized jailbreak" method—that is, a jailbreak capable of comprehensively breaching the model's safety guardrails and unlocking a wide range of cyberattack capabilities.

We believe that, currently, no model provider can achieve perfect jailbreak protection. All guardrail mechanisms in the industry are susceptible to being bypassed by "non-generalized jailbreaks" (i.e., obtaining limited cyber information under specific circumstances), and generalized jailbreak methods may also emerge in the future. We made this clear at the time of Fable 5's launch.

Given that perfect jailbreak protection is currently infeasible, Anthropic adopted a defense-in-depth strategy for Fable 5. Our goal is to make jailbreak attacks either extremely narrow in scope (for non-generalized jailbreaks) or prohibitively expensive (for generalized jailbreaks), complemented by comprehensive monitoring to quickly detect and contain any successful attacks. This is also the reason Anthropic requires customer data retention for 30 days—although this policy has real impacts on our customer relationships, it aids our research and response to jailbreak risks.

We stand by this defense-in-depth strategy. It effectively reduces the risks posed by Fable to a level comparable with existing deployed models in the industry.

To date, we have not even received any formal disclosure regarding a non-generalized jailbreak that could lead to harmful consequences. The potential jailbreak cases disclosed to us are either entirely harmless or constitute only minor findings that do not demonstrate capabilities exclusive to Mythos.

Currently, the U.S. government has only provided us with verbal evidence regarding a potential, limited-scope, non-generalized jailbreak method. Its essence is instructing the model to read a specific code repository and fix software vulnerabilities within it. To our knowledge, this potential jailbreak method has been shared with the government. We reviewed a report—which we believe forms the basis for the government's directive—and have verified that the level of capability demonstrated in the report is similarly prevalent in other models (including OpenAI's GPT-5.5) and is used daily by cybersecurity defenders to maintain system security. We will release more details within the next 24 hours.

We will comply with the government's lawful directive and shut down access to Fable 5 and Mythos 5 for all users. However, we do not agree that the discovery of a limited-scope potential jailbreak method is sufficient grounds to recall a commercial model already deployed to hundreds of millions of users. Applying this standard across the industry would effectively bring all new model deployments by frontier model providers to a complete standstill.

We have publicly stated that governments should have the authority to prevent the deployment of unsafe models based on transparent, fair, clear, and technically fact-based legal procedures. This action did not adhere to those principles.

We deeply apologize for the inconvenience caused to our users. We believe this incident stems from a misunderstanding and are actively working to restore access as soon as possible.

Связанные с этим вопросы

QWhat was the core reason given by the U.S. government for imposing export controls on Anthropic's Fable 5 and Mythos 5 AI models?

AThe U.S. government cited national security concerns as the core reason. The decision was triggered by another company's claim of a potential "jailbreak" method against the Mythos model, which raised alarms within the Trump administration about potential security risks.

QWhat immediate action did Anthropic take in response to the U.S. government's export control directive regarding Fable 5 and Mythos 5?

AIn compliance with the directive, Anthropic immediately shut down access to both the Fable 5 and Mythos 5 models for all users globally. This was because the company's technical systems could not precisely distinguish between "foreign persons" and "U.S. citizens." Access to their other models, like Claude Opus 4.8, remained unaffected.

QWhat is the main argument in Anthropic's rebuttal statement against the government's rationale for the export controls?

AAnthropic argues that the potential jailbreak method cited by the government is extremely narrow and non-generalized. The company claims the demonstrated ability (analyzing a codebase to fix software bugs) is common in other publicly available AI models, including OpenAI's GPT-5.5, and is routinely used by cybersecurity defenders for legitimate system maintenance. They contend that using this as a standard would halt all new deployments of frontier models across the industry.

QWhat distinguishes the Fable 5 model from the Mythos 5 model as described in the article?

AFable 5 is the first publicly released model from Anthropic's higher-tier Mythos family. It has strong safety guardrails that automatically block responses in high-risk areas like cybersecurity, biology, and chemistry. Mythos 5, based on the same underlying model, is a version with fewer safety restrictions, intended as a professional tool accessible only to pre-approved organizations, such as cybersecurity defenders and critical infrastructure operators.

QAccording to Anthropic, what key principle was lacking in the government's action to impose the export controls?

AAnthropic stated that the government's action did not follow the principles of a transparent, fair, clearly-defined, and technically-grounded statutory process. The company believes it has the right, based on such a proper process, to block unsafe model deployments, but argues this specific action lacked that due process.

Похожее

За кулисами оценок ИИ стоит китайский «составитель заданий»

За заголовками AI-бенчмарков, таких как MMLU-Pro, MMMU и MMMU-Pro, стоит имя китайского исследователя Вэньху Чэня, доцента Университета Ватерлоо и основателя TIGER Lab. Его работа фокусируется на создании точных и сложных тестов для оценки способностей языковых и мультимодальных моделей. Когда старый стандарт MMLU перестал эффективно различать передовые модели, Чэнь и его команда разработали MMLU-Pro. Этот новый бенчмарк, содержащий более 12 000 вопросов, сделал задачи сложнее и стабильнее, что позволило снова выявить разницу между моделями, которые на старом тесте показывали почти идеальные результаты. В области мультимодального ИИ команда Чэня создала бенчмарк MMMU, который проверяет способность моделей понимать и рассуждать на основе комбинации текста, изображений, схем и других визуальных данных из профессиональных областей. Даже самые мощные модели изначально показывали на нём низкую точность, что выявило значительный пробел в их реальных способностях. Последующее обновление, MMMU-Pro, ещё больше усложнило задачу, предотвращая попытки моделей игнорировать визуальную информацию. Исследовательский интерес Чэня к сложным вопросам, требующим рассуждений и синтеза информации из разных источников, сформировался ещё во время его работы в Калифорнийском университете и позже в Google DeepMind над проектом Gemini. Этот практический опыт в разработке моделей помогает ему создавать более эффективные и «защищённые от обмана» тесты. Помимо оценки, его лаборатория также занимается исследованиями в области создания моделей, например, для работы с видео. В настоящее время Чэнь работает в лаборатории суперинтеллекта Meta, продолжая исследования в области данных и оценки для мультимодального ИИ. Его история является примером того, как китайские специалисты вносят ключевой вклад в развитие ИИ на фундаментальном уровне, часто оставаясь за кадром публичных обсуждений.

marsbit5 мин. назад

За кулисами оценок ИИ стоит китайский «составитель заданий»

marsbit5 мин. назад

STRC отвязался на 11%: продолжится ли вечное движение стратегии?

Акции привилегированного типа STRC компании MicroStrategy в настоящее время находятся в состоянии устойчивого «отрыва» от целевой номинальной стоимости в 100 долларов. С 15 мая скидка достигла более 11%, что ставит под сомнение ключевую цель дизайна данного продукта — функционирование в качестве ценной бумаги с доходом, стабильной рядом с номиналом. STRC является основным двигателем капитальной «маховика» MicroStrategy, позволяя через механизм ATM привлекать фиатные средства без размывания доли обычных акционеров (MSTR) для последующего увеличения резервов биткойна. Нынешний отрыв от номинала, несмотря на повышение дивидендной ставки до 11,5% и увеличение частоты выплат, сигнализирует о том, что рынок оценивает риски, выходящие за рамки самой доходности. Среди причин — технические факторы, такие как распродажи для снижения левериджа, и более глубокие опасения относительно ликвидности компании. Анализ JPMorgan указывает на ограниченность денежных резервов для покрытия дивидендных обязательств, в то время как MicroStrategy апеллирует к своим обширным резервам биткойна. Ключевой вопрос для рынка — будет ли компания в будущем вынуждена продавать биткойн для выполнения обязательств, что противоречит её прежнему нарративу о долгосрочном стратегическом холдинге. Устойчивый дисконт STRC ослабляет способность MicroStrategy к привлечению средств. Если эта ситуация продолжится, а денежные резервы будут истощаться, опасения относительно потенциальных продаж биткойна усилятся, что может превратить MicroStrategy из крупнейшего маржинального покупателя в продавца, оказывая значительное давление на рынок биткойна.

Odaily星球日报18 мин. назад

STRC отвязался на 11%: продолжится ли вечное движение стратегии?

Odaily星球日报18 мин. назад

В Кремниевой долине появилась самая востребованная новая вакансия

В Силиконовой долине растет спрос на новую роль — инженеров прямого развертывания (FDE). Это отражает ключевой сдвиг в ИИ-индустрии: акцент смещается с создания моделей на их практическое внедрение в бизнес. Крупнейшие компании, такие как OpenAI, Anthropic и Google, активно создают подразделения и нанимают FDE. Их задача — работать на местах у клиентов, интегрируя ИИ в сложные бизнес-процессы, преодолевая организационные барьеры, а не технические. Опыт Palantir показал, что успех зависит от понимания структуры компании, данных и рабочих потоков. Основная проблема внедрения ИИ — не технологии, а сопротивление внутри организации, распределение ответственности и устаревшие процессы. Таким образом, FDE становятся ключевыми фигурами в преобразовании бизнеса, сочетая технические знания с умением управлять изменениями в организации.

marsbit18 мин. назад

В Кремниевой долине появилась самая востребованная новая вакансия

marsbit18 мин. назад

Когда Кубок мира встречается с Agent: от Web2 к Web3, как кошельки идут к Agentic Wallet?

Чемпионат мира по футболу — это подходящий контекст для наблюдения за эволюцией кошельков Web3. Такие события, как выход команд в плей-офф или изменение коэффициентов, на платформах прогнозных рынков, подобных Polymarket, превращаются в торгуемые активы. Интеграция подобных активностей в основные кошельки Web3 может служить облегчённым входом для пользователей в ончейн-взаимодействие. Более значимым и ранним изменением становится внедрение AI Agent в сценарии использования кошельков, что меняет способ взаимодействия пользователей с блокчейном. Например, imToken в рамках активности, связанной с ЧМ, начал экспериментировать с AI Agent, позволяя пользователям через веб-интерфейс или Discord в естественной форме выражать намерение (например, «какие ставки на матч Португалии?»), после чего Agent помогает выстроить путь выполнения и незаметно вернуть пользователя в цепочку для завершения транзакции. Это можно рассматривать как раннюю форму «Agentic Wallet» — будущий Web3-кошелёк может не ограничиваться приложением, а становиться повсеместной «AI-формой кошелька». Ключевое изменение заключается в переходе от кошелька как «меню функций» к «интерпретатору намерений». Пользователь выражает цель на естественном языке, а Agent разбивает её на последовательность шагов, которую кошелёк затем превращает в ончейн-действия. Это отличается от простого добавления чат-бота. Тенденция выходит за рамки Web3. Такие игроки, как Mastercard с их решением Agent Pay for Machines, работают над инфраструктурой для авторизованных и контролируемых платежей через AI Agent, что подчёркивает растущую коммерческую значимость этого направления. В Web3-контексте вопросы доверия, разрешений и безопасности становятся ещё более критичными, поскольку транзакции необратимы. Таким образом, главный вызов для Agentic Wallet — не автоматизация как таковая, а определение границ и сохранение контроля пользователя. Кошелёк должен обеспечивать прозрачность: какой Agent что может делать, какие разрешения выданы, на какой срок, и позволять пользователю легко приостанавливать или отменять действия. Чем мощнее Agent, упрощающий сложные процессы, тем важнее роль кошелька как последнего защитного барьера, переводящего технические детали в понятные для пользователя условия и предупреждения. Эксперименты на таких событиях, как Чемпионат мира, являются отправной точкой для этой эволюции.

marsbit1 ч. назад

Когда Кубок мира встречается с Agent: от Web2 к Web3, как кошельки идут к Agentic Wallet?

marsbit1 ч. назад

Не работает ли опцион в DeFi? Виталик, возможно, так не думает

На протяжении долгого времени опционы в DeFi не были популярным инструментом из-за сложности, низкой ликвидности и недостатка естественного спроса. Однако Виталик Бутерин в своем последнем предложении по алгоритмическим стейблкоинам открыл новую возможность: использовать опционы не как самостоятельный продукт для торговли, а как базовый финансовый модуль для стейблкоинов, доходных продуктов и структурированных активов. В его дизайне стейбл-актив по сути представляет собой синтетический покрытый колл-опцион: 1 ETH делится на две части — одна обеспечивает «стабильную стоимость» ниже определенного страйка, а другая получает прибыль от роста выше страйка. Поскольку сумма двух частей всегда равна 1 ETH, система не требует долга, маржи или ликвидации, избегая ключевого риска традиционных CDP-стейблкоинов. Однако основная сложность заключается в том, что для поддержания стабильности стейбл-активу необходимо постоянно пролонгировать глубокие ITM-колл опционы, что создает проблемы с проскальзыванием, фронт-раннингом и ликвидностью. Кроме того, для каждого стейбл-актива должен существовать держатель соответствующего апсайд-актива — leveraged long позиции по ETH без финансирования и риска ликвидации. Устойчивый спрос на такую позицию определяет возможность масштабирования системы. Опыт Rysk показывает, что опционы в DeFi сложно масштабировать как прямой торговый инструмент, но они могут стать эффективной инфраструктурой для более сложных продуктов, таких как стейблкоины или структурированная доходность. Возможность опционов заключается не в том, чтобы стать следующим перпетуал-контрактом, а в том, чтобы стать движком ценообразования и распределения рисков для следующего поколения ончейн-финансовых продуктов.

marsbit2 ч. назад

Не работает ли опцион в DeFi? Виталик, возможно, так не думает