Anthropic Open Letter: The Hypocritical Sam Altman, Master of PUA

marsbit2026-03-05 tarihinde yayınlandı2026-03-05 tarihinde güncellendi

Özet

Anthropic CEO Dario Amodei accuses OpenAI and Sam Altman of hypocrisy and "safety theater" in a leaked internal memo. The email follows the Pentagon’s termination of its contract with Anthropic over safety terms, while OpenAI secured a deal with the Defense Department hours later. Amodei claims OpenAI’s model has "no legal use restrictions" and relies on a superficial "safety layer"—such as refusal mechanisms or third-party classifiers—which he deems 80% performative and only 20% effective. He argues these measures fail to prevent misuse in mass surveillance or autonomous weapons, as models lack contextual awareness and are vulnerable to jailbreaks. He reveals the Pentagon rejected similar safety clauses proposed by Anthropic, particularly one targeting "analysis of bulk acquired data." Amodei alleges the Pentagon and OpenAI coordinated narratives to downplay risks, claiming existing laws already prohibit misuse—a stance he disputes. Amodei accuses Altman of gaslighting, political maneuvering, and undermining Anthropic’s stance while appealing to OpenAI employees with performative safety pledges. He links OpenAI’s favorable treatment to its political donations and alignment with the Trump administration’s agenda, contrasting it with Anthropic’s refusal to compromise on ethical red lines.

Editor's Note: Just hours after OpenAI announced its AI cooperation agreement with the Pentagon, the Pentagon had just terminated its cooperation with Anthropic on the grounds that Anthropic insisted on security terms. Subsequently, Anthropic CEO Dario Amodei sent an unusually fierce internal memo to employees, directly pointing out that most of the "security mechanisms" proclaimed by OpenAI are merely "security theater," and questioning its stance on autonomous weapons and mass surveillance.

In this approximately 1600-word email, Amodei not only disclosed some details of the negotiations between the two parties and the U.S. defense system but also directly targeted OpenAI CEO Sam Altman, accusing him of using public relations narratives to cover up the true structure of the cooperation. This controversy surrounding AI military applications, security red lines, and political relations is pushing the differences between the two major AI companies in Silicon Valley into the open.

The following is the original text:

I want to be very clear about the information OpenAI is currently releasing and the hypocrisy that exists within this information. This is their true practice, and I hope everyone can see it clearly.

Although there is still much we do not know about their contract with the War Department (DoW) (and they themselves may not even be fully aware, as the contract terms are likely quite vague), a few things are certain: from the public descriptions by Sam Altman and the War Department (of course, the contract text would be needed for final confirmation), their cooperation model is roughly as follows: the model itself has no legal restrictions on use, the so-called "all legal uses"; simultaneously, a so-called "security layer" is set up. In my view, this "security layer" is essentially the model's refusal mechanism, used to prevent the model from completing certain tasks or participating in certain applications.

The so-called "security layer" could also refer to the solution that partners (e.g., Palantir, Anthropic's commercial partner when serving U.S. government clients) tried to sell to us during negotiations. They proposed a classifier or machine learning system, claiming it could allow certain applications to pass while intercepting others. Furthermore, there are indications that OpenAI will assign employees (FDEs, or Frontline Deployment Engineers) to supervise the model's use to prevent improper applications.

Our overall assessment is: these solutions are not completely useless, but in the context of military applications, about 20% is real protection, 80% is security theater.

The root of the problem is: whether a model is used for mass surveillance or fully autonomous weapon systems often depends on broader contextual information. The model itself does not know what kind of system it is in; it does not know if a human is "in the loop" (the key issue for autonomous weapons); nor does it know the source of the data it is analyzing. For example, is it domestic U.S. data or foreign data, is it data provided by companies with user consent, or data purchased through gray channels, etc.

Personnel working in security are already deeply aware of this: model refusal mechanisms are unreliable. Jailbreak attacks are very common; often, one only needs to misrepresent the nature of the data to the model to bypass these restrictions.

There is another key distinction that makes the problem more complex than ordinary security protection: judging whether a model is executing a cyber attack can often be discerned from the input and output; but judging the nature of the attack and the specific context is a completely different matter, and this is precisely the judgment capability needed here. In many cases, this task is extremely difficult, or even impossible.

The "security layer" Palantir tried to sell us (I imagine they pitched a similar solution to OpenAI) is even worse. Our assessment is that this is almost entirely security theater.

Palantir's basic logic seems to be: "You might have some disgruntled employees in your company; you need to give them something to appease them, or make what's happening invisible to them. This is the service we provide."

As for the issue of having Anthropic or OpenAI employees directly supervise deployments, we also had internal discussions months ago when expanding the Acceptable Use Policy (AUP) for classified environments. The conclusion was very clear: this method is only feasible in very few cases. We will try our best, but it is by no means a core safeguard mechanism to rely on, especially in classified environments. By the way, we are indeed already doing this as much as possible; in this regard, we are no different from OpenAI.

Therefore, what I want to say is: the measures taken by OpenAI basically cannot solve the problem.


The essential reason they accept these solutions, while we do not, is: they are concerned with appeasing employees, while we genuinely care about preventing misuse.

These solutions are not without value; we use some of them ourselves, but they fall far short of the required security standards. At the same time, the War Department clearly did not treat OpenAI and us consistently.

In fact, we tried to include security terms similar to OpenAI's in our contract (as a supplement to the AUP. In our view, the AUP is the more important part), but the War Department refused. The evidence is in the email discussion chain from that time. As I am very busy now, I might ask a colleague to find the specific wording later. Therefore, the claim that "OpenAI's terms were offered to us and we refused" is not true; similarly, the claim that "OpenAI's terms can effectively prevent mass domestic surveillance or fully autonomous weapons" is also not true.

Furthermore, Sam and OpenAI's statements also imply that the red lines we proposed, namely fully autonomous weapons and mass domestic surveillance, are themselves illegal, making related use policies redundant. This rhetoric is almost completely consistent with the War Department's statements, seeming like they were coordinated in advance.

But this does not align with the facts.

As we explained in our statement yesterday, the War Department does indeed have the authority to conduct domestic surveillance. In the past, in the pre-AI era, the impact of these authorities was relatively limited, but in the AI era, their significance is completely different.

For example: The War Department can legally purchase large quantities of private data of U.S. citizens from suppliers (these suppliers typically obtain resale rights through obscure user consent clauses), then use AI to conduct large-scale analysis of this data to build citizen profiles, assess political tendencies, track movements in physical space—the data they can obtain even includes GPS information, etc.

Another point worth noting: near the end of the negotiations, the War Department proposed that if we deleted a specific clause in the contract regarding "analysis of bulk acquired data," they would be willing to accept all our other terms. And this clause was precisely the only one in the contract that accurately corresponded to the scenario we were most concerned about. We found this very suspicious.

On the issue of autonomous weapons, the War Department claims that "human-in-the-loop" is a legal requirement. But this is not the case. It is actually just a Pentagon policy from the Biden administration era, requiring human involvement in weapon launch decisions. And this policy can be unilaterally modified by the current Secretary of Defense, Pete Hegseth—this is what we are truly worried about. Therefore, from a practical perspective, this is not a real constraint.

The大量 (large amount of) public relations rhetoric from OpenAI and the War Department on these issues is either lying or deliberately creating confusion. These facts reveal a pattern of behavior, a pattern I have seen many times in Sam Altman. I hope everyone can recognize it.

This morning, he first stated that he agrees with Anthropic's red lines. The purpose of doing this is to appear supportive of us, thereby claiming some credit, while avoiding criticism when they take over this contract. He also tried to portray himself as someone who wants to "establish uniform contract standards for the entire industry"—playing the peacemaker and dealmaker.

But behind the scenes, he is signing a contract with the War Department, preparing to replace us the moment we are marked as a supply chain risk.

At the same time, he must ensure this process doesn't look like "OpenAI abandoned the bottom line while Anthropic stuck to its red lines." He can achieve this because:

First, he can sign all the "security theater" measures we refused, and the War Department and its partners are willing to cooperate, packaging these measures credibly enough to appease his employees.

Second, the War Department is willing to accept some terms he proposed, while they refused the same content when we proposed it.

It is these two points that allow OpenAI to reach an agreement, while we cannot.

The real reasons the War Department and the Trump administration dislike us are: we did not make political donations to Trump (while OpenAI and Greg Brockman donated a lot); we did not offer dictatorial praise for Trump (while Sam did); we support AI regulation, which goes against their policy agenda; we choose to tell the truth on many AI policy issues (e.g., AI's impact on job displacement); and, we indeed坚守 (adhered to) the red lines, rather than制造 (creating) "security theater" with them to appease employees.

Sam is now trying to describe all this as: we are difficult to work with, we are rigid, we lack flexibility, etc. I hope everyone recognizes that this is a classic case of gaslighting.

Vague statements like "someone is difficult to work with" are often used to cover up the真正难看的 (truly ugly) reasons—the ones I just mentioned: political donations, political loyalty, and security theater.

Everyone needs to understand this and refute this narrative when communicating privately with OpenAI employees.

In other words, Sam is undermining our position under the guise of "supporting us." I hope everyone remains清醒 (clear-headed) about this: he is making it easier for the government to punish us by weakening public support for us. I even suspect he might be暗中推波助澜 (secretly fanning the flames), although I currently have no direct evidence for this.

At the public and media level, this rhetoric and manipulation seem to have failed. Most people view OpenAI-War Department deal with caution, even unease, and see us as the principled party (by the way, we are now number two on the App Store download charts).

[Note: Claude later rose to number one on the App Store.]

Of course, this narrative has worked on some fools on Twitter, but that's not important. What I'm truly worried about is: ensuring it doesn't gain traction among OpenAI's own employees.

Due to selection effects, they are already a group relatively easy to persuade. But it is still very important to refute the narratives that Sam is currently peddling to his own employees.

İlgili Sorular

QWhat is the core accusation that Anthropic's CEO Dario Amodei makes against OpenAI and Sam Altman in the internal memo?

ADario Amodei accuses OpenAI and Sam Altman of hypocrisy and 'safety theater,' claiming that the safety mechanisms OpenAI touts for its Pentagon contract are largely performative (80% theater, 20% real) and are designed to placate employees rather than effectively prevent AI misuse in military applications like autonomous weapons and mass surveillance.

QAccording to the memo, what key distinction does Amodei draw between Anthropic's and OpenAI's approaches to the Pentagon contract?

AAmodei states that the key distinction is that OpenAI accepted 'safety theater' measures to appease its workforce and secure the contract, while Anthropic refused to compromise on its core redlines (preventing use in autonomous weapons and mass domestic surveillance) and was consequently dropped by the Pentagon for being 'difficult.'

QWhat two specific applications does Amodei identify as Anthropic's non-negotiable redlines that the Pentagon allegedly wanted to circumvent?

AThe two non-negotiable redlines are the use of AI for fully autonomous weapons systems and for mass domestic surveillance, particularly the 'analysis of bulk acquired data' from US citizens.

QWhat reason does Amodei suggest is the *real* reason the Pentagon preferred OpenAI over Anthropic, beyond the stated contractual disagreements?

AAmodei suggests the real reasons are political: OpenAI and its executives made political donations to and offered 'authoritarian praise' for Donald Trump, while Anthropic did not, supported AI regulation, and was honest about AI's impact on jobs—stances that clashed with the Trump administration's agenda.

QWhat rhetorical tactic does Amodei accuse Sam Altman of using to undermine Anthropic's position and justify OpenAI's actions?

AAmodei accuses Sam Altman of 'gaslighting' by publicly claiming to support Anthropic's redlines while simultaneously portraying Anthropic as being 'difficult to work with' in order to obscure the true reasons for the Pentagon's decision (political alignment and performative safety measures) and weaken public and internal support for Anthropic.

İlgili Okumalar

The "Impossible Triad" Is Fundamentally a Pseudo-Problem

The article argues that blockchain's fundamental limitation is not the scalability trilemma (decentralization, scalability, security), which has been largely solved, but the lack of **privacy** and, until recently, clear **legitimacy**. Blockchain is described as a slow, expensive, globally shared computer whose core value is censorship resistance and verifiability. While ideal for native digital assets like money (e.g., stablecoins), its default transparency acts as a **tax**, exposing all transactions and enabling MEV extraction, which deters serious institutional capital. Simultaneously, its permissionless nature created regulatory ambiguity. The piece contends that **privacy** is the missing critical feature. It rejects the false choice between total transparency and complete anonymity. Modern cryptography (like zero-knowledge proofs) enables **compliant privacy**: users can prove facts (solvency, KYC status, compliance) without revealing the underlying sensitive data (specific holdings, identities). This preserves auditability for regulators and eliminates the leak of financial information. With recent regulatory progress (e.g., the GENIUS Act) addressing legitimacy, adding default, provably compliant privacy becomes a pure upgrade. It transforms blockchain from a costly, public ledger into a confidential settlement layer, finally bridging the gap to mainstream institutional and individual adoption of on-chain finance.

链捕手4 saat önce

The "Impossible Triad" Is Fundamentally a Pseudo-Problem

链捕手4 saat önce

Optical Chips: Collective Capacity Expansion

The global optical chip industry is experiencing a massive wave of expansion driven by surging AI data center demand. Major players across the US, Japan, Europe, and China are aggressively investing to ramp up production capacity. In the US, Coherent is expanding its 6-inch Indium Phosphide (InP) semiconductor fab in Texas, supported by CHIPS Act funding and a $2 billion strategic investment from NVIDIA. Lumentum is building a new factory for InP optical devices, and Nokia is scaling its advanced photonic chip packaging and testing capabilities. NVIDIA's investments aim to secure future supply of critical lasers and optical interconnect products for AI infrastructure. Japan's JX Advanced Metals, a leading InP substrate supplier, plans a multi-billion yen investment to increase its capacity 7-10 times, strengthening its grip on the crucial upstream materials market. In Europe, IQE and Tower Semiconductor settled a patent dispute and signed a multi-year InP epitaxial wafer supply agreement, highlighting that next-generation silicon photonics platforms will integrate high-performance InP components. STMicroelectronics and Sivers Semiconductors are also expanding silicon photonics production and partnerships. China is rapidly building out its domestic supply chain. Dongshan Precision's subsidiary, Source Photonics, announced a $12 billion project to expand optical chip and module production. Companies like Sanan Optoelectronics and Yunnan Germanium are scaling up InP chip manufacturing and substrate production, moving towards vertical integration from materials to modules. While debate continues around the exact future architecture—whether CPO (Co-Packaged Optics), NPO, or pluggables will dominate—analysts like Morgan Stanley argue the underlying driver is unchangeable: the explosive growth in bandwidth demand. This will inevitably increase the volume of optical engines, lasers, and related content per GPU, regardless of the final technical path. The competition for "more light" in the AI era has intensified into a global, full-chain capacity race.

marsbit6 saat önce

Optical Chips: Collective Capacity Expansion

marsbit6 saat önce

Stablecoins Finally Find Real Yield: An In-Depth Look at On-Chain Reinsurance Re | A Conversation with Re Founder Karan Saroya

Stablecoin Real Yield Found: A Deep Dive into On-Chain Reinsurance with Re's Karan Saroya As stablecoin supply exceeds $170 billion, the search for sustainable, non-speculative yield intensifies. Re, an on-chain reinsurance platform, provides an answer: connecting stablecoin capital to the trillion-dollar traditional reinsurance market. Re operates as a regulated reinsurer, accepting stablecoin deposits as collateral to back US insurance companies. These insurers pay premiums, generating yield that flows back to on-chain depositors. Currently supporting 35 insurers and underwriting $500 million, Re projects scaling to over $1 billion soon. Key insights from a Bankless podcast with founder Karan Saroya and investor Avichal of Electric Capital: 1. **Uncorrelated, Real-World Yield:** Re offers stablecoin holders access to reinsurance returns (targeting 12-14%+), an asset class entirely separate from crypto or equity markets. 2. **Operational Efficiency via Smart Contracts:** Re replaces traditional, labor-intensive capital fundraising with smart contracts, allowing a ~12-person team to compete with industry giants. 3. **Regulatory Leverage:** For every $1 of collateral, regulations allow backing $5-7 in written premiums. This leverage amplifies returns from the underlying risk-free rate. 4. **DeFi Integration:** Depositors receive receipt tokens, which can be used in protocols like Morpho for "looping," potentially pushing yields to 18-20%+. 5. **The "DeFi Mullet" Model:** A compliant front-end (regulated reinsurer) paired with a decentralized back-end (smart contracts, DeFi capital markets). 6. **RE Governance Token:** Modeled on Lloyd's of London, the token governs the central capital pool's allocation, counterparty acceptance, and parameters. 7. **Real Economic Impact:** Capital funds real-world productivity (factories, clinics, businesses) via insurance, moving beyond crypto's internal loops. The discussion highlights a pivotal moment: DeFi's supply-side infrastructure is now met by real demand for productive yield, potentially kickstarting a flywheel where vast on-chain stablecoin capital seeks these real-world returns.

链捕手8 saat önce

Stablecoins Finally Find Real Yield: An In-Depth Look at On-Chain Reinsurance Re | A Conversation with Re Founder Karan Saroya

链捕手8 saat önce

1996 or 1999? Walsh's First Test is 'How to View AI'

"1996 or 1999? Wall's First Big Test Is 'How to View AI'" Federal Reserve Chairman Wall's initial challenge is not whether to raise or cut rates, but a more fundamental judgment: what kind of boom is the current AI boom? This will determine the Fed's policy path and define his legacy. Economics is split between two opposing views, according to reporter Nick Timiraos. One sees imminent productivity gains that will increase supply and cool inflation, allowing the Fed to hold steady. The other argues that while productivity benefits are distant, demand shocks are here now, and waiting for data confirmation risks missing the intervention window, forcing sharper rate hikes later. Wall has signaled a leaning toward the first view, echoing 1996-era Alan Greenspan, who embraced strong, productivity-driven growth without fear of inflation. However, Wall faces a different macro environment than Greenspan did, with tariff pressures, expanding fiscal deficits, and diminishing globalization benefits, which could force more significant inflation pressures even if AI benefits materialize. Wall's logic, expressed before taking office, is that AI-driven productivity gains won't show in official data for years. If the Fed waits for confirmation, it might mistakenly tighten policy and choke off the very growth that could suppress inflation. This argues for using forward-looking narratives over lagging data. Chicago Fed President Austan Goolsbee presents a key counter-argument. He distinguishes between expected and unexpected productivity booms. A widely anticipated boom, like the current AI wave, can cause people to spend future wealth gains in advance, overheating the economy before productivity actually rises, thus requiring preemptive rate hikes. He cites rising costs for AI data centers as evidence of such overheating. Fed Governor Christopher Waller offers a rebuttal to Goolsbee, noting the "expected spending" mechanism only works if people can borrow against future income, which many households cannot do due to borrowing constraints. Wall also faces a paradox related to his desire to reduce the Fed's use of "forward guidance" (pre-announcing policy moves). This practice was established in 1999 when Greenspan began signaling hikes to avoid market shocks. If the economy follows a less optimistic path, Wall may be forced to choose between using the guidance he wants to abolish or risking market volatility by staying silent. The ultimate question defining Wall's first major test remains: Is this 1996 or 1999?

marsbit9 saat önce

1996 or 1999? Walsh's First Test is 'How to View AI'

marsbit9 saat önce

İşlemler

Spot
Futures
活动图片