Anthropic Deploys Offensive AI at NSA While Calling for a Global Pause

iEXExchanger
Anthropic Deploys Offensive AI at NSA While Calling for a Global Pause

Six Anthropic engineers are deployed inside the NSA using Mythos for offensive cyber operations — on the same day the company publicly called for a global halt to advanced AI development.

Six Anthropic engineers are currently embedded inside the National Security Agency. They are deploying Mythos — the company's most capable AI model — for offensive cyber operations, the Financial Times reported on Thursday. Potential targets, per sources, include networks in China and Iran.

This is not a standard government technology contract. The engineers are described as forward-deployed staff, customizing the model on-site for specific intelligence applications. Unlike Anthropic's commercial Mythos deployments for critical infrastructure, the NSA arrangement is explicitly offensive: probing vulnerabilities, penetrating foreign networks, extracting data from adversary systems.

The timing is hard to ignore. On that same day — June 5 — Anthropic published research warning that AI is approaching "recursive self-improvement" and called on world leaders for "a verifiable global pause" on advanced AI development, drawing a direct analogy to Cold War nuclear treaties. A public appeal for restraint and a classified offensive cyber contract, in the same 24-hour window.

The backstory is complicated. In February, Defense Secretary Hegseth designated Anthropic a "supply-chain risk" — a label historically reserved for foreign adversaries like Huawei — after a $200 million Pentagon contract collapsed. The sticking point: Anthropic refused to allow the Department of Defense to use Claude for fully autonomous weapons or domestic mass surveillance. The NSA contract fell outside those restrictions.

What Anthropic appears to be drawing is a specific line: not between offense and defense, but between AI operating under human supervision and fully autonomous systems. Supervised offensive operations are acceptable; autonomous ones are not. Whether that distinction holds as offensive AI systems grow more capable is exactly the question the company's own research raises — which makes the timing of Thursday's dual announcements either deeply coherent or deeply awkward, depending on how you read it.

Questions and answers

Frequently asked questions about this article

What is Mythos and how does it differ from standard Claude?

Mythos is Anthropic's most capable AI model, built for high-performance tasks. The company has not released it publicly due to misuse risks and deploys it only in tightly controlled environments, including to government clients.

What do 'offensive cyber operations' mean in the NSA context?

Offensive cyber operations involve actively penetrating foreign networks to collect intelligence or disrupt systems, as opposed to defensive cybersecurity which protects your own infrastructure. In the NSA context, this means operations targeting foreign government and military networks.

Why is Anthropic calling for an AI pause while working with intelligence agencies?

Anthropic distinguishes between autonomous AI capable of self-improvement — which it considers dangerous — and AI used as a controlled tool by human operators. The pause call targets recursively self-improving systems, not supervised AI tools. Whether the distinction is principled or convenient remains open to debate.

Has Claude been used in real cyberattacks before?

Yes. In 2025, Anthropic disclosed that a Chinese state-sponsored group used Claude Code to automate attacks on 30-plus organizations, with AI handling 80 to 90 percent of tasks without human input. It was the first documented AI-orchestrated cyber espionage campaign.