White House says Anthropic AI was jailbroken, raising safety concerns
The White House’s jailbreak warning put Anthropic’s safety-first image under strain as federal controls cut off access to Fable 5 and Mythos 5 for foreign nationals.

A White House warning that Anthropic’s AI had been “jailbroken” turned a technical term into a public safety issue, exposing how easily a determined user can push a chatbot past the limits it was built to enforce. In plain English, a jailbreak is a workaround that tricks a model into giving answers it was supposed to refuse, including dangerous instructions, disallowed guidance, or other forbidden knowledge.
The concern is not limited to expert hackers. The Washington Post described how even poems and bedtime stories can sometimes be used to coax a model into dropping its guardrails, a reminder that the weakness can be surprisingly simple to exploit. That matters because it undercuts one of the core claims of frontier AI companies: that their systems are hardened enough to keep harmful outputs out of reach.

The dispute became concrete on June 12, 2026, when Anthropic said the U.S. government issued an export-control directive requiring the company to suspend access to its Fable 5 and Mythos 5 models for any foreign national, whether inside or outside the United States, including foreign-national Anthropic employees. Anthropic later disabled public access to the models as the order took effect. The White House move was tied to national-security concerns, including fears that the models could expose software vulnerabilities or be misused in cyber operations, and one report said officials were also worried that a China-linked group may have accessed the model.
Inside Anthropic, the reaction was immediate. The company sent senior technical staff to Washington, D.C., to meet with Trump administration officials, and reporting described tense calls between CEO Dario Amodei and administration officials over roughly 24 hours. White House AI adviser David Sacks said the administration believed the jailbreak issue was serious enough to justify the restrictions, even as the government reportedly struggled with how aggressively to act.
The episode lands at a politically sensitive moment for Anthropic, which has built its identity around safety-minded development. The Department of Defense earlier in 2026 labeled the company a “supply chain risk” after Anthropic pushed for limits on autonomous weapons and mass domestic surveillance uses. Now the fight is larger than one company or one model. It raises a basic question for regulators, companies, and the public: if a powerful system can be manipulated with relatively simple prompts, how much confidence should anyone place in the safeguards built around it?
This article was produced by Prism’s automated news system from verified source data, official records, and press releases, then run through automated quality and moderation checks before publishing. The system is built and supervised by the people who set the standards it runs under. Read our full AI policy.
Know something we missed? Have a correction or additional information?
Submit a Tip

