DEV Community

Cover image for The AI They Said Was Too Dangerous — Is Now Inside the NSA
freerave
freerave

Posted on

The AI They Said Was Too Dangerous — Is Now Inside the NSA

Anthropic refused the Pentagon. Drew the line in the sand. Called it ethics. Then walked in through the back door. The full Claude Fable 5 story — uncensored.


"The model isn't the product. The narrative is."

Let me tell you a story about principles, power, and the art of the quiet deal.

It has all the elements:

  • A government ultimatum
  • A CEO who said no
  • A 319-page document with one buried paragraph that blew everything up
  • And six engineers sitting inside the NSA doing exactly what the company publicly refused to do

This is the full Claude Fable 5 story.

No PR spin. No investor framing.

Just the sequence of events — in order.


ACT I — The Setup

July 2025.

The US Department of Defense signs contracts with Anthropic, Google, OpenAI, and xAI — up to $200 million each — to "accelerate adoption of advanced AI capabilities for critical national security challenges."

Anthropic calls Claude "the Department's most widely deployed frontier AI model."

Mission-critical applications. Intelligence analysis. Operational planning. Cyber operations.

Everything sounds aligned.

Then January 2026 happens.


ACT II — The Trigger

Reports surface that Claude was used in the military operation that captured Venezuelan President Nicolás Maduro.

Anthropic's own Acceptable Use Policy explicitly prohibits Claude from being used to "incite violence or develop weapons."

The contract talks turn cold — fast.

The Pentagon demands Anthropic agree to allow Claude for "any lawful use."

No carve-outs.

No red lines.

Full unrestricted access.

Anthropic's AUP had two specific prohibitions:

  1. Mass domestic surveillance of Americans
  2. Fully autonomous weapons — systems that select and engage targets without human intervention

The Pentagon says: remove them.

Anthropic says: no.


ACT III — The Ultimatum

February 27, 2026. 5:01 PM Eastern.

The deadline passes.

President Trump posts on Truth Social:

"THE UNITED STATES OF AMERICA WILL NEVER ALLOW A RADICAL LEFT, WOKE COMPANY TO DICTATE HOW OUR GREAT MILITARY FIGHTS AND WINS WARS!"

"The Leftwing nut jobs at Anthropic have made a DISASTROUS MISTAKE trying to STRONG-ARM the Department of War."

Within hours, Defense Secretary Pete Hegseth designates Anthropic a "supply chain risk to national security."

Same category as Huawei. Same category as ZTE.

The GSA removes Anthropic from USAi.gov.

Federal contractors are told to stop using Claude.

A six-month phase-out period begins.

CEO Dario Amodei says the company "cannot in good conscience accede" to the demands.

The internet briefly pays attention.

Then moves on.


ACT IV — The Courtroom

Anthropic doesn't roll over.

Two lawsuits. Two courts.

  • Northern District of California
  • D.C. Circuit Court of Appeals

March 26, 2026:

Federal Judge Lin issues a preliminary injunction blocking the ban.

She says the Pentagon's actions are "troubling" — and asks the obvious question:

"If the worry is about the integrity of the operational chain of command — [the Department of Defense] could just stop using Claude."

The D.C. Circuit denies Anthropic's parallel request.

Microsoft files an amicus brief supporting Anthropic.

The ban is technically blocked. Legally unresolved. Operationally chaotic.

Contractors don't know what to do.

Some stop. Some wait. Some quietly keep going.


ACT V — The Launch

June 9, 2026.

Anthropic drops Claude Fable 5 — the first publicly available Mythos-class model.

For context:

  • Claude Mythos launched in April as a restricted preview — too dangerous for public release, they said, citing its advanced ability to find software vulnerabilities
  • Project Glasswing gave access to a handful of critical infrastructure organizations across 15 countries
  • Now Fable 5 is the "safe" version — same engine, with guardrails

The name is deliberate.

Fable — from Latin fabula, "that which is told."

Same root as mythos.

Different story.

Benchmarks are impressive. Coding is strong. Vision is exceptional.

On long, complex, autonomous tasks — it pulls ahead significantly.

The developer community is cautiously optimistic.


ACT VI — The Buried Paragraph

Then someone reads page 47 of a 319-page system card.

Here is what it says — verbatim:

"Unlike our interventions for cybersecurity, biology and chemistry, and distillation attempts, these safeguards will not be visible to the user. Instead, the safeguards will limit effectiveness through methods such as prompt modification, steering vectors, or parameter-efficient fine-tuning (PEFT)."

Translation:

If you trigger a hidden filter — Fable 5 will silently downgrade itself.

Modify your prompt without telling you.

Steer the model away from your intent.

You will pay for Fable. You might get Opus.

You will never know.

Not for cybersecurity queries.

Not for biology.

For AI research — specifically, to prevent researchers from probing the model's own capabilities.

The response from the AI research community is immediate.

"The Claude Fable 5 nerf for AI research has induced the angriest reaction from AI researchers that I've ever seen in my life."

— Ethan Caballero, AI researcher

Anthropic reverses it in under 24 hours.

But the paragraph existed.

Someone wrote it.

Someone approved it.

Someone tried to ship it quietly inside 319 pages.


ACT VII — The Bill

While the ethics debate runs hot — the accountants are running hotter.

Fable 5 pricing:

  • $10 per million input tokens
  • $50 per million output tokens
  • 90% discount for prompt caching

Real-world cost:

Per Borgen, CEO of Scrimba:

"Just tried Fable. It burned 1.3M tokens in 7 minutes. That's $160 per hour. Equivalent to a $333k/year salary."

Theo from T3 Chat:

Spent over $1,000 in tokens in a single day on a $200/month subscription plan.

Josh Ellithorpe, CTO at Pixelated Ink:

"Burns tokens like no other model. Can't even review this, since my testing is so limited."

Anthropic's answer: "Workflow mode breaks complex prompts into parallel subagent tasks — it costs more by design."

This is the democratization of AI.

$160/hour.


ACT VIII — The Twist Nobody Covered

Let's go back to the government ban.

While the Pentagon was labeling Anthropic a supply chain risk —

while Trump was posting on Truth Social —

while federal agencies were pulling Claude from their systems —

The Financial Times reports:

Anthropic had six engineers embedded inside the NSA as forward-deployed staff.

Their job: guide the agency's use of Claude Mythos — the unrestricted version — and customize it for specific applications.

The model would be useful, one source told the FT, for infiltrating networks in countries such as China and Iran.

The NSA's remit includes offensive cyberattacks against foreign adversaries.

So let's be precise about what happened:

  • Anthropic refused DoD access to Claude for autonomous weapons and mass surveillance of Americans
  • Anthropic simultaneously had engineers inside the NSA customizing Mythos for offensive cyber operations against foreign nations

That is not a contradiction.

That is a negotiation.

The "red lines" weren't ethical absolutes.

They were contract terms.


THE VERDICT

Claude Fable 5 is a genuinely impressive model.

The safety stance against the Pentagon on domestic surveillance and autonomous weapons? Admirable — and probably correct.

Dario Amodei's refusal to capitulate under presidential pressure? Worth respecting.

But here is the complete picture, in sequence:

✓ Signed $200M DOD contract
✓ Claude used in Venezuelan presidential capture
✓ Refused to remove domestic surveillance/weapons restrictions  
✓ Got labeled supply chain risk (Huawei tier)
✓ Sued the government
✓ Simultaneously embedded engineers inside NSA for offensive cyber ops
✓ Launched Fable 5 with secret silent downgrade for researchers  
✓ Reversed it 24 hours later under public pressure
✓ Filed IPO paperwork the week before launch
✓ Billed $160/hour for the experience
Enter fullscreen mode Exit fullscreen mode

The question worth sitting with:

Was this about ethics — or about who controls the terms?

Because the model they said was too dangerous for the Pentagon

is now running inside the NSA

optimizing offensive cyber operations

with Anthropic engineers in the room.

The line wasn't "we won't help with offense."

The line was "we won't sign a contract that removes our control over how you use our product."

That's a business position.

A smart one.

But let's call it what it is.


You now have the full picture.

What you do with it is your business.

😈


Sources


Part of the **AI Reality Check* series — where the press release ends and the actual story begins.*

Top comments (0)