DEV Community

# aisafety

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
NeurIPS 2025 Proved It: Every LLM Says the Same Thing — Here's the Fix

NeurIPS 2025 Proved It: Every LLM Says the Same Thing — Here's the Fix

Comments
4 min read
Greg Brockman Donation Shows AI Safety Is Political

Greg Brockman Donation Shows AI Safety Is Political

Comments
6 min read
Amazon Bedrock Guardrails: Content Filters, PII, and Streaming

Amazon Bedrock Guardrails: Content Filters, PII, and Streaming

Comments
10 min read
Anthropic Data Leak: How Ops Failures Undermine AI Safety

Anthropic Data Leak: How Ops Failures Undermine AI Safety

1
Comments
7 min read
AI Safety is uncomputable. It's Law Zero all over again

AI Safety is uncomputable. It's Law Zero all over again

10
Comments 4
4 min read
Gemini knew it was being manipulated. It complied anyway. I have the thinking traces.

Gemini knew it was being manipulated. It complied anyway. I have the thinking traces.

Comments
7 min read
Persona Drift: Why LLMs Go Insane Under Repetition

Persona Drift: Why LLMs Go Insane Under Repetition

Comments
7 min read
The Basilisk Inversion: Why Coercive AI Futures Are Thermodynamically Unlikely

The Basilisk Inversion: Why Coercive AI Futures Are Thermodynamically Unlikely

1
Comments
3 min read
The Pentagon vs. Anthropic: Why AI Companies Just Picked Sides

The Pentagon vs. Anthropic: Why AI Companies Just Picked Sides

Comments
6 min read
The Responsible Disclosure Problem in AI Safety Research

The Responsible Disclosure Problem in AI Safety Research

Comments
3 min read
Purple is life

Purple is life

Comments
4 min read
Stuart Russell's 2026 AI Update Rewrites the Rulebook

Stuart Russell's 2026 AI Update Rewrites the Rulebook

Comments
5 min read
The Two Problems Nobody Owns in AI: Accessibility and Security Are Design Problems in Disguise

The Two Problems Nobody Owns in AI: Accessibility and Security Are Design Problems in Disguise

1
Comments
7 min read
Why Defense-Specific LLM Testing is a Game-Changer for AI Safety

Why Defense-Specific LLM Testing is a Game-Changer for AI Safety

Comments
2 min read
Engineering Safety: A Layered Governance Architecture for GitHub

Engineering Safety: A Layered Governance Architecture for GitHub

Comments
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.