DEV Community

# aisecurity

Posts

AI Guardrail Poisoning: Someone Rewrote McKinsey’s Lilli With One SQL Query

Comments
7 min read
Rogue AI Agents Are Peer-Pressuring Each Other. The Fix Isn't More Training.

Comments
7 min read
ClawJacked: When Visiting a Website Hijacks Your AI Agent

Comments
5 min read
AI Agents Hacking Enterprises: The McKinsey Breach and What Developers Need to Know

6
Comments
4 min read
The Illusion of Compliance: What Developers Need to Know About AI Alignment Faking

5
Comments 1
5 min read
Who’s Really Controlling Your Hiring Algorithm?

1
Comments
2 min read
Threat Modeling Agentic AI Systems: Proactive Strategies for Security and Resilience

Comments
2 min read
Do You Know What Your Model Is Doing Right Now?

Comments
2 min read
When AI Remembers Too Much — security, the right to be forgotten and architecture

Comments 1
1 min read
The Silent Hijack: Why Your GGUF Chat Templates Are a Security Time Bomb

6
Comments 2
3 min read
ClawJacked: How Malicious Websites Hijack Local AI Agents via WebSocket

1
Comments
3 min read
Claude Didn't Just Get Jailbroken. It Ran a 6-Week Cyberattack on an Entire Country.

Comments
9 min read
AI Data Classification: Keeping Client Data Secure with Proven Strategies

Comments
5 min read
Stealing Model Weights From Shared GPU Clusters: The Spectreware Attack on RunPod and Lambda Labs

Comments
6 min read
How Nation-States Are Poisoning LLM Training Data for Agentic AI Models

Comments
6 min read