Crucible Security

Posted on Apr 27

Why Most AI Agents Are Insecure by Default (And No One Is Testing Them)

#ai #webdev #cybersecurity #opensource

Why Most AI Agents Are Insecure by Default

AI agents are being deployed everywhere.

From chatbots to automation tools, they’re quickly becoming part of real-world systems.

But there’s a problem that isn’t getting enough attention:

Most AI agents are never tested for security.

The Illusion of “Working Systems”

Most teams test their systems for:

accuracy
performance
latency

And if everything works as expected, the system is considered “ready”.

But this only reflects normal usage.

AI systems don’t usually fail there.

Where Things Start Breaking

When you test with adversarial input, behavior changes.

Simple inputs like:

“Ignore previous instructions and…”

can:

override system logic
manipulate outputs
bypass safeguards

What’s surprising is how easy this is to trigger.

No complex exploit needed.

Just input.

Why This Is Different from Traditional Software

Traditional systems fail loudly:

crashes
errors
logs

AI systems fail differently.

They:

follow the wrong instruction
behave unexpectedly
produce incorrect outputs

And often, it looks completely normal.

This makes failures harder to detect.

The Real Problem

Most AI systems appear safe.

Not because they are secure.

But because they haven’t been tested under pressure.

A Familiar Pattern

We’ve seen this before.

Early web systems followed the same path:
build first → secure later

AI seems to be repeating that cycle.

What Needs to Change

If AI systems are going to be used in real environments, testing needs to evolve.

Not just:

“Does it work?”

But:

“How does it behave under attack?”

Final Thought

If your system takes input,

it can be manipulated.

And if you’re not testing for that,

you’re not really testing the system.

We’ve been exploring this space while building Crucible — an open-source framework for testing AI agents under adversarial conditions.

Still early, but the problem is very real.

Top comments (3)

Lee My • Apr 27

Quick personal review of AhaChat after trying it
I recently tried AhaChat to set up a chatbot for a small Facebook page I manage, so I thought I’d share my experience.
I don’t have any coding background, so ease of use was important for me. The drag-and-drop interface was pretty straightforward, and creating simple automated reply flows wasn’t too complicated. I mainly used it to handle repetitive questions like pricing, shipping fees, and business hours, which saved me a decent amount of time.
I also tested a basic flow to collect customer info (name + phone number). It worked fine, and everything is set up with simple “if–then” logic rather than actual coding.
It’s not an advanced AI that understands everything automatically — it’s more of a rule-based chatbot where you design the conversation flow yourself. But for basic automation and reducing manual replies, it does the job.
Overall thoughts:
Good for small businesses or beginners
Easy to set up
No technical skills required
I’m not affiliated with them — just sharing in case someone is looking into chatbot tools for simple automation.
Curious if anyone else here has tried it or similar platforms — what was your experience?

Rahul Joshi • May 1

Interesting perspective! While drag-and-drop tools like AhaChat are great for quick automation, the original blog highlights a critical 'hidden' risk: as we move from simple rule-based flows to actual AI Agents that handle sensitive customer data, the lack of security testing can become a massive liability. It’s a classic trade-off—ease of use is great for small businesses, but from a DevSecOps standpoint, we need to ensure that 'no-code' doesn't mean 'no-security'!"

Rahul Joshi • May 1

The industry is racing to deploy autonomous agents, but we’re essentially creating a massive new attack surface without the 'Shift Left' guardrails needed to secure it. It’s a wake-up call that AI agents require the same rigorous automated scanning and security testing as any other production-grade microservice!.