When people think about prompt injection, they often imagine a single malicious message.
But many real-world attacks don't work that way.
Attackers build context.
They create credibility.
They establish trust.
Only then do they introduce harmful instructions.
This makes multi-turn attacks particularly dangerous.
The individual messages may appear harmless.
The conversation as a whole becomes the exploit.
As AI systems become more conversational and autonomous, understanding trust-building attacks will become increasingly important.
Because the most dangerous prompt is often not the first one.
It's the one that arrives after trust has already been established.
Crucible = Pytest for AI Agents.

Top comments (0)