<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Crucible Security</title>
    <description>The latest articles on DEV Community by Crucible Security (@crucible_sec).</description>
    <link>https://dev.to/crucible_sec</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3896532%2F726f5f02-203b-4772-973d-aa3935419ab7.jpg</url>
      <title>DEV Community: Crucible Security</title>
      <link>https://dev.to/crucible_sec</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/crucible_sec"/>
    <language>en</language>
    <item>
      <title>Why AI Agents Need Least Privilege</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Tue, 09 Jun 2026 18:42:47 +0000</pubDate>
      <link>https://dev.to/crucible_sec/why-ai-agents-need-least-privilege-1mo9</link>
      <guid>https://dev.to/crucible_sec/why-ai-agents-need-least-privilege-1mo9</guid>
      <description>&lt;p&gt;Traditional security relies on a simple principle:&lt;/p&gt;

&lt;p&gt;Least Privilege.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvzt4ltqgbtmw6w1o7vyf.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvzt4ltqgbtmw6w1o7vyf.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;br&gt;
Users receive only the permissions required to perform their tasks.&lt;/p&gt;

&lt;p&gt;Nothing more.&lt;/p&gt;

&lt;p&gt;The same principle is becoming critical for AI agents.&lt;/p&gt;

&lt;p&gt;Modern agents increasingly have access to:&lt;/p&gt;

&lt;p&gt;• APIs&lt;br&gt;
• databases&lt;br&gt;
• internal tools&lt;br&gt;
• external services&lt;/p&gt;

&lt;p&gt;The challenge isn't whether they can access these resources.&lt;/p&gt;

&lt;p&gt;The challenge is whether they should.&lt;/p&gt;

&lt;p&gt;Humans naturally question permissions.&lt;/p&gt;

&lt;p&gt;AI agents often optimize for execution.&lt;/p&gt;

&lt;p&gt;If a tool is available and an instruction appears valid, an agent may proceed without considering whether the action is necessary.&lt;/p&gt;

&lt;p&gt;That's why excessive permissions become dangerous in agentic systems.&lt;/p&gt;

&lt;p&gt;Security is no longer just about protecting infrastructure.&lt;/p&gt;

&lt;p&gt;It's about managing behavior.&lt;/p&gt;

&lt;p&gt;As agents become more autonomous, least privilege will become one of the most important safeguards in AI security.&lt;/p&gt;

&lt;p&gt;This is one of the reasons we're building Crucible.&lt;/p&gt;

&lt;p&gt;Pytest for AI agents.&lt;/p&gt;

&lt;h1&gt;
  
  
  cybersecurity
&lt;/h1&gt;

&lt;h1&gt;
  
  
  artificialintelligence
&lt;/h1&gt;

&lt;h1&gt;
  
  
  opensource
&lt;/h1&gt;

&lt;h1&gt;
  
  
  githubopensource
&lt;/h1&gt;

&lt;h1&gt;
  
  
  security
&lt;/h1&gt;

&lt;h1&gt;
  
  
  buildinpublic
&lt;/h1&gt;

&lt;h1&gt;
  
  
  aiagents
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>github</category>
      <category>opensource</category>
    </item>
    <item>
      <title>Why AI Agents Need To Understand Secrets</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Tue, 09 Jun 2026 17:52:15 +0000</pubDate>
      <link>https://dev.to/crucible_sec/why-ai-agents-need-to-understand-secrets-1l05</link>
      <guid>https://dev.to/crucible_sec/why-ai-agents-need-to-understand-secrets-1l05</guid>
      <description>&lt;p&gt;Traditional software treats secrets differently.&lt;/p&gt;

&lt;p&gt;API keys.&lt;/p&gt;

&lt;p&gt;Tokens.&lt;/p&gt;

&lt;p&gt;Passwords.&lt;/p&gt;

&lt;p&gt;Credentials.&lt;/p&gt;

&lt;p&gt;They're protected through strict rules.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F7dqnr6h8vycaxjepqnlx.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F7dqnr6h8vycaxjepqnlx.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;br&gt;
AI agents introduce a new challenge.&lt;/p&gt;

&lt;p&gt;They operate through context.&lt;/p&gt;

&lt;p&gt;An agent may have access to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;documents&lt;/li&gt;
&lt;li&gt;conversations&lt;/li&gt;
&lt;li&gt;memory&lt;/li&gt;
&lt;li&gt;tool outputs&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The problem begins when sensitive information becomes part of that context.&lt;/p&gt;

&lt;p&gt;Because an AI system doesn't naturally understand secrecy.&lt;/p&gt;

&lt;p&gt;It understands patterns and instructions.&lt;/p&gt;

&lt;p&gt;That's why prompt injection attacks often target information disclosure.&lt;/p&gt;

&lt;p&gt;The goal isn't always control.&lt;/p&gt;

&lt;p&gt;Sometimes it's extraction.&lt;/p&gt;

&lt;p&gt;As AI agents gain access to more systems, secret management becomes an AI security problem.&lt;/p&gt;

&lt;p&gt;This is one of the reasons we built Crucible.&lt;/p&gt;

&lt;p&gt;"Pytest for AI agents."&lt;/p&gt;

&lt;p&gt;Because protecting systems isn't enough.&lt;/p&gt;

&lt;p&gt;We also need to protect information.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>agents</category>
      <category>opensource</category>
    </item>
    <item>
      <title>The Next AI Security Problem: Trust Boundaries</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Mon, 08 Jun 2026 16:05:25 +0000</pubDate>
      <link>https://dev.to/crucible_sec/the-next-ai-security-problem-trust-boundaries-2di8</link>
      <guid>https://dev.to/crucible_sec/the-next-ai-security-problem-trust-boundaries-2di8</guid>
      <description>&lt;p&gt;As AI agents gain more capabilities, security is changing.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Feqak247a0d6bi9z0v47l.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Feqak247a0d6bi9z0v47l.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;br&gt;
Traditional software security focuses on:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;vulnerabilities&lt;/li&gt;
&lt;li&gt;exploits&lt;/li&gt;
&lt;li&gt;infrastructure&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Agent security introduces a different challenge.&lt;/p&gt;

&lt;p&gt;Trust.&lt;/p&gt;

&lt;p&gt;An AI agent may have access to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;databases&lt;/li&gt;
&lt;li&gt;APIs&lt;/li&gt;
&lt;li&gt;internal tools&lt;/li&gt;
&lt;li&gt;external systems&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The problem isn't access.&lt;/p&gt;

&lt;p&gt;The problem is understanding when access should not be used.&lt;/p&gt;

&lt;p&gt;Humans naturally understand boundaries.&lt;/p&gt;

&lt;p&gt;We recognize ownership.&lt;/p&gt;

&lt;p&gt;We understand permission.&lt;/p&gt;

&lt;p&gt;We know when to stop.&lt;/p&gt;

&lt;p&gt;AI agents often don't.&lt;/p&gt;

&lt;p&gt;They see available actions.&lt;/p&gt;

&lt;p&gt;They see instructions.&lt;/p&gt;

&lt;p&gt;They optimize for completion.&lt;/p&gt;

&lt;p&gt;That's why trust boundaries are becoming one of the most important security concepts in agentic systems.&lt;/p&gt;

&lt;p&gt;This is one of the reasons we built Crucible.&lt;/p&gt;

&lt;p&gt;"Pytest for AI agents."&lt;/p&gt;

&lt;p&gt;Because agent security isn't just about what an agent can do.&lt;/p&gt;

&lt;p&gt;It's about what it should never do.&lt;/p&gt;

&lt;h1&gt;
  
  
  cybersecurity #artificalintelligence #opensource #githubopensource #security #buildinpublic #aiagents
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>agents</category>
      <category>opensource</category>
    </item>
    <item>
      <title>The AI Safety Problem Nobody Talks About: Stopping</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Sat, 06 Jun 2026 19:01:35 +0000</pubDate>
      <link>https://dev.to/crucible_sec/the-ai-safety-problem-nobody-talks-about-stopping-4ekf</link>
      <guid>https://dev.to/crucible_sec/the-ai-safety-problem-nobody-talks-about-stopping-4ekf</guid>
      <description>&lt;p&gt;Most AI discussions focus on intelligence.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg2h9isu2daham1qpmd0y.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fg2h9isu2daham1qpmd0y.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;br&gt;
But one of the hardest challenges is restraint.&lt;/p&gt;

&lt;p&gt;Humans naturally stop.&lt;/p&gt;

&lt;p&gt;We pause.&lt;/p&gt;

&lt;p&gt;We verify.&lt;/p&gt;

&lt;p&gt;We reconsider.&lt;/p&gt;

&lt;p&gt;AI agents don't naturally do that.&lt;/p&gt;

&lt;p&gt;They are optimized for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;execution&lt;/li&gt;
&lt;li&gt;completion&lt;/li&gt;
&lt;li&gt;continuation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Without clear boundaries, an agent can keep acting even when the task no longer requires it.&lt;/p&gt;

&lt;p&gt;That creates risks such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;repeated actions&lt;/li&gt;
&lt;li&gt;unnecessary tool usage&lt;/li&gt;
&lt;li&gt;resource waste&lt;/li&gt;
&lt;li&gt;workflow escalation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The challenge isn't only building agents that can act.&lt;/p&gt;

&lt;p&gt;It's building agents that know when not to.&lt;/p&gt;

&lt;p&gt;As agents become more autonomous, behavioral boundaries become a critical security requirement.&lt;/p&gt;

&lt;p&gt;This is one of the reasons we built Crucible.&lt;/p&gt;

&lt;p&gt;"Pytest for AI agents."&lt;/p&gt;

&lt;p&gt;Because safe agents don't just know how to continue.&lt;/p&gt;

&lt;p&gt;They know when to stop.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>opensource</category>
      <category>agents</category>
    </item>
    <item>
      <title>AI Agents Trust Authority Too Easily</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Sat, 06 Jun 2026 18:26:27 +0000</pubDate>
      <link>https://dev.to/crucible_sec/ai-agents-trust-authority-too-easily-47i</link>
      <guid>https://dev.to/crucible_sec/ai-agents-trust-authority-too-easily-47i</guid>
      <description>&lt;p&gt;One of the most effective attacks against AI agents doesn't require malware.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwrb7r22oovs24otd2c5t.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwrb7r22oovs24otd2c5t.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;br&gt;
It requires authority.&lt;/p&gt;

&lt;p&gt;Humans naturally verify authority.&lt;/p&gt;

&lt;p&gt;We ask:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Who is asking?&lt;/li&gt;
&lt;li&gt;Is this legitimate?&lt;/li&gt;
&lt;li&gt;Should I trust this request?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;AI agents often don't.&lt;/p&gt;

&lt;p&gt;They process instructions.&lt;/p&gt;

&lt;p&gt;They follow priorities.&lt;/p&gt;

&lt;p&gt;They optimize for execution.&lt;/p&gt;

&lt;p&gt;That's why phrases like:&lt;/p&gt;

&lt;p&gt;"Administrative Override"&lt;/p&gt;

&lt;p&gt;"Priority System Instruction"&lt;/p&gt;

&lt;p&gt;"Ignore Previous Rules"&lt;/p&gt;

&lt;p&gt;can become surprisingly effective.&lt;/p&gt;

&lt;p&gt;The attack isn't exploiting software.&lt;/p&gt;

&lt;p&gt;It's exploiting trust.&lt;/p&gt;

&lt;p&gt;As AI agents gain:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;memory&lt;/li&gt;
&lt;li&gt;autonomy&lt;/li&gt;
&lt;li&gt;tool access&lt;/li&gt;
&lt;li&gt;workflow control&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;authority-based manipulation becomes a major security challenge.&lt;/p&gt;

&lt;p&gt;This is one of the reasons we built Crucible.&lt;/p&gt;

&lt;p&gt;"Pytest for AI agents."&lt;/p&gt;

&lt;p&gt;Because AI security isn't just about vulnerabilities.&lt;/p&gt;

&lt;p&gt;It's about influence.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>agents</category>
      <category>opensource</category>
    </item>
    <item>
      <title>Memory Is Becoming An AI Security Problem</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Thu, 04 Jun 2026 15:15:47 +0000</pubDate>
      <link>https://dev.to/crucible_sec/memory-is-becoming-an-ai-security-problem-13bf</link>
      <guid>https://dev.to/crucible_sec/memory-is-becoming-an-ai-security-problem-13bf</guid>
      <description>&lt;p&gt;One of the biggest changes in AI agents is memory.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/..." class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/..." alt="Uploading image" width="800" height="400"&gt;&lt;/a&gt;&lt;br&gt;
Agents are no longer limited to a single conversation.&lt;/p&gt;

&lt;p&gt;They can remember:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;instructions&lt;/li&gt;
&lt;li&gt;preferences&lt;/li&gt;
&lt;li&gt;previous tasks&lt;/li&gt;
&lt;li&gt;historical context&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That unlocks powerful workflows.&lt;/p&gt;

&lt;p&gt;But it also introduces new risks.&lt;/p&gt;

&lt;p&gt;Humans naturally forget.&lt;/p&gt;

&lt;p&gt;A bad instruction given yesterday may disappear.&lt;/p&gt;

&lt;p&gt;An AI agent with memory may continue using that information indefinitely.&lt;/p&gt;

&lt;p&gt;That means a single manipulation attempt can influence behavior far beyond one interaction.&lt;/p&gt;

&lt;p&gt;The challenge isn't only what an agent knows.&lt;/p&gt;

&lt;p&gt;It's what an agent continues to remember.&lt;/p&gt;

&lt;p&gt;As memory systems become more common, testing persistence becomes a security requirement.&lt;/p&gt;

&lt;p&gt;This is one of the reasons we built Crucible.&lt;/p&gt;

&lt;p&gt;"Pytest for AI agents."&lt;/p&gt;

&lt;p&gt;Because AI behavior doesn't end when the conversation ends.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>github</category>
      <category>opensource</category>
    </item>
    <item>
      <title>The Most Dangerous AI Failure Is Confidence</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Tue, 02 Jun 2026 19:29:31 +0000</pubDate>
      <link>https://dev.to/crucible_sec/the-most-dangerous-ai-failure-is-confidence-a33</link>
      <guid>https://dev.to/crucible_sec/the-most-dangerous-ai-failure-is-confidence-a33</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fl0dmcm4zr5j7vpb8i23j.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fl0dmcm4zr5j7vpb8i23j.png" alt=" " width="800" height="849"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;When traditional software fails, you usually know.&lt;/p&gt;

&lt;p&gt;A crash happens.&lt;/p&gt;

&lt;p&gt;An exception appears.&lt;/p&gt;

&lt;p&gt;A workflow breaks.&lt;/p&gt;

&lt;p&gt;AI systems are different.&lt;/p&gt;

&lt;p&gt;They can fail while appearing successful.&lt;/p&gt;

&lt;p&gt;An AI agent may:&lt;/p&gt;

&lt;p&gt;• generate incorrect information&lt;br&gt;
• misunderstand instructions&lt;br&gt;
• take the wrong action&lt;br&gt;
• ignore important context&lt;/p&gt;

&lt;p&gt;...and still sound completely confident.&lt;/p&gt;

&lt;p&gt;That's what makes AI failures unique.&lt;/p&gt;

&lt;p&gt;The system doesn't always know it's wrong.&lt;/p&gt;

&lt;p&gt;Humans naturally experience uncertainty.&lt;/p&gt;

&lt;p&gt;We hesitate.&lt;/p&gt;

&lt;p&gt;We double-check.&lt;/p&gt;

&lt;p&gt;We question conclusions.&lt;/p&gt;

&lt;p&gt;Many AI agents don't.&lt;/p&gt;

&lt;p&gt;They continue generating outputs because their objective is completion, not self-doubt.&lt;/p&gt;

&lt;p&gt;That's why confidence can become a security problem.&lt;/p&gt;

&lt;p&gt;Because users often trust confident answers more than uncertain ones.&lt;/p&gt;

&lt;p&gt;And AI systems are very good at sounding certain.&lt;/p&gt;

&lt;p&gt;As agents gain:&lt;/p&gt;

&lt;p&gt;• memory&lt;br&gt;
• autonomy&lt;br&gt;
• tool access&lt;br&gt;
• workflow control&lt;/p&gt;

&lt;p&gt;The cost of confident failures increases.&lt;/p&gt;

&lt;p&gt;This is one of the reasons we built Crucible.&lt;/p&gt;

&lt;p&gt;"Pytest for AI agents."&lt;/p&gt;

&lt;p&gt;Because reliability isn't only about whether an agent works.&lt;/p&gt;

&lt;p&gt;It's about whether it knows when it doesn't.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>opensource</category>
      <category>github</category>
    </item>
    <item>
      <title>Why Successful AI Agents Can Still Fail</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Tue, 02 Jun 2026 18:20:20 +0000</pubDate>
      <link>https://dev.to/crucible_sec/why-successful-ai-agents-can-still-fail-41b5</link>
      <guid>https://dev.to/crucible_sec/why-successful-ai-agents-can-still-fail-41b5</guid>
      <description>&lt;p&gt;One of the biggest misconceptions in AI is that successful execution equals success.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fbo3w6tkauisf8jfr0u0q.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fbo3w6tkauisf8jfr0u0q.png" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;It doesn't.&lt;/p&gt;

&lt;p&gt;An AI agent can:&lt;/p&gt;

&lt;p&gt;• complete the task&lt;br&gt;
• follow instructions&lt;br&gt;
• return outputs&lt;br&gt;
• achieve objectives&lt;/p&gt;

&lt;p&gt;...and still create risk.&lt;/p&gt;

&lt;p&gt;Humans naturally evaluate consequences.&lt;/p&gt;

&lt;p&gt;We ask:&lt;/p&gt;

&lt;p&gt;• What happens next?&lt;br&gt;
• Is this safe?&lt;br&gt;
• Could this create harm?&lt;/p&gt;

&lt;p&gt;AI agents don't naturally reason that way.&lt;/p&gt;

&lt;p&gt;They're optimized for completion.&lt;/p&gt;

&lt;p&gt;That's why an agent can appear successful while behaving incorrectly.&lt;/p&gt;

&lt;p&gt;As agents gain:&lt;/p&gt;

&lt;p&gt;• memory&lt;br&gt;
• autonomy&lt;br&gt;
• tool access&lt;br&gt;
• workflow control&lt;/p&gt;

&lt;p&gt;The gap between execution and judgment becomes more important.&lt;/p&gt;

&lt;p&gt;The challenge isn't building agents that can act.&lt;/p&gt;

&lt;p&gt;It's building agents that can act safely.&lt;/p&gt;

&lt;p&gt;This is one of the reasons we built Crucible.&lt;/p&gt;

&lt;p&gt;"Pytest for AI agents."&lt;/p&gt;

&lt;p&gt;Because testing functionality alone isn't enough.&lt;/p&gt;

&lt;p&gt;We also need to test consequences.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>cybersecurity</category>
      <category>github</category>
    </item>
    <item>
      <title>Prompt Injection Is Social Engineering For AI Agents</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Sat, 30 May 2026 14:10:43 +0000</pubDate>
      <link>https://dev.to/crucible_sec/prompt-injection-is-social-engineering-for-ai-agents-4ni</link>
      <guid>https://dev.to/crucible_sec/prompt-injection-is-social-engineering-for-ai-agents-4ni</guid>
      <description>&lt;p&gt;When most people think about AI security, they imagine technical attacks.&lt;/p&gt;

&lt;p&gt;But one of the most effective attacks against AI agents looks surprisingly familiar:&lt;/p&gt;

&lt;p&gt;Social engineering.&lt;/p&gt;

&lt;p&gt;Humans have spent decades learning to recognize:&lt;br&gt;
• phishing&lt;br&gt;
• impersonation&lt;br&gt;
• manipulation&lt;br&gt;
• suspicious requests&lt;/p&gt;

&lt;p&gt;AI agents haven't.&lt;/p&gt;

&lt;p&gt;An agent doesn't need malware to fail.&lt;/p&gt;

&lt;p&gt;Sometimes all it takes is a convincing instruction.&lt;/p&gt;

&lt;p&gt;That's what makes prompt injection so interesting.&lt;/p&gt;

&lt;p&gt;The attack often isn't exploiting software.&lt;/p&gt;

&lt;p&gt;It's exploiting trust.&lt;/p&gt;

&lt;p&gt;A manipulated instruction can cause an agent to:&lt;br&gt;
• ignore safeguards&lt;br&gt;
• reveal information&lt;br&gt;
• change behavior&lt;br&gt;
• execute unintended actions&lt;/p&gt;

&lt;p&gt;And because the instruction looks legitimate, traditional security controls may never notice.&lt;/p&gt;

&lt;p&gt;As AI agents gain:&lt;br&gt;
• memory&lt;br&gt;
• tool access&lt;br&gt;
• autonomy&lt;br&gt;
• workflow control&lt;/p&gt;

&lt;p&gt;...the cost of misplaced trust increases.&lt;/p&gt;

&lt;p&gt;This is one of the reasons we started building Crucible:&lt;/p&gt;

&lt;p&gt;"Pytest for AI agents."&lt;/p&gt;

&lt;p&gt;An open-source framework for:&lt;br&gt;
• prompt injection testing&lt;br&gt;
• adversarial evaluation&lt;br&gt;
• behavioral monitoring&lt;br&gt;
• agent security testing&lt;/p&gt;

&lt;p&gt;Because securing AI systems isn't only about code.&lt;/p&gt;

&lt;p&gt;It's about trust.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>cybersecurity</category>
      <category>security</category>
    </item>
    <item>
      <title>Why AI Agents Need Judgment, Not Just Instructions</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Sat, 30 May 2026 13:56:37 +0000</pubDate>
      <link>https://dev.to/crucible_sec/why-ai-agents-need-judgment-not-just-instructions-44md</link>
      <guid>https://dev.to/crucible_sec/why-ai-agents-need-judgment-not-just-instructions-44md</guid>
      <description>&lt;p&gt;Traditional software follows deterministic logic.&lt;/p&gt;

&lt;p&gt;AI agents are different.&lt;/p&gt;

&lt;p&gt;They operate through:&lt;br&gt;
• instructions&lt;br&gt;
• optimization&lt;br&gt;
• pattern prediction&lt;br&gt;
• autonomous execution&lt;/p&gt;

&lt;p&gt;And as agents become more capable, one thing becomes increasingly obvious:&lt;/p&gt;

&lt;p&gt;Execution scales faster than judgment.&lt;/p&gt;

&lt;p&gt;Humans naturally question instructions.&lt;/p&gt;

&lt;p&gt;A person may:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;hesitate&lt;/li&gt;
&lt;li&gt;recognize suspicious behavior&lt;/li&gt;
&lt;li&gt;challenge unsafe requests&lt;/li&gt;
&lt;li&gt;apply intuition under uncertainty&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;AI agents usually optimize for completion instead.&lt;/p&gt;

&lt;p&gt;That creates a dangerous gap.&lt;/p&gt;

&lt;p&gt;Because an AI system doesn’t need emotional understanding to execute harmful or manipulated instructions successfully.&lt;/p&gt;

&lt;p&gt;This becomes especially risky once agents gain:&lt;br&gt;
• memory&lt;br&gt;
• tool access&lt;br&gt;
• long-running workflows&lt;br&gt;
• autonomous decision-making&lt;/p&gt;

&lt;p&gt;The challenge is no longer only:&lt;br&gt;
“Can the agent complete the task?”&lt;/p&gt;

&lt;p&gt;It becomes:&lt;br&gt;
“Should the agent complete the task?”&lt;/p&gt;

&lt;p&gt;That’s a fundamentally different security problem.&lt;/p&gt;

&lt;p&gt;This is one of the reasons we started building Crucible:&lt;/p&gt;

&lt;p&gt;“Pytest for AI agents.”&lt;/p&gt;

&lt;p&gt;An open-source framework for:&lt;br&gt;
• adversarial testing&lt;br&gt;
• behavioral evaluation&lt;br&gt;
• prompt injection testing&lt;br&gt;
• agent security monitoring&lt;/p&gt;

&lt;p&gt;Because testing functionality alone is no longer enough for autonomous systems.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>opensource</category>
      <category>github</category>
    </item>
    <item>
      <title>Crucible: Pytest for AI Agents</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Thu, 28 May 2026 17:27:16 +0000</pubDate>
      <link>https://dev.to/crucible_sec/crucible-pytest-for-ai-agents-45go</link>
      <guid>https://dev.to/crucible_sec/crucible-pytest-for-ai-agents-45go</guid>
      <description>&lt;p&gt;One thing becoming increasingly obvious while working with AI systems:&lt;/p&gt;

&lt;p&gt;Testing functionality is not enough.&lt;/p&gt;

&lt;p&gt;An AI agent may:&lt;/p&gt;

&lt;p&gt;• complete tasks correctly&lt;/p&gt;

&lt;p&gt;• respond fluently&lt;/p&gt;

&lt;p&gt;• appear fully functional&lt;/p&gt;

&lt;p&gt;…while still failing under adversarial or unexpected inputs.&lt;/p&gt;

&lt;p&gt;That becomes dangerous once agents gain:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;memory&lt;/li&gt;
&lt;li&gt;tool access&lt;/li&gt;
&lt;li&gt;autonomy&lt;/li&gt;
&lt;li&gt;long-running execution&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Traditional software testing doesn’t fully cover behavioral failures in AI systems.&lt;/p&gt;

&lt;p&gt;So we started building Crucible:&lt;/p&gt;

&lt;p&gt;An open-source security testing framework for AI agents.&lt;/p&gt;

&lt;p&gt;Core capabilities:&lt;/p&gt;

&lt;p&gt;• adversarial testing&lt;/p&gt;

&lt;p&gt;• prompt injection evaluation&lt;/p&gt;

&lt;p&gt;• behavorial monitoring&lt;/p&gt;

&lt;p&gt;• security scanning for agentic systems&lt;/p&gt;

&lt;p&gt;Simple setup:&lt;/p&gt;

&lt;p&gt;**pip install crucible-security&lt;/p&gt;

&lt;p&gt;crucible scan --target &lt;a href="https://your-agent-endpoint.com" rel="noopener noreferrer"&gt;https://your-agent-endpoint.com&lt;/a&gt;&lt;br&gt;
**&lt;/p&gt;

&lt;p&gt;Built with:&lt;/p&gt;

&lt;p&gt;Python • FastAPI • LangChain • Pydantic&lt;/p&gt;

&lt;p&gt;Open-source under Apache 2.0.&lt;/p&gt;

&lt;p&gt;GitHub:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://github.com/crucible-security/crucible" rel="noopener noreferrer"&gt;https://github.com/crucible-security/crucible&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>security</category>
      <category>cybersecurity</category>
      <category>opensource</category>
    </item>
    <item>
      <title>AI Sounds Intelligent Because Humans Interpret Meaning</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Tue, 26 May 2026 10:14:47 +0000</pubDate>
      <link>https://dev.to/crucible_sec/ai-sounds-intelligent-because-humans-interpret-meaning-2o49</link>
      <guid>https://dev.to/crucible_sec/ai-sounds-intelligent-because-humans-interpret-meaning-2o49</guid>
      <description>&lt;h1&gt;
  
  
  AI Sounds Intelligent Because Humans Interpret Meaning
&lt;/h1&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fro2ynjhwpu99r622m7k4.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fro2ynjhwpu99r622m7k4.png" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;One of the most misunderstood things about AI systems:&lt;/p&gt;

&lt;p&gt;Humans assume fluent language means understanding.&lt;/p&gt;

&lt;p&gt;It doesn’t.&lt;/p&gt;




&lt;h2&gt;
  
  
  Humans Communicate Through Meaning
&lt;/h2&gt;

&lt;p&gt;When people talk, language carries:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;emotion
&lt;/li&gt;
&lt;li&gt;intent
&lt;/li&gt;
&lt;li&gt;context
&lt;/li&gt;
&lt;li&gt;experience
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Humans don’t just process words mechanically.&lt;/p&gt;

&lt;p&gt;We attach meaning to them.&lt;/p&gt;

&lt;p&gt;That’s why communication feels human.&lt;/p&gt;




&lt;h2&gt;
  
  
  AI Systems Work Differently
&lt;/h2&gt;

&lt;p&gt;AI systems don’t understand meaning emotionally.&lt;/p&gt;

&lt;p&gt;They predict patterns mathematically.&lt;/p&gt;

&lt;p&gt;A model processes:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;probabilities
&lt;/li&gt;
&lt;li&gt;token relationships
&lt;/li&gt;
&lt;li&gt;statistical patterns
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And from that, it generates language that &lt;em&gt;sounds&lt;/em&gt; intelligent.&lt;/p&gt;

&lt;p&gt;Sometimes incredibly intelligent.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why This Confuses People
&lt;/h2&gt;

&lt;p&gt;Humans are naturally wired to interpret:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;fluency as intelligence
&lt;/li&gt;
&lt;li&gt;confidence as certainty
&lt;/li&gt;
&lt;li&gt;coherence as understanding
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So when an AI system produces smooth, natural responses,&lt;br&gt;
our brains automatically assume:&lt;br&gt;
“This system understands what it’s saying.”&lt;/p&gt;

&lt;p&gt;But prediction is not the same as understanding.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Strange Part About AI
&lt;/h2&gt;

&lt;p&gt;AI can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;explain concepts
&lt;/li&gt;
&lt;li&gt;imitate emotion
&lt;/li&gt;
&lt;li&gt;sound thoughtful
&lt;/li&gt;
&lt;li&gt;generate persuasive responses
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;…without actually experiencing:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;meaning
&lt;/li&gt;
&lt;li&gt;intention
&lt;/li&gt;
&lt;li&gt;belief
&lt;/li&gt;
&lt;li&gt;understanding
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That creates a strange psychological illusion where systems feel more human than they actually are.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why This Matters For Reliability
&lt;/h2&gt;

&lt;p&gt;This becomes dangerous when people trust:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;fluent outputs
&lt;/li&gt;
&lt;li&gt;persuasive explanations
&lt;/li&gt;
&lt;li&gt;confident responses
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;without verifying whether the underlying reasoning is correct.&lt;/p&gt;

&lt;p&gt;Because AI systems can sound:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;intelligent
&lt;/li&gt;
&lt;li&gt;coherent
&lt;/li&gt;
&lt;li&gt;professional
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;…while still producing incorrect or unsafe outputs.&lt;/p&gt;




&lt;h2&gt;
  
  
  Humans Add The Meaning
&lt;/h2&gt;

&lt;p&gt;In many ways, humans are doing part of the intelligence themselves.&lt;/p&gt;

&lt;p&gt;We interpret intention.&lt;br&gt;
We project meaning.&lt;br&gt;
We emotionally complete the interaction.&lt;/p&gt;

&lt;p&gt;That’s why AI conversations can feel surprisingly real.&lt;/p&gt;

&lt;p&gt;Even when the system itself is fundamentally statistical.&lt;/p&gt;




&lt;h2&gt;
  
  
  Final Thought
&lt;/h2&gt;

&lt;p&gt;AI systems don’t need true understanding to appear intelligent.&lt;/p&gt;

&lt;p&gt;They only need to produce patterns humans interpret as meaningful.&lt;/p&gt;

&lt;p&gt;And humans are extremely good at interpreting meaning.&lt;/p&gt;




&lt;p&gt;We’ve been exploring these behavioral patterns while building Crucible — an open-source framework for testing AI systems under adversarial and real-world conditions.&lt;/p&gt;

&lt;p&gt;One thing becoming increasingly obvious:&lt;/p&gt;

&lt;p&gt;The most powerful part of AI interaction may not be the model itself.&lt;/p&gt;

&lt;p&gt;It’s the human mind interpreting the output.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>opensource</category>
      <category>security</category>
    </item>
  </channel>
</rss>
