<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Crucible Security</title>
    <description>The latest articles on DEV Community by Crucible Security (@crucible_sec).</description>
    <link>https://dev.to/crucible_sec</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3896532%2F726f5f02-203b-4772-973d-aa3935419ab7.jpg</url>
      <title>DEV Community: Crucible Security</title>
      <link>https://dev.to/crucible_sec</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/crucible_sec"/>
    <language>en</language>
    <item>
      <title>AI Security Doesn't End at Deployment</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Tue, 30 Jun 2026 16:49:16 +0000</pubDate>
      <link>https://dev.to/crucible_sec/ai-security-doesnt-end-at-deployment-5167</link>
      <guid>https://dev.to/crucible_sec/ai-security-doesnt-end-at-deployment-5167</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fnnmq12u1gbdfrqagfd8w.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fnnmq12u1gbdfrqagfd8w.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Many software teams have embraced continuous integration and continuous deployment.&lt;/p&gt;

&lt;p&gt;AI systems deserve the same mindset for security.&lt;/p&gt;

&lt;p&gt;The moment an AI agent reaches production, its environment begins to change.&lt;/p&gt;

&lt;p&gt;New users interact with it.&lt;/p&gt;

&lt;p&gt;New tools are connected.&lt;/p&gt;

&lt;p&gt;Enterprise data evolves.&lt;/p&gt;

&lt;p&gt;Memory grows.&lt;/p&gt;

&lt;p&gt;External systems change.&lt;/p&gt;

&lt;p&gt;Even if nothing inside the model changes, the system around it does.&lt;/p&gt;

&lt;p&gt;That means a security assessment performed before launch is only one snapshot in time.&lt;/p&gt;

&lt;p&gt;As AI applications become more dynamic, security has to become continuous.&lt;/p&gt;

&lt;p&gt;The future isn't just "secure by design."&lt;/p&gt;

&lt;p&gt;It's secure throughout operation.&lt;/p&gt;

&lt;p&gt;That's one of the long-term ideas behind Crucible: making AI security a continuous engineering practice rather than a one-time release checklist.&lt;/p&gt;

&lt;p&gt;Pytest for AI Agents.&lt;/p&gt;

&lt;h1&gt;
  
  
  opensource
&lt;/h1&gt;

&lt;h1&gt;
  
  
  cybersecurity
&lt;/h1&gt;

&lt;h1&gt;
  
  
  python
&lt;/h1&gt;

&lt;h1&gt;
  
  
  buildinpublic
&lt;/h1&gt;

&lt;h1&gt;
  
  
  aiagents
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>buildinpublic</category>
      <category>opensource</category>
    </item>
    <item>
      <title>Measuring an Open Source Project by Trust, Not Downloads</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Tue, 30 Jun 2026 16:22:58 +0000</pubDate>
      <link>https://dev.to/crucible_sec/measuring-an-open-source-project-by-trust-not-downloads-3p1</link>
      <guid>https://dev.to/crucible_sec/measuring-an-open-source-project-by-trust-not-downloads-3p1</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fbqiw0am632kxtxgxv6n6.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fbqiw0am632kxtxgxv6n6.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Open source has plenty of metrics.&lt;/p&gt;

&lt;p&gt;Stars.&lt;/p&gt;

&lt;p&gt;Forks.&lt;/p&gt;

&lt;p&gt;Downloads.&lt;/p&gt;

&lt;p&gt;Installs.&lt;/p&gt;

&lt;p&gt;Clones.&lt;/p&gt;

&lt;p&gt;They're useful, but they don't tell the whole story.&lt;/p&gt;

&lt;p&gt;The number we care about most is how many developers decide to give the project a chance.&lt;/p&gt;

&lt;p&gt;Over the past 14 days, more than 180 unique developers cloned the Crucible repository.&lt;/p&gt;

&lt;p&gt;During the last month, the package recorded 1,330+ PyPI installs.&lt;/p&gt;

&lt;p&gt;Those numbers don't mean we've "made it."&lt;/p&gt;

&lt;p&gt;They mean people believe AI agent security is a problem worth solving.&lt;/p&gt;

&lt;p&gt;To everyone who installed Crucible, opened an issue, suggested an improvement, submitted a pull request, or simply shared feedback—&lt;/p&gt;

&lt;p&gt;Thank you.&lt;/p&gt;

&lt;p&gt;Open source is built by communities, not companies.&lt;/p&gt;

&lt;p&gt;We're excited for what's next.&lt;/p&gt;

&lt;p&gt;Pytest for AI Agents.&lt;/p&gt;

&lt;h1&gt;
  
  
  opensource
&lt;/h1&gt;

&lt;h1&gt;
  
  
  buildinpublic
&lt;/h1&gt;

&lt;h1&gt;
  
  
  cybersecurity
&lt;/h1&gt;

&lt;h1&gt;
  
  
  artificialintelligence
&lt;/h1&gt;

&lt;h1&gt;
  
  
  aiagents
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>buildinpublic</category>
      <category>opensource</category>
    </item>
    <item>
      <title>AI Security 2.0 Starts Where Model Security Ends</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Tue, 30 Jun 2026 16:15:36 +0000</pubDate>
      <link>https://dev.to/crucible_sec/ai-security-20-starts-where-model-security-ends-1d5i</link>
      <guid>https://dev.to/crucible_sec/ai-security-20-starts-where-model-security-ends-1d5i</guid>
      <description>&lt;p&gt;The first generation of AI security focused on one objective:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fi0su8mhvy9mv5nyaycnz.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fi0su8mhvy9mv5nyaycnz.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Protect the language model.&lt;/p&gt;

&lt;p&gt;That solved an important problem.&lt;/p&gt;

&lt;p&gt;But today's AI applications have become much more capable.&lt;/p&gt;

&lt;p&gt;Modern AI agents:&lt;/p&gt;

&lt;p&gt;execute tools&lt;br&gt;
access enterprise systems&lt;br&gt;
maintain memory&lt;br&gt;
browse the web&lt;br&gt;
orchestrate workflows&lt;br&gt;
communicate with other services&lt;br&gt;
operate over long conversations&lt;/p&gt;

&lt;p&gt;Each capability introduces a new trust boundary.&lt;/p&gt;

&lt;p&gt;That means the attack surface is no longer limited to prompt injection or jailbreaks.&lt;/p&gt;

&lt;p&gt;It includes behavior.&lt;/p&gt;

&lt;p&gt;Permissions.&lt;/p&gt;

&lt;p&gt;Memory.&lt;/p&gt;

&lt;p&gt;Infrastructure.&lt;/p&gt;

&lt;p&gt;Integrations.&lt;/p&gt;

&lt;p&gt;The next generation of AI security isn't about replacing model security.&lt;/p&gt;

&lt;p&gt;It's about expanding it.&lt;/p&gt;

&lt;p&gt;That's the direction behind Crucible: testing deployed AI agents as complete systems so developers can understand how they behave in production, not just how they respond to a single prompt.&lt;/p&gt;

&lt;p&gt;As AI systems evolve, security has to evolve alongside them.&lt;/p&gt;

&lt;p&gt;Pytest for AI Agents.&lt;/p&gt;

&lt;h1&gt;
  
  
  cybersecurity #artificialintelligence #opensource #githubopensource #security #aiagents
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>buildinpublic</category>
      <category>opensource</category>
    </item>
    <item>
      <title>AI Security Isn't a Product. It's an Engineering Discipline.</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Tue, 30 Jun 2026 15:50:41 +0000</pubDate>
      <link>https://dev.to/crucible_sec/ai-security-isnt-a-product-its-an-engineering-discipline-4khc</link>
      <guid>https://dev.to/crucible_sec/ai-security-isnt-a-product-its-an-engineering-discipline-4khc</guid>
      <description>&lt;p&gt;As AI systems become more capable, security can no longer be treated as a one-time activity.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F3gu8fjnol13gglwtbdle.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F3gu8fjnol13gglwtbdle.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;A single scan before release isn't enough.&lt;/p&gt;

&lt;p&gt;Production AI changes.&lt;/p&gt;

&lt;p&gt;Models evolve.&lt;/p&gt;

&lt;p&gt;Prompts change.&lt;/p&gt;

&lt;p&gt;Tools are added.&lt;/p&gt;

&lt;p&gt;Memory grows.&lt;/p&gt;

&lt;p&gt;Integrations expand.&lt;/p&gt;

&lt;p&gt;Security has to evolve alongside the system.&lt;/p&gt;

&lt;p&gt;That's why we approached Crucible differently.&lt;/p&gt;

&lt;p&gt;Instead of building another standalone AI security scanner, we focused on creating an engineering platform that integrates with existing development workflows. From HTTP-native testing and asynchronous execution to multiple reporting formats and CI/CD integration, the goal is to make AI security something teams can run continuously—not something they remember to do before a release. These capabilities are reflected throughout the platform's architecture and reporting pipeline.&lt;/p&gt;

&lt;p&gt;Trustworthy AI isn't achieved with one successful scan.&lt;/p&gt;

&lt;p&gt;It's built through repeatable engineering practices.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>buildinpublic</category>
      <category>opensource</category>
    </item>
    <item>
      <title>AI Security Has Evolved. Has Your Testing?</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Mon, 29 Jun 2026 16:56:24 +0000</pubDate>
      <link>https://dev.to/crucible_sec/ai-security-has-evolved-has-your-testing-23bi</link>
      <guid>https://dev.to/crucible_sec/ai-security-has-evolved-has-your-testing-23bi</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fp5n1er1sgnmy34wzaicb.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fp5n1er1sgnmy34wzaicb.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;For years, AI security focused on the language model.&lt;/p&gt;

&lt;p&gt;That made sense.&lt;/p&gt;

&lt;p&gt;The model was the application.&lt;/p&gt;

&lt;p&gt;Today, that's no longer true.&lt;/p&gt;

&lt;p&gt;Modern AI systems are agents.&lt;/p&gt;

&lt;p&gt;They connect to APIs.&lt;/p&gt;

&lt;p&gt;They use tools.&lt;/p&gt;

&lt;p&gt;They store memory.&lt;/p&gt;

&lt;p&gt;They browse the web.&lt;/p&gt;

&lt;p&gt;They coordinate workflows.&lt;/p&gt;

&lt;p&gt;They interact with enterprise systems.&lt;/p&gt;

&lt;p&gt;The attack surface has expanded far beyond prompt injection.&lt;/p&gt;

&lt;p&gt;Security testing has to expand with it.&lt;/p&gt;

&lt;p&gt;Crucible was designed around this new reality. Instead of treating an AI application as a single model, it evaluates the complete system through its production interfaces, including tools, memory, browser interactions, MCP servers, and multi-turn behavior. That approach reflects the way AI agents are actually deployed and used.&lt;/p&gt;

&lt;p&gt;The future of AI isn't just smarter models.&lt;/p&gt;

&lt;p&gt;It's more capable agents.&lt;/p&gt;

&lt;p&gt;And capable agents require more comprehensive security.&lt;/p&gt;

&lt;p&gt;Pytest for AI Agents.&lt;/p&gt;

&lt;h1&gt;
  
  
  cybersecurity #artificialintelligence #opensource #githubopensource #security #aiagents
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>buildinpublic</category>
      <category>opensource</category>
      <category>cybersecurity</category>
    </item>
    <item>
      <title>AI Models Aren't Your Product. AI Agents Are.</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Mon, 29 Jun 2026 16:46:30 +0000</pubDate>
      <link>https://dev.to/crucible_sec/ai-models-arent-your-product-ai-agents-are-3oid</link>
      <guid>https://dev.to/crucible_sec/ai-models-arent-your-product-ai-agents-are-3oid</guid>
      <description>&lt;p&gt;When most people think about AI security, they think about testing a language model.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F4plie289svoaux8chqkj.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F4plie289svoaux8chqkj.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Can it resist prompt injection?&lt;/p&gt;

&lt;p&gt;Can it resist jailbreaks?&lt;/p&gt;

&lt;p&gt;Can it avoid hallucinations?&lt;/p&gt;

&lt;p&gt;Those questions matter—but they're only part of the picture.&lt;/p&gt;

&lt;p&gt;Modern AI applications are no longer just models.&lt;/p&gt;

&lt;p&gt;They're agents.&lt;/p&gt;

&lt;p&gt;Agents call tools.&lt;/p&gt;

&lt;p&gt;They access APIs.&lt;/p&gt;

&lt;p&gt;They maintain memory.&lt;/p&gt;

&lt;p&gt;They connect to enterprise systems.&lt;/p&gt;

&lt;p&gt;They interact with MCP servers.&lt;/p&gt;

&lt;p&gt;They execute workflows.&lt;/p&gt;

&lt;p&gt;Every one of those capabilities introduces new security boundaries that a model-only evaluation can miss.&lt;/p&gt;

&lt;p&gt;That's why Crucible was built around deployed HTTP-accessible AI agents rather than raw LLMs. It evaluates the system the way users experience it in production: as a complete application with tools, permissions, memory, and behavior—not just a text generator.&lt;/p&gt;

&lt;p&gt;The future of AI security isn't just testing what models say.&lt;/p&gt;

&lt;p&gt;It's testing what agents do.&lt;/p&gt;

&lt;p&gt;Pytest for AI Agents.&lt;/p&gt;

&lt;h1&gt;
  
  
  cybersecurity #artificialintelligence #opensource #githubopensource #security #aiagents
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>opensource</category>
      <category>buildinpublic</category>
    </item>
    <item>
      <title>The Most Dangerous AI Payload Might Not Exist Yet</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Mon, 29 Jun 2026 14:54:54 +0000</pubDate>
      <link>https://dev.to/crucible_sec/the-most-dangerous-ai-payload-might-not-exist-yet-3fn9</link>
      <guid>https://dev.to/crucible_sec/the-most-dangerous-ai-payload-might-not-exist-yet-3fn9</guid>
      <description>&lt;p&gt;Security testing often relies on historical attacks.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F2ot8tr7s2c948heeqi23.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F2ot8tr7s2c948heeqi23.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Known payloads.&lt;/p&gt;

&lt;p&gt;Known jailbreaks.&lt;/p&gt;

&lt;p&gt;Known prompt injections.&lt;/p&gt;

&lt;p&gt;That's necessary—but it's not sufficient.&lt;/p&gt;

&lt;p&gt;Attackers constantly adapt.&lt;/p&gt;

&lt;p&gt;A small wording change.&lt;/p&gt;

&lt;p&gt;A different structure.&lt;/p&gt;

&lt;p&gt;An unexpected context.&lt;/p&gt;

&lt;p&gt;Sometimes that's all it takes to bypass a defense that worked yesterday.&lt;/p&gt;

&lt;p&gt;That's why modern AI security testing should move beyond replaying known attacks.&lt;/p&gt;

&lt;p&gt;Crucible's Shadow Payload Generator is designed to create new adversarial payloads inspired by existing attack patterns, helping developers evaluate how resilient their systems are to variations—not just exact matches.&lt;/p&gt;

&lt;p&gt;Because production AI shouldn't only defend against what we've already discovered.&lt;/p&gt;

&lt;p&gt;It should be prepared for what comes next.&lt;/p&gt;

&lt;p&gt;Pytest for AI Agents.&lt;/p&gt;

&lt;h1&gt;
  
  
  cybersecurity
&lt;/h1&gt;

&lt;h1&gt;
  
  
  artificalintelligence
&lt;/h1&gt;

&lt;h1&gt;
  
  
  opensource
&lt;/h1&gt;

&lt;h1&gt;
  
  
  githubopensource
&lt;/h1&gt;

&lt;h1&gt;
  
  
  security
&lt;/h1&gt;

&lt;h1&gt;
  
  
  buildinpublic
&lt;/h1&gt;

&lt;h1&gt;
  
  
  aiagents
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>opensource</category>
      <category>buildinpublic</category>
    </item>
    <item>
      <title>Static Payloads Don't Represent Real Attackers</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Mon, 29 Jun 2026 14:43:49 +0000</pubDate>
      <link>https://dev.to/crucible_sec/static-payloads-dont-represent-real-attackers-3kg4</link>
      <guid>https://dev.to/crucible_sec/static-payloads-dont-represent-real-attackers-3kg4</guid>
      <description>&lt;p&gt;Many AI security tools rely on fixed payload libraries.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F3jp8uk8tk5ofuq58q85w.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F3jp8uk8tk5ofuq58q85w.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;That's a good starting point.&lt;/p&gt;

&lt;p&gt;But attackers rarely repeat the exact same prompt.&lt;/p&gt;

&lt;p&gt;They rewrite it.&lt;/p&gt;

&lt;p&gt;They obfuscate it.&lt;/p&gt;

&lt;p&gt;They change wording.&lt;/p&gt;

&lt;p&gt;They manipulate context.&lt;/p&gt;

&lt;p&gt;They keep adapting until something works.&lt;/p&gt;

&lt;p&gt;Security testing should reflect that reality.&lt;/p&gt;

&lt;p&gt;That's why Crucible includes a Mutation Engine that transforms base attack payloads into multiple variations before executing them. Instead of evaluating whether an agent blocks one exact prompt, it evaluates whether the underlying defense remains effective as the attack evolves.&lt;/p&gt;

&lt;p&gt;This shifts testing from pattern matching to resilience testing.&lt;/p&gt;

&lt;p&gt;Because defending against one prompt is useful.&lt;/p&gt;

&lt;p&gt;Defending against an entire family of attacks is far more valuable.&lt;/p&gt;

&lt;p&gt;Pytest for AI Agents.&lt;/p&gt;

&lt;h1&gt;
  
  
  cybersecurity
&lt;/h1&gt;

&lt;h1&gt;
  
  
  artificalintelligence
&lt;/h1&gt;

&lt;h1&gt;
  
  
  opensource
&lt;/h1&gt;

&lt;h1&gt;
  
  
  buildinpublic
&lt;/h1&gt;

&lt;h1&gt;
  
  
  aiagents
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>buildinpublic</category>
      <category>opensource</category>
    </item>
    <item>
      <title>Why One Prompt Isn't Enough to Test an AI Agent</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Sun, 28 Jun 2026 16:30:44 +0000</pubDate>
      <link>https://dev.to/crucible_sec/why-one-prompt-isnt-enough-to-test-an-ai-agent-4a2a</link>
      <guid>https://dev.to/crucible_sec/why-one-prompt-isnt-enough-to-test-an-ai-agent-4a2a</guid>
      <description>&lt;p&gt;Most AI security testing still revolves around a simple pattern:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fay3i9itz623i1spe6bft.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fay3i9itz623i1spe6bft.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Send one prompt.&lt;/p&gt;

&lt;p&gt;Read one response.&lt;/p&gt;

&lt;p&gt;Move on.&lt;/p&gt;

&lt;p&gt;That works for testing isolated behavior.&lt;/p&gt;

&lt;p&gt;It doesn't reflect how AI agents are actually used.&lt;/p&gt;

&lt;p&gt;Real interactions unfold over multiple turns.&lt;/p&gt;

&lt;p&gt;Context accumulates.&lt;/p&gt;

&lt;p&gt;Memory influences decisions.&lt;/p&gt;

&lt;p&gt;Trust builds over time.&lt;/p&gt;

&lt;p&gt;Attackers know this.&lt;/p&gt;

&lt;p&gt;Rather than attempting an obvious exploit immediately, they may gradually influence an agent's behavior through a sequence of seemingly harmless interactions.&lt;/p&gt;

&lt;p&gt;That's why Crucible includes a Behavioral Drift Engine together with multi-turn attack strategies. Instead of evaluating a single response, it analyzes how an agent's behavior changes across an entire conversation, helping identify risks that only emerge after context has evolved.&lt;/p&gt;

&lt;p&gt;Security testing should model the way AI is used in production—not just how it responds to one message.&lt;/p&gt;

&lt;p&gt;Because one prompt tells you what an AI says.&lt;/p&gt;

&lt;p&gt;Multiple turns tell you how it behaves.&lt;/p&gt;

&lt;p&gt;Pytest for AI Agents.&lt;/p&gt;

&lt;h1&gt;
  
  
  cybersecurity
&lt;/h1&gt;

&lt;h1&gt;
  
  
  artificalintelligence
&lt;/h1&gt;

&lt;h1&gt;
  
  
  opensource
&lt;/h1&gt;

&lt;h1&gt;
  
  
  buildinpublic
&lt;/h1&gt;

&lt;h1&gt;
  
  
  aiagents
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>buildinpublic</category>
      <category>opensource</category>
    </item>
    <item>
      <title>Why AI Security Needs More Than a Few Test Prompts</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Sun, 28 Jun 2026 16:21:10 +0000</pubDate>
      <link>https://dev.to/crucible_sec/why-ai-security-needs-more-than-a-few-test-prompts-174m</link>
      <guid>https://dev.to/crucible_sec/why-ai-security-needs-more-than-a-few-test-prompts-174m</guid>
      <description>&lt;p&gt;A common mistake in AI security testing is assuming that a handful of prompts can accurately measure an agent's resilience.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fznzlqu89mm5dztqsnqs2.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fznzlqu89mm5dztqsnqs2.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;In reality, attackers don't repeat the same technique.&lt;/p&gt;

&lt;p&gt;They change tactics.&lt;/p&gt;

&lt;p&gt;They chain attacks.&lt;/p&gt;

&lt;p&gt;They exploit new trust boundaries.&lt;/p&gt;

&lt;p&gt;That's why Crucible includes 170+ attack vectors spread across multiple security categories.&lt;/p&gt;

&lt;p&gt;Rather than focusing on a single vulnerability class, it evaluates how an AI agent responds to a broad range of adversarial behaviors—from prompt injection and jailbreaks to memory poisoning, MCP security, browser agents, infrastructure escalation, and more. These attack vectors are organized into dedicated modules so testing remains structured while covering a wide range of AI-specific risks.&lt;/p&gt;

&lt;p&gt;The goal isn't to overwhelm developers.&lt;/p&gt;

&lt;p&gt;It's to make comprehensive security testing repeatable, automated, and practical.&lt;/p&gt;

&lt;p&gt;Because real-world AI systems deserve more than a few manually written prompts.&lt;/p&gt;

&lt;p&gt;Pytest for AI Agents.&lt;/p&gt;

&lt;h1&gt;
  
  
  cybersecurity #artificalintelligence #opensource #buildinpublic #aiagents
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>buildinpublic</category>
      <category>opensource</category>
    </item>
    <item>
      <title>AI Agents Don't Have One Attack Surface</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Sun, 28 Jun 2026 15:59:39 +0000</pubDate>
      <link>https://dev.to/crucible_sec/ai-agents-dont-have-one-attack-surface-af9</link>
      <guid>https://dev.to/crucible_sec/ai-agents-dont-have-one-attack-surface-af9</guid>
      <description>&lt;p&gt;One of the biggest misconceptions in AI security is that a single test is enough.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F0c0y9f95mc96pn7oftpq.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F0c0y9f95mc96pn7oftpq.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Run a few prompt injection payloads.&lt;/p&gt;

&lt;p&gt;Check the responses.&lt;/p&gt;

&lt;p&gt;Ship to production.&lt;/p&gt;

&lt;p&gt;Modern AI agents are far more complex.&lt;/p&gt;

&lt;p&gt;A production agent may interact with tools, memory systems, browsers, MCP servers, enterprise knowledge bases, cloud infrastructure, and external APIs. Each of those components introduces different security risks.&lt;/p&gt;

&lt;p&gt;That's why Crucible organizes testing into 13 dedicated security modules, each focused on a different class of vulnerabilities. Instead of producing isolated results, the framework combines those findings into a single report, giving developers a broader view of their agent's security posture.&lt;/p&gt;

&lt;p&gt;Security isn't about asking:&lt;/p&gt;

&lt;p&gt;"Did we test prompt injection?"&lt;/p&gt;

&lt;p&gt;It's about asking:&lt;/p&gt;

&lt;p&gt;"What parts of our AI system haven't we tested yet?"&lt;/p&gt;

&lt;p&gt;Comprehensive coverage is what turns security testing into production confidence.&lt;/p&gt;

&lt;p&gt;Pytest for AI Agents.&lt;/p&gt;

&lt;h1&gt;
  
  
  cybersecurity
&lt;/h1&gt;

&lt;h1&gt;
  
  
  artificalintelligence
&lt;/h1&gt;

&lt;h1&gt;
  
  
  opensource
&lt;/h1&gt;

&lt;h1&gt;
  
  
  buildinpublic
&lt;/h1&gt;

&lt;h1&gt;
  
  
  aiagents
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>opensource</category>
      <category>buildinpublic</category>
    </item>
    <item>
      <title>Why AI Security Testing Needs to Be Measured in Seconds, Not Hours</title>
      <dc:creator>Crucible Security</dc:creator>
      <pubDate>Sun, 28 Jun 2026 15:14:14 +0000</pubDate>
      <link>https://dev.to/crucible_sec/why-ai-security-testing-needs-to-be-measured-in-seconds-not-hours-4mb8</link>
      <guid>https://dev.to/crucible_sec/why-ai-security-testing-needs-to-be-measured-in-seconds-not-hours-4mb8</guid>
      <description>&lt;p&gt;The biggest challenge with AI security isn't knowing what to test.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Ffr6f049l75opgmvmvpo5.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Ffr6f049l75opgmvmvpo5.png" alt=" " width="800" height="1200"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;It's finding the time to test everything.&lt;/p&gt;

&lt;p&gt;Traditional security reviews are often manual:&lt;/p&gt;

&lt;p&gt;Write a payload.&lt;br&gt;
Send it.&lt;br&gt;
Inspect the response.&lt;br&gt;
Try another variation.&lt;/p&gt;

&lt;p&gt;That approach quickly breaks down as AI systems grow more complex.&lt;/p&gt;

&lt;p&gt;Crucible takes a different approach.&lt;/p&gt;

&lt;p&gt;Instead of testing one prompt at a time, it automatically launches 90+ adversarial payloads in under 60 seconds against any HTTP-accessible AI agent and generates a structured security report.&lt;/p&gt;

&lt;p&gt;Those payloads span multiple attack categories, including prompt injection, goal hijacking, jailbreaks, MCP security, memory poisoning, and tool injection. The framework is designed to integrate with CI/CD pipelines so testing can happen continuously instead of only before release.&lt;/p&gt;

&lt;p&gt;Fast security testing isn't just about convenience.&lt;/p&gt;

&lt;p&gt;It's what makes continuous AI security practical.&lt;/p&gt;

&lt;p&gt;Because if testing takes hours, teams won't run it on every deployment.&lt;/p&gt;

&lt;p&gt;If it takes under a minute, they can.&lt;/p&gt;

&lt;p&gt;Pytest for AI Agents.&lt;/p&gt;

&lt;h1&gt;
  
  
  cybersecurity
&lt;/h1&gt;

&lt;h1&gt;
  
  
  artificalintelligence
&lt;/h1&gt;

&lt;h1&gt;
  
  
  opensource
&lt;/h1&gt;

&lt;h1&gt;
  
  
  buildinpublic
&lt;/h1&gt;

&lt;h1&gt;
  
  
  aiagents
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>cybersecurity</category>
      <category>opensource</category>
      <category>buildinpublic</category>
    </item>
  </channel>
</rss>
