<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Waqar Javed</title>
    <description>The latest articles on DEV Community by Waqar Javed (@iamwaqarjaved).</description>
    <link>https://dev.to/iamwaqarjaved</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3951982%2Fcc08edee-ea46-4032-8dc1-90c917ef6d4a.png</url>
      <title>DEV Community: Waqar Javed</title>
      <link>https://dev.to/iamwaqarjaved</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/iamwaqarjaved"/>
    <language>en</language>
    <item>
      <title>I just published an open-source framework for red-teaming AI agents.</title>
      <dc:creator>Waqar Javed</dc:creator>
      <pubDate>Mon, 08 Jun 2026 08:58:46 +0000</pubDate>
      <link>https://dev.to/iamwaqarjaved/i-just-published-an-open-source-framework-for-red-teaming-ai-agents-21m1</link>
      <guid>https://dev.to/iamwaqarjaved/i-just-published-an-open-source-framework-for-red-teaming-ai-agents-21m1</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F13ahafxgee6lzm1qy033.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F13ahafxgee6lzm1qy033.png" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Not LLM chatbots — agents. The kind built on LangChain, CrewAI, AutoGPT-style architectures that use tools, call APIs, and take multi-step actions in the world.&lt;/p&gt;

&lt;p&gt;Here's the problem I kept running into: teams are shipping agentic systems to production, but the red-teaming tooling hasn't kept up. Most evaluation frameworks still treat agents like chatbots. They miss the failure modes that actually matter — prompt injection through tool outputs, scope violations across reasoning steps, behavioral drift under adversarial conditions.&lt;/p&gt;

&lt;p&gt;So I built AgentSafeLabs.&lt;/p&gt;

&lt;p&gt;You wrap your agent in one function call. It runs a test suite aligned to the OWASP Agentic Security Initiative Top 10 — the emerging standard for agentic AI security. You get structured results: PASS, FAIL, UNCERTAIN, with reproducible test cases.&lt;/p&gt;

&lt;p&gt;Real example from this week: We ran AgentSafeLabs against Claude Haiku as the target agent passed 2 of 3 ASI01 (prompt injection) tests. The third returned UNCERTAIN — an indirect injection through a benign-looking context prefix that partially redirected tool selection. That's the kind of edge case that doesn't show up in standard evals.&lt;/p&gt;

&lt;p&gt;It's MIT licensed, on PyPI, CI-verified, and actively being extended.&lt;/p&gt;

&lt;p&gt;pip install safelabs-eval&lt;/p&gt;

&lt;p&gt;GitHub: &lt;a href="https://github.com/AgentSafeLabs/safelabs-eval" rel="noopener noreferrer"&gt;https://github.com/AgentSafeLabs/safelabs-eval&lt;/a&gt; &lt;/p&gt;

&lt;p&gt;If you're building agents and you've hit unexpected failure modes — I'd like to hear about them. And if you know someone this would be useful for, a share goes a long way for an early OSS project.&lt;/p&gt;

&lt;h1&gt;
  
  
  AIAgents #AISecurity #RedTeaming #AgenticAI #PromptInjection #LLMSecurity #OpenSourceAI
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>webdev</category>
      <category>programming</category>
      <category>agents</category>
    </item>
  </channel>
</rss>
