<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Atanas Egriderliev</title>
    <description>The latest articles on DEV Community by Atanas Egriderliev (@mlnas).</description>
    <link>https://dev.to/mlnas</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3780202%2F1f12efeb-299c-42c0-8c8c-79e5270b8b68.png</url>
      <title>DEV Community: Atanas Egriderliev</title>
      <link>https://dev.to/mlnas</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/mlnas"/>
    <language>en</language>
    <item>
      <title>Built runtime security for AI agents</title>
      <dc:creator>Atanas Egriderliev</dc:creator>
      <pubDate>Wed, 18 Feb 2026 22:08:39 +0000</pubDate>
      <link>https://dev.to/mlnas/built-runtime-security-for-ai-agents-576h</link>
      <guid>https://dev.to/mlnas/built-runtime-security-for-ai-agents-576h</guid>
      <description>&lt;p&gt;I noticed a big gap: there's no standard way to control what AI agents are allowed to do at runtime.&lt;br&gt;
If an agent gets manipulated (via prompt injection) or simply makes a bad decision, it can cause real damage before you notice. Examples I've seen:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Agent exports entire customer database&lt;/li&gt;
&lt;li&gt;Agent emails PII to external addresses&lt;/li&gt;
&lt;li&gt;Agent makes unauthorised financial transactions&lt;/li&gt;
&lt;li&gt;Agent installs compromised tools&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So I built Agent-SPM, a security layer that enforces policies on agent actions in real time.&lt;/p&gt;

&lt;p&gt;How it works:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;You define what your agent can/can't do in a policy file.&lt;/li&gt;
&lt;li&gt;Every action the agent wants to take is checked against those rules before execution. It's like a firewall, but for AI agents instead of network traffic (see the sketch after this list).&lt;/li&gt;
&lt;/ul&gt;
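
&lt;p&gt;To make that concrete, here's a minimal sketch of the pattern in Python. The policy shape and function names are illustrative assumptions, not the actual Agent-SPM API:&lt;/p&gt;

&lt;pre&gt;&lt;code&gt;# Illustrative only: the policy is plain data, checked before every tool call.
POLICY = {
    "allow_tools": ["search_docs", "send_email"],
    "deny_substrings": ["DROP TABLE", "rm -rf"],
}

def check_action(tool_name, args):
    """Return True if the proposed action passes the policy."""
    if tool_name not in POLICY["allow_tools"]:
        return False
    blob = " ".join(str(v) for v in args.values())
    return not any(bad in blob for bad in POLICY["deny_substrings"])

# Gate every tool call through the check before executing it.
action = ("send_email", {"to": "user@example.com", "body": "monthly report"})
if check_action(*action):
    pass  # hand the call to the real tool
else:
    pass  # block, log, or escalate for human approval
&lt;/code&gt;&lt;/pre&gt;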

&lt;p&gt;What I'm trying to solve:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Prevent data leaks (detect SSNs, credit cards, API keys in tool arguments; a detection sketch follows this list)&lt;/li&gt;
&lt;li&gt;Stop unauthorised actions (no bulk exports, no dangerous commands)&lt;/li&gt;
&lt;li&gt;Enable human oversight (require approval for high-risk operations)&lt;/li&gt;
&lt;li&gt;Emergency controls (kill switch to disable rogue agents)&lt;/li&gt;
&lt;li&gt;Compliance (automatic audit trails for regulations)&lt;/li&gt;
&lt;/ul&gt;
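
&lt;p&gt;For the data-leak piece, the core idea is to scan tool arguments before they leave the process. A rough sketch with deliberately simplified patterns (real detection needs broader regexes plus validation, e.g. Luhn checks for card numbers); these are not the patterns Agent-SPM actually ships with:&lt;/p&gt;

&lt;pre&gt;&lt;code&gt;import re

# Illustrative detectors keyed by the kind of secret they look for.
PII_PATTERNS = {
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "credit_card": re.compile(r"\b(?:\d[ -]?){13,16}\b"),
    "api_key": re.compile(r"\b(?:sk|pk)[-_][A-Za-z0-9]{20,}\b"),
}

def scan_args(args):
    """Return the names of any PII patterns found in tool arguments."""
    blob = " ".join(str(v) for v in args.values())
    return [name for name, rx in PII_PATTERNS.items() if rx.search(blob)]

hits = scan_args({"to": "x@example.com", "body": "SSN is 123-45-6789"})
# hits == ["ssn"]: block the action or require approval before it runs
&lt;/code&gt;&lt;/pre&gt;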

&lt;p&gt;Technical details:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Open source (MIT license)&lt;/li&gt;
&lt;li&gt;Works with any LLM framework (LangChain, CrewAI, Claude, custom)&lt;/li&gt;
&lt;li&gt;Zero infrastructure (runs in your agent's process; see the integration sketch after this list)&lt;/li&gt;
&lt;li&gt;8 composable packages (install only what you need)&lt;/li&gt;
&lt;/ul&gt;
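
&lt;p&gt;"Runs in your agent's process" just means wrapping tool functions, whatever framework they come from. A hypothetical decorator sketch (not the real package API) to show the shape of the integration:&lt;/p&gt;

&lt;pre&gt;&lt;code&gt;from functools import wraps

ALLOWED_TOOLS = {"search_docs", "send_email"}  # illustrative policy

def guarded(tool_name):
    """Run a policy check in-process before the wrapped tool executes."""
    def decorator(fn):
        @wraps(fn)
        def wrapper(*args, **kwargs):
            if tool_name not in ALLOWED_TOOLS:
                raise PermissionError(f"policy blocked: {tool_name}")
            return fn(*args, **kwargs)
        return wrapper
    return decorator

@guarded("send_email")
def send_email(to, body):
    print(f"sending to {to}")  # the framework's real tool body goes here

send_email(to="a@example.com", body="hello")  # allowed; unlisted tools raise
&lt;/code&gt;&lt;/pre&gt;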

&lt;p&gt;Repo: &lt;a href="https://github.com/mlnas/agent-runtime-security" rel="noopener noreferrer"&gt;https://github.com/mlnas/agent-runtime-security&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;I'd genuinely appreciate feedback and contributions.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>security</category>
      <category>architecture</category>
    </item>
  </channel>
</rss>
