<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: mr_miou</title>
    <description>The latest articles on DEV Community by mr_miou (@miou).</description>
    <link>https://dev.to/miou</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3950980%2F886a42c8-aefd-4885-872b-d289940aa54f.png</url>
      <title>DEV Community: mr_miou</title>
      <link>https://dev.to/miou</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/miou"/>
    <language>en</language>
    <item>
      <title>Why most AI fails at IDOR (and how AMAS fixes it with causal reasoning)</title>
      <dc:creator>mr_miou</dc:creator>
      <pubDate>Mon, 25 May 2026 15:42:49 +0000</pubDate>
      <link>https://dev.to/miou/why-most-ai-fails-at-idor-and-how-amas-fixes-it-with-causal-reasoning-517k</link>
      <guid>https://dev.to/miou/why-most-ai-fails-at-idor-and-how-amas-fixes-it-with-causal-reasoning-517k</guid>
      <description>&lt;h2&gt;
  
  
  The problem no one talks about
&lt;/h2&gt;

&lt;p&gt;Large language models are great at pattern matching.&lt;br&gt;&lt;br&gt;
Show them enough “vulnerable” examples, and they learn the &lt;em&gt;words&lt;/em&gt; – not the &lt;em&gt;reason&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;That’s why they struggle with &lt;strong&gt;logical vulnerabilities&lt;/strong&gt; like IDOR, privilege escalation, and workflow bypasses.&lt;/p&gt;

&lt;p&gt;These attacks depend on &lt;em&gt;who&lt;/em&gt; the actor is, &lt;em&gt;who&lt;/em&gt; owns the resource, and &lt;em&gt;where&lt;/em&gt; the trust boundary sits – not just on the presence of keywords like “authorization”.&lt;/p&gt;

&lt;h2&gt;
  
  
  Enter AMAS
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;AMAS&lt;/strong&gt; (AI Multi‑Agent Security Analysis System) is a security reasoning substrate.&lt;br&gt;&lt;br&gt;
It doesn’t teach the model to memorise patterns.&lt;br&gt;&lt;br&gt;
It teaches it to understand &lt;strong&gt;causality&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;Instead of “this is an IDOR”, AMAS models:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Identity &amp;amp; Session&lt;/strong&gt; – who is the actor? who is the owner?
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Enforcement failures&lt;/strong&gt; – which security control was missing or broken?
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Temporal transitions&lt;/strong&gt; – what changed before and after the request?
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Causal graphs&lt;/strong&gt; – how does a chain of events lead to a breach?&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  How it works (in a nutshell)
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Ingest real‑world CVEs&lt;/strong&gt; – streaming NVD feeds, filter by relevance (access control, auth, business logic).
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Convert each CVE into a causal graph&lt;/strong&gt; – extract actors, resources, trust boundaries, and missing controls.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Mix with high‑quality synthetic data&lt;/strong&gt; – generated by a semantic mutation engine that changes &lt;em&gt;mechanics&lt;/em&gt; (infrastructure, auth model, async behaviour), not just wording.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Quality control&lt;/strong&gt; – deduplication, domain coherence checks, repetition validation.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Export&lt;/strong&gt; – &lt;code&gt;train.jsonl&lt;/code&gt; ready for fine‑tuning, plus detailed manifests.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Everything is &lt;strong&gt;deterministic&lt;/strong&gt; (same seed → same dataset) and &lt;strong&gt;scalable&lt;/strong&gt; (handles 200k+ CVEs).&lt;/p&gt;

&lt;h2&gt;
  
  
  Results (so far)
&lt;/h2&gt;

&lt;p&gt;After fine‑tuning a Mistral‑7B model on AMAS‑generated data:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;✅ &lt;strong&gt;Generalisation&lt;/strong&gt; – same vulnerability, different wording → correctly identified.
&lt;/li&gt;
&lt;li&gt;✅ &lt;strong&gt;Negative tests&lt;/strong&gt; – secure login requests → no false alarms.
&lt;/li&gt;
&lt;li&gt;✅ &lt;strong&gt;Domain shift&lt;/strong&gt; – secure scenarios in fintech, e‑commerce, healthcare → all classified correctly.
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Duplication dropped from &lt;strong&gt;52% to &amp;lt;10%&lt;/strong&gt;, and domain corruption was &lt;strong&gt;eliminated&lt;/strong&gt; in the latest pipeline.&lt;/p&gt;

&lt;h2&gt;
  
  
  Next steps
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Open‑source release (GitHub) – complete pipeline, docs, examples.
&lt;/li&gt;
&lt;li&gt;Research paper (causal reasoning + identity‑aware training).
&lt;/li&gt;
&lt;li&gt;Pre‑trained model weights.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Why share this now?
&lt;/h2&gt;

&lt;p&gt;Because the industry keeps building pattern‑matching scanners.&lt;br&gt;&lt;br&gt;
We need &lt;strong&gt;reasoning engines&lt;/strong&gt;.  &lt;/p&gt;

&lt;p&gt;If you’re into AI security, causal ML, or synthetic data – let’s connect.&lt;br&gt;&lt;br&gt;
I’ll drop the repo link here as soon as it’s public.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Comments and questions are very welcome. Let’s make AI actually understand security.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>security</category>
      <category>llm</category>
      <category>cybersecurity</category>
    </item>
  </channel>
</rss>
