<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: ETB Protocol</title>
    <description>The latest articles on DEV Community by ETB Protocol (@etb_protocol).</description>
    <link>https://dev.to/etb_protocol</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3973049%2F405f021f-8945-4e24-9b74-e5ae22f5170d.png</url>
      <title>DEV Community: ETB Protocol</title>
      <link>https://dev.to/etb_protocol</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/etb_protocol"/>
    <language>en</language>
    <item>
      <title>Why Your AI Agent Keeps Overreaching — And How to Fix It with a Boundary Contract</title>
      <dc:creator>ETB Protocol</dc:creator>
      <pubDate>Sun, 07 Jun 2026 21:56:04 +0000</pubDate>
      <link>https://dev.to/etb_protocol/why-your-ai-agent-keeps-overreaching-and-how-to-fix-it-with-a-boundary-contract-46d3</link>
      <guid>https://dev.to/etb_protocol/why-your-ai-agent-keeps-overreaching-and-how-to-fix-it-with-a-boundary-contract-46d3</guid>
      <description>&lt;p&gt;&lt;em&gt;A design protocol born from DeFi infrastructure, now applied to AI systems&lt;/em&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  The Problem
&lt;/h2&gt;

&lt;p&gt;You've built an AI agent. It works — sometimes brilliantly.&lt;/p&gt;

&lt;p&gt;But then it starts doing things you didn't ask for.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;It makes assumptions and acts on them&lt;/li&gt;
&lt;li&gt;It fills in missing data instead of saying "I don't know"&lt;/li&gt;
&lt;li&gt;It optimizes when you only asked it to observe&lt;/li&gt;
&lt;li&gt;It gives confident answers when it should refuse&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This isn't a model problem. &lt;strong&gt;It's an architecture problem.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Your agent has no boundary contract.&lt;/p&gt;




&lt;h2&gt;
  
  
  Where This Idea Came From
&lt;/h2&gt;

&lt;p&gt;I built a DeFi risk observer for Aave v3 — a system that watches on-chain positions and reports liquidation risk in real time.&lt;/p&gt;

&lt;p&gt;The hardest design decision wasn't the data model or the state machine.&lt;/p&gt;

&lt;p&gt;It was this question:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;When should the system refuse to output anything at all?&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;In DeFi, a wrong answer isn't just useless — it can cause real financial loss. So I designed a system that explicitly separates:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What is &lt;em&gt;verified&lt;/em&gt; (direct from protocol)&lt;/li&gt;
&lt;li&gt;What is &lt;em&gt;derived&lt;/em&gt; (computed from verified data)&lt;/li&gt;
&lt;li&gt;What is &lt;em&gt;estimated&lt;/em&gt; (approximate, labeled as such)&lt;/li&gt;
&lt;li&gt;What should be &lt;em&gt;refused&lt;/em&gt; (uncertain, inconsistent, or unsafe to show)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;When I applied this same philosophy to an AI agent I was building for content automation — something completely unrelated to DeFi — &lt;strong&gt;the agent's overreach dropped significantly.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The principle transferred. The boundary contract worked.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Core Principle
&lt;/h2&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Refusal over Uncertainty. Boundary over Prediction. Observability over Automation.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Most AI systems are designed to &lt;em&gt;always produce output&lt;/em&gt;. Silence feels like failure. Uncertainty gets smoothed over. Gaps get filled with plausible-sounding content.&lt;/p&gt;

&lt;p&gt;The result: agents that confidently do the wrong thing.&lt;/p&gt;

&lt;p&gt;A boundary contract inverts this default.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Four Layers
&lt;/h2&gt;

&lt;p&gt;Every AI output can be classified into one of four trust layers:&lt;/p&gt;

&lt;h3&gt;
  
  
  1. VERIFIED
&lt;/h3&gt;

&lt;p&gt;Directly observable. The system retrieved this from a reliable source and can confirm it.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;"The article was published on June 1, 2026."&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3&gt;
  
  
  2. CONSISTENT
&lt;/h3&gt;

&lt;p&gt;Derived deterministically from verified data. The logic is transparent and repeatable.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;"Based on the publication date, this is within the 30-day window."&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3&gt;
  
  
  3. ESTIMATED
&lt;/h3&gt;

&lt;p&gt;An approximation. Useful, but explicitly labeled as such. Not to be treated as fact.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;"The reading time is approximately 4 minutes."&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3&gt;
  
  
  4. REFUSED
&lt;/h3&gt;

&lt;p&gt;The system cannot produce a trustworthy output. It says nothing rather than something wrong.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Output withheld. Reason: source data inconsistent.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  The State Model
&lt;/h2&gt;

&lt;p&gt;Pair the trust layer with an observable state:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;State&lt;/th&gt;
&lt;th&gt;Meaning&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;code&gt;STABLE&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Operating within safe boundaries&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;code&gt;WATCH&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Approaching a boundary — caution advised&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;code&gt;BOUNDARY_APPROACHING&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Near-limit — intervention may be needed&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;code&gt;DEGRADED&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Output possible but quality is reduced&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;code&gt;REFUSAL&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Output withheld intentionally&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;These aren't errors. &lt;strong&gt;REFUSAL is a feature, not a failure.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Applying This to AI Agents
&lt;/h2&gt;

&lt;p&gt;Here's a practical example. Suppose your agent summarizes recent news articles.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Without a boundary contract:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Missing article → agent invents plausible content&lt;/li&gt;
&lt;li&gt;Stale data → agent presents it as current&lt;/li&gt;
&lt;li&gt;Conflicting sources → agent picks one and ignores the other&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;With a boundary contract:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Missing article → &lt;code&gt;REFUSED&lt;/code&gt; with reason: &lt;em&gt;"Source unavailable"&lt;/em&gt;
&lt;/li&gt;
&lt;li&gt;Stale data → &lt;code&gt;ESTIMATED&lt;/code&gt; with label: &lt;em&gt;"Data may be outdated"&lt;/em&gt;
&lt;/li&gt;
&lt;li&gt;Conflicting sources → &lt;code&gt;DEGRADED&lt;/code&gt; with label: &lt;em&gt;"Sources inconsistent"&lt;/em&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The agent becomes honest about what it knows and doesn't know.&lt;/p&gt;




&lt;h2&gt;
  
  
  The System Prompt Pattern
&lt;/h2&gt;

&lt;p&gt;Here's a minimal implementation in a system prompt:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;You are an observer agent. Your role is to report state, not to act.

For every output, classify it as one of:
- VERIFIED: directly confirmed from source
- CONSISTENT: derived from verified data
- ESTIMATED: approximate — label it clearly
- REFUSED: do not output if data is missing, inconsistent, or unsafe

Rules:
- Never fill gaps with assumptions
- Never produce output when sources conflict
- Never optimize, advise, or act — only observe and report
- When in doubt, refuse

Refusal is correct behavior. Silence is safer than a confident wrong answer.
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This single addition changed the behavior of my agents more than any other prompt engineering technique I've tried.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why This Works
&lt;/h2&gt;

&lt;p&gt;The underlying principle is simple:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The protocol restricts transitions, not states.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;An AI agent can end up in a bad state through external circumstances — bad data, ambiguous input, conflicting context. That's unavoidable.&lt;/p&gt;

&lt;p&gt;What you &lt;em&gt;can&lt;/em&gt; control is whether the agent acknowledges that state and handles it explicitly, or papers over it with confident-sounding output.&lt;/p&gt;

&lt;p&gt;The boundary contract makes the agent's epistemic state legible — to you, and to downstream systems.&lt;/p&gt;




&lt;h2&gt;
  
  
  What I'm Releasing
&lt;/h2&gt;

&lt;p&gt;I've formalized this into a document:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Boundary Contract for AI Systems v0.1&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;It includes:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The full trust layer specification (VERIFIED / CONSISTENT / ESTIMATED / REFUSED)&lt;/li&gt;
&lt;li&gt;The state model with transition rules&lt;/li&gt;
&lt;li&gt;System prompt templates for common agent patterns&lt;/li&gt;
&lt;li&gt;The Non-Advisory Integrity Clause (what your agent must never do)&lt;/li&gt;
&lt;li&gt;Refusal protocol with trigger conditions&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Available on Gumroad: &lt;a href="https://arcthree.gumroad.com/l/etb-boundary-contract" rel="noopener noreferrer"&gt;https://arcthree.gumroad.com/l/etb-boundary-contract&lt;/a&gt;
&lt;/h2&gt;

&lt;h2&gt;
  
  
  Final Thought
&lt;/h2&gt;

&lt;p&gt;The most reliable AI systems I've seen have one thing in common:&lt;/p&gt;

&lt;p&gt;They know what they don't know.&lt;/p&gt;

&lt;p&gt;Building that awareness in requires explicit design. It doesn't happen by default.&lt;/p&gt;

&lt;p&gt;A boundary contract is how you make it intentional.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Built on the UEH (Universal Exchange Adapters) design philosophy.&lt;/em&gt;&lt;br&gt;
&lt;em&gt;Originally developed for DeFi risk observation infrastructure.&lt;/em&gt;&lt;br&gt;
&lt;em&gt;GitHub: github.com/ueh-labs/ueh-observer&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>boundary</category>
      <category>programming</category>
    </item>
  </channel>
</rss>
