<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: DtoTHEmoon</title>
    <description>The latest articles on DEV Community by DtoTHEmoon (@dtothemoon).</description>
    <link>https://dev.to/dtothemoon</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3955327%2F7001497f-1267-4c21-8c0d-30c63c86a629.png</url>
      <title>DEV Community: DtoTHEmoon</title>
      <link>https://dev.to/dtothemoon</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/dtothemoon"/>
    <language>en</language>
    <item>
      <title>[Boost]</title>
      <dc:creator>DtoTHEmoon</dc:creator>
      <pubDate>Thu, 28 May 2026 02:03:01 +0000</pubDate>
      <link>https://dev.to/dtothemoon/-3p6b</link>
      <guid>https://dev.to/dtothemoon/-3p6b</guid>
      <description>&lt;div class="ltag__link--embedded"&gt;
  &lt;div class="crayons-story "&gt;
  &lt;a href="https://dev.to/dtothemoon/why-your-ai-agent-keeps-making-the-same-mistakes-its-not-the-model-3pl" class="crayons-story__hidden-navigation-link"&gt;Why Your AI Agent Keeps Making the Same Mistakes (It's Not the Model)&lt;/a&gt;


  &lt;div class="crayons-story__body crayons-story__body-full_post"&gt;
    &lt;div class="crayons-story__top"&gt;
      &lt;div class="crayons-story__meta"&gt;
        &lt;div class="crayons-story__author-pic"&gt;

          &lt;a href="/dtothemoon" class="crayons-avatar  crayons-avatar--l  "&gt;
            &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3955327%2F7001497f-1267-4c21-8c0d-30c63c86a629.png" alt="dtothemoon profile" class="crayons-avatar__image" width="420" height="420"&gt;
          &lt;/a&gt;
        &lt;/div&gt;
        &lt;div&gt;
          &lt;div&gt;
            &lt;a href="/dtothemoon" class="crayons-story__secondary fw-medium m:hidden"&gt;
              DtoTHEmoon
            &lt;/a&gt;
            &lt;div class="profile-preview-card relative mb-4 s:mb-0 fw-medium hidden m:inline-block"&gt;
              
                DtoTHEmoon
                
              
              &lt;div id="story-author-preview-content-3767092" class="profile-preview-card__content crayons-dropdown branded-7 p-4 pt-0"&gt;
                &lt;div class="gap-4 grid"&gt;
                  &lt;div class="-mt-4"&gt;
                    &lt;a href="/dtothemoon" class="flex"&gt;
                      &lt;span class="crayons-avatar crayons-avatar--xl mr-2 shrink-0"&gt;
                        &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3955327%2F7001497f-1267-4c21-8c0d-30c63c86a629.png" class="crayons-avatar__image" alt="" width="420" height="420"&gt;
                      &lt;/span&gt;
                      &lt;span class="crayons-link crayons-subtitle-2 mt-5"&gt;DtoTHEmoon&lt;/span&gt;
                    &lt;/a&gt;
                  &lt;/div&gt;
                  &lt;div class="print-hidden"&gt;
                    
                      Follow
                    
                  &lt;/div&gt;
                  &lt;div class="author-preview-metadata-container"&gt;&lt;/div&gt;
                &lt;/div&gt;
              &lt;/div&gt;
            &lt;/div&gt;

          &lt;/div&gt;
          &lt;a href="https://dev.to/dtothemoon/why-your-ai-agent-keeps-making-the-same-mistakes-its-not-the-model-3pl" class="crayons-story__tertiary fs-xs"&gt;&lt;time&gt;May 27&lt;/time&gt;&lt;span class="time-ago-indicator-initial-placeholder"&gt;&lt;/span&gt;&lt;/a&gt;
        &lt;/div&gt;
      &lt;/div&gt;

    &lt;/div&gt;

    &lt;div class="crayons-story__indention"&gt;
      &lt;h2 class="crayons-story__title crayons-story__title-full_post"&gt;
        &lt;a href="https://dev.to/dtothemoon/why-your-ai-agent-keeps-making-the-same-mistakes-its-not-the-model-3pl" id="article-link-3767092"&gt;
          Why Your AI Agent Keeps Making the Same Mistakes (It's Not the Model)
        &lt;/a&gt;
      &lt;/h2&gt;
        &lt;div class="crayons-story__tags"&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/ai"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;ai&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/claude"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;claude&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/agentaichallenge"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;agentaichallenge&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/chatgpt"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;chatgpt&lt;/a&gt;
        &lt;/div&gt;
      &lt;div class="crayons-story__bottom"&gt;
        &lt;div class="crayons-story__details"&gt;
          &lt;a href="https://dev.to/dtothemoon/why-your-ai-agent-keeps-making-the-same-mistakes-its-not-the-model-3pl" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left"&gt;
            &lt;div class="multiple_reactions_aggregate"&gt;
              &lt;span class="multiple_reactions_icons_container"&gt;
                  &lt;span class="crayons_icon_container"&gt;
                    &lt;img src="https://assets.dev.to/assets/exploding-head-daceb38d627e6ae9b730f36a1e390fca556a4289d5a41abb2c35068ad3e2c4b5.svg" width="24" height="24"&gt;
                  &lt;/span&gt;
                  &lt;span class="crayons_icon_container"&gt;
                    &lt;img src="https://assets.dev.to/assets/multi-unicorn-b44d6f8c23cdd00964192bedc38af3e82463978aa611b4365bd33a0f1f4f3e97.svg" width="24" height="24"&gt;
                  &lt;/span&gt;
                  &lt;span class="crayons_icon_container"&gt;
                    &lt;img src="https://assets.dev.to/assets/sparkle-heart-5f9bee3767e18deb1bb725290cb151c25234768a0e9a2bd39370c382d02920cf.svg" width="24" height="24"&gt;
                  &lt;/span&gt;
              &lt;/span&gt;
              &lt;span class="aggregate_reactions_counter"&gt;5&lt;span class="hidden s:inline"&gt; reactions&lt;/span&gt;&lt;/span&gt;
            &lt;/div&gt;
          &lt;/a&gt;
            &lt;a href="https://dev.to/dtothemoon/why-your-ai-agent-keeps-making-the-same-mistakes-its-not-the-model-3pl#comments" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left flex items-center"&gt;
              Comments


              &lt;span class="hidden s:inline"&gt;Add Comment&lt;/span&gt;
            &lt;/a&gt;
        &lt;/div&gt;
        &lt;div class="crayons-story__save"&gt;
          &lt;small class="crayons-story__tertiary fs-xs mr-2"&gt;
            3 min read
          &lt;/small&gt;
            
              &lt;span class="bm-initial"&gt;
                

              &lt;/span&gt;
              &lt;span class="bm-success"&gt;
                

              &lt;/span&gt;
            
        &lt;/div&gt;
      &lt;/div&gt;
    &lt;/div&gt;
  &lt;/div&gt;
&lt;/div&gt;

&lt;/div&gt;


</description>
    </item>
    <item>
      <title>Why Your AI Agent Keeps Making the Same Mistakes (It's Not the Model)</title>
      <dc:creator>DtoTHEmoon</dc:creator>
      <pubDate>Wed, 27 May 2026 23:28:49 +0000</pubDate>
      <link>https://dev.to/dtothemoon/why-your-ai-agent-keeps-making-the-same-mistakes-its-not-the-model-3pl</link>
      <guid>https://dev.to/dtothemoon/why-your-ai-agent-keeps-making-the-same-mistakes-its-not-the-model-3pl</guid>
      <description>&lt;p&gt;Does this sound familiar?&lt;/p&gt;

&lt;p&gt;Your AI just fixed a bug. Two weeks later, the exact same bug is back.&lt;/p&gt;

&lt;p&gt;You deploy something, and you have no idea if it actually worked — so you manually test it.&lt;/p&gt;

&lt;p&gt;You've written 100 lines of rules in your config file, but the AI still ignores half of them.&lt;/p&gt;

&lt;p&gt;Every new chat session, you re-explain the same context from scratch.&lt;/p&gt;

&lt;p&gt;I ran into all four of these problems while building an internal AI quoting system for a healthcare company — with no technical background. And after months of debugging, I realized: &lt;strong&gt;none of these were model problems. They were Harness problems.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  What is Harness Engineering?
&lt;/h2&gt;

&lt;p&gt;Harness Engineering is the discipline of building the scaffolding around your AI — the rules, constraints, verification scripts, and knowledge structures that make it produce consistent, reliable output.&lt;/p&gt;

&lt;p&gt;Without Harness, even the best model will drift, forget, and repeat the same mistakes.&lt;/p&gt;

&lt;p&gt;The data backs this up: research shows that &lt;strong&gt;80% of Agent quality failures come from Harness gaps, not model limitations&lt;/strong&gt;. And in one benchmark, the same 15 models all improved significantly when only the Harness changed — not the models themselves.&lt;/p&gt;

&lt;p&gt;The problem is: most people don't know what their Harness is missing. They just know something feels broken.&lt;/p&gt;




&lt;h2&gt;
  
  
  The framework: two dimensions, not six steps
&lt;/h2&gt;

&lt;p&gt;After studying real production failures and building my own system from scratch, I organized Harness Engineering into two dimensions.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Vertical Quality Layers (Q) — required for every project&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Layer&lt;/th&gt;
&lt;th&gt;Name&lt;/th&gt;
&lt;th&gt;What it solves&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Q1&lt;/td&gt;
&lt;td&gt;SPEC&lt;/td&gt;
&lt;td&gt;AI knows what to build, what not to, and how to verify&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Q2&lt;/td&gt;
&lt;td&gt;Rules + Security&lt;/td&gt;
&lt;td&gt;Hard business limits + security red lines, equally mandatory&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Q3&lt;/td&gt;
&lt;td&gt;Skills&lt;/td&gt;
&lt;td&gt;Repetitive workflows standardized with counter-examples&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Q4&lt;/td&gt;
&lt;td&gt;Scripts (unified gate)&lt;/td&gt;
&lt;td&gt;Nothing is "done" until scripts pass&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;Horizontal Scale Layers (S) — enable only when needed&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Layer&lt;/th&gt;
&lt;th&gt;Name&lt;/th&gt;
&lt;th&gt;When to enable&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;S1&lt;/td&gt;
&lt;td&gt;Context&lt;/td&gt;
&lt;td&gt;Sessions losing coherence after ~20 turns&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;S2&lt;/td&gt;
&lt;td&gt;dev-map + Memory&lt;/td&gt;
&lt;td&gt;Project iterating 2+ months, AI re-inventing solutions&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;S3&lt;/td&gt;
&lt;td&gt;Multi-Agent&lt;/td&gt;
&lt;td&gt;Single agent consistently failing on long task chains&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The key insight: &lt;strong&gt;Q4 is not step four. It's the exit gate for every layer.&lt;/strong&gt; Code changes, doc updates, multi-agent outputs — all must pass Q4 before anything counts as done.&lt;/p&gt;

&lt;p&gt;Most people skip Q4 entirely. That's why the same bug keeps coming back.&lt;/p&gt;




&lt;h2&gt;
  
  
  What I built: Rein
&lt;/h2&gt;

&lt;p&gt;Rein is an open-source Skill for Claude Code (and any agent supporting the SKILL.md standard) that acts as a silent Harness Engineering advisor throughout your project.&lt;/p&gt;

&lt;p&gt;It watches your conversations for patterns — not keywords — and speaks up only when it detects a real gap. When everything's fine, it stays silent. &lt;strong&gt;Silence is a feature.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;What it detects automatically:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Repeated failures (same bug fixed twice → missing Rule or regression test)&lt;/li&gt;
&lt;li&gt;Context loss (re-explaining background every session → incomplete project docs)&lt;/li&gt;
&lt;li&gt;Scale shifts (internal tool going external → time to harden your Harness)&lt;/li&gt;
&lt;li&gt;Cost spikes (API bill climbing → identifies token waste sources)&lt;/li&gt;
&lt;li&gt;Over-engineering (more config, slower shipping → tells you what to delete)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Test results: 97% pass rate across 16 scenarios with Rein vs 52% without.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The biggest gap was in root cause diagnosis: 92% accuracy with Rein, 24% without.&lt;/p&gt;




&lt;h2&gt;
  
  
  A real example from my project
&lt;/h2&gt;

&lt;p&gt;My &lt;code&gt;verify.sh&lt;/code&gt; only checked if the service started. It didn't check if the business logic was correct.&lt;/p&gt;

&lt;p&gt;So when the AI "fixed" a pricing calculation bug, it passed my verification — service was running — but the actual calculation was still wrong. Same bug, two weeks later.&lt;/p&gt;

&lt;p&gt;After adding a business baseline check (call a known correct quote request, compare against expected output), that class of bug disappeared entirely.&lt;/p&gt;

&lt;p&gt;This is Q4. Not just "is the service alive?" but "&lt;strong&gt;is the output actually correct?&lt;/strong&gt;"&lt;/p&gt;




&lt;h2&gt;
  
  
  Install
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;git clone https://github.com/DtoTHEmoon/rein-skill.git ~/.claude/skills/rein
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Restart your agent. Rein activates automatically — no commands needed.&lt;/p&gt;

&lt;p&gt;Also works with: OpenClaw, Codex CLI, Gemini CLI, Cursor, and any agent supporting SKILL.md.&lt;/p&gt;




&lt;h2&gt;
  
  
  The core philosophy
&lt;/h2&gt;

&lt;p&gt;Start minimal. Add only when you have a real pain point. And know when to subtract — Rein will tell you when your Harness is getting in your own way.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;If your scaffolding is slowing you down, it's time to cut.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;GitHub: &lt;a href="https://github.com/DtoTHEmoon/rein-skill" rel="noopener noreferrer"&gt;github.com/DtoTHEmoon/rein-skill&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>claude</category>
      <category>agentaichallenge</category>
      <category>chatgpt</category>
    </item>
  </channel>
</rss>
