<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Programming with Jack Chew</title>
    <description>The latest articles on DEV Community by Programming with Jack Chew (@programming_withjackche).</description>
    <link>https://dev.to/programming_withjackche</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3948462%2F62622315-576c-4dea-9153-4cda4503f812.jpg</url>
      <title>DEV Community: Programming with Jack Chew</title>
      <link>https://dev.to/programming_withjackche</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/programming_withjackche"/>
    <language>en</language>
    <item>
      <title>2026 Q1 is the year developers still build the agent harness. 2026 Q3 / 2027 is the year the LLM builds its own harness.</title>
      <dc:creator>Programming with Jack Chew</dc:creator>
      <pubDate>Sun, 24 May 2026 03:12:16 +0000</pubDate>
      <link>https://dev.to/programming_withjackche/2026-q1-is-the-year-developers-still-build-the-agent-harness-2026-q3-2027-is-the-year-the-llm-359f</link>
      <guid>https://dev.to/programming_withjackche/2026-q1-is-the-year-developers-still-build-the-agent-harness-2026-q3-2027-is-the-year-the-llm-359f</guid>
      <description>&lt;p&gt;2026 Q1 is the year developers still build the agent harness.&lt;/p&gt;

&lt;p&gt;2026 Q3 / 2027 is the year the LLM builds its own harness.&lt;/p&gt;

&lt;p&gt;Today, AI coding agents such as Claude Code, Cursor, Codex, Gemini CLI, and Aider all depend on the same hidden layer:&lt;/p&gt;

&lt;p&gt;the files that brief the agent before it starts work.&lt;/p&gt;

&lt;p&gt;&lt;code&gt;AGENTS.md&lt;/code&gt;&lt;br&gt;
&lt;code&gt;CLAUDE.md&lt;/code&gt;&lt;br&gt;
&lt;code&gt;.cursor/rules&lt;/code&gt;&lt;br&gt;
&lt;code&gt;SKILLS/&lt;/code&gt;&lt;br&gt;
MCP server lists&lt;br&gt;
memory schemas&lt;br&gt;
test commands&lt;br&gt;
lint commands&lt;br&gt;
“Do not touch these paths.”&lt;br&gt;
“Require human approval before this.”&lt;/p&gt;

&lt;p&gt;Different IDE, same boilerplate.&lt;br&gt;
Different repo, same boilerplate.&lt;br&gt;
Different agent, same boilerplate.&lt;/p&gt;

&lt;p&gt;That is the agent harness problem.&lt;/p&gt;
&lt;h2&gt;
  
  
  The hidden work behind AI coding agents
&lt;/h2&gt;

&lt;p&gt;Most people talk about the coding agent itself.&lt;/p&gt;

&lt;p&gt;But in practice, the quality of an AI coding session often depends on the context layer around the agent.&lt;/p&gt;

&lt;p&gt;Before the agent starts coding, it needs to know:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;what kind of project this is&lt;/li&gt;
&lt;li&gt;what framework it uses&lt;/li&gt;
&lt;li&gt;what files are important&lt;/li&gt;
&lt;li&gt;what commands run tests&lt;/li&gt;
&lt;li&gt;what commands run linting&lt;/li&gt;
&lt;li&gt;what paths should not be touched&lt;/li&gt;
&lt;li&gt;what tools are available&lt;/li&gt;
&lt;li&gt;what memory should persist&lt;/li&gt;
&lt;li&gt;what failure modes to avoid&lt;/li&gt;
&lt;li&gt;what coding conventions to follow&lt;/li&gt;
&lt;li&gt;when human approval is required&lt;/li&gt;
&lt;/ul&gt;
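
&lt;p&gt;As a rough illustration of that checklist, here is a minimal Python sketch that keeps a few of those facts in one structure and renders them as a plain-text brief. The &lt;code&gt;HarnessBrief&lt;/code&gt; class, its field names, the sample values, and the output layout are all hypothetical; this is not how any particular agent or tool stores its context.&lt;/p&gt;

```python
from dataclasses import dataclass, field


@dataclass
class HarnessBrief:
    """Hypothetical container for the facts an agent reads before work."""

    project_type: str
    test_command: str
    lint_command: str
    protected_paths: list[str] = field(default_factory=list)
    approval_required_for: list[str] = field(default_factory=list)

    def render(self) -> str:
        """Render the brief as simple Markdown-style text."""
        lines = [
            "# Agent brief",
            f"Project type: {self.project_type}",
            f"Run tests with: {self.test_command}",
            f"Run linting with: {self.lint_command}",
            "Do not touch:",
        ]
        lines += [f"- {path}" for path in self.protected_paths]
        lines.append("Ask a human before:")
        lines += [f"- {action}" for action in self.approval_required_for]
        return "\n".join(lines)


# Sample values are made up for illustration only.
brief = HarnessBrief(
    project_type="python-cli-app",
    test_command="pytest",
    lint_command="ruff check .",
    protected_paths=["migrations/", ".env"],
    approval_required_for=["deleting files", "changing dependencies"],
)
print(brief.render())
```

&lt;p&gt;Keeping the facts in one place and rendering them on demand is one way to avoid hand-copying the same rules into several agent-specific files.&lt;/p&gt;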

&lt;p&gt;Without this layer, even strong coding agents can make subtle mistakes.&lt;/p&gt;

&lt;p&gt;With this layer, the same agent can behave much more consistently.&lt;/p&gt;

&lt;p&gt;That layer is what I call the harness.&lt;/p&gt;
&lt;h2&gt;
  
  
  Why this still exists in 2026
&lt;/h2&gt;

&lt;p&gt;In theory, the LLM should be able to inspect a repo and generate all of this itself.&lt;/p&gt;

&lt;p&gt;In practice, we are not fully there yet.&lt;/p&gt;

&lt;p&gt;The models are smart enough to do real coding work, but they cannot yet be relied on to produce accurate project-specific ground truth from scratch on every fresh repo.&lt;/p&gt;

&lt;p&gt;They can do it sometimes.&lt;/p&gt;

&lt;p&gt;Not always.&lt;/p&gt;

&lt;p&gt;So the human stays in the loop.&lt;/p&gt;

&lt;p&gt;We write the same repo instructions again.&lt;/p&gt;

&lt;p&gt;We copy the same rules across projects.&lt;/p&gt;

&lt;p&gt;We maintain separate files for Claude Code, Cursor, Codex-style agents, Continue, Windsurf, and others.&lt;/p&gt;

&lt;p&gt;Small work per repo.&lt;/p&gt;

&lt;p&gt;Painful in aggregate.&lt;/p&gt;
&lt;h2&gt;
  
  
  The future: self-generating harnesses
&lt;/h2&gt;

&lt;p&gt;I think this is temporary.&lt;/p&gt;

&lt;p&gt;Soon, the coding model should be able to:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;read the repo&lt;/li&gt;
&lt;li&gt;understand the task&lt;/li&gt;
&lt;li&gt;detect the project type&lt;/li&gt;
&lt;li&gt;generate the right harness&lt;/li&gt;
&lt;li&gt;connect the right tools&lt;/li&gt;
&lt;li&gt;create memory schemas&lt;/li&gt;
&lt;li&gt;write validation scripts&lt;/li&gt;
&lt;li&gt;refine the loop until the task is complete&lt;/li&gt;
&lt;/ol&gt;
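
&lt;p&gt;Steps 1 and 3 are the easiest to picture. Below is a deliberately naive Python sketch that guesses the project type from marker files at the repo root. The &lt;code&gt;MARKERS&lt;/code&gt; table and the &lt;code&gt;detect_project_types&lt;/code&gt; function are hypothetical names for illustration, and this is not a description of how harnessforge or any coding model actually detects project types.&lt;/p&gt;

```python
from pathlib import Path

# Hypothetical marker files; a real detector would need many more signals.
MARKERS = {
    "pyproject.toml": "python",
    "package.json": "node",
    "Cargo.toml": "rust",
    "go.mod": "go",
}


def detect_project_types(repo_root: Path) -> list[str]:
    """Return the project types whose marker file exists at the repo root."""
    return [kind for name, kind in MARKERS.items() if (repo_root / name).is_file()]


if __name__ == "__main__":
    print(detect_project_types(Path(".")))
```

&lt;p&gt;A lookup table like this is deterministic but shallow. The harder steps in the list, such as choosing tools and writing validation scripts, are where model judgment would have to take over.&lt;/p&gt;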

&lt;p&gt;At that point, the harness layer disappears as a separately authored artifact.&lt;/p&gt;

&lt;p&gt;But until then, developers still need a bridge.&lt;/p&gt;
&lt;h2&gt;
  
  
  I built harnessforge
&lt;/h2&gt;

&lt;p&gt;I built &lt;code&gt;harnessforge&lt;/code&gt; to test this idea.&lt;/p&gt;

&lt;p&gt;It is a local, open-source harness generator for AI coding agents.&lt;/p&gt;

&lt;p&gt;It is not another coding agent.&lt;/p&gt;

&lt;p&gt;Your coding agent stays the brain.&lt;/p&gt;

&lt;p&gt;&lt;code&gt;harnessforge&lt;/code&gt; just lays down the ground truth the agent reads before work begins.&lt;/p&gt;

&lt;p&gt;Run:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;uvx harnessforge init
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;or install:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;pip &lt;span class="nb"&gt;install &lt;/span&gt;harnessforge
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;It runs locally, with no network calls by default. In a few seconds, it inspects your repo and generates the startup files commonly read by AI coding agents.&lt;/p&gt;

&lt;h2&gt;
  
  
  What it generates
&lt;/h2&gt;

&lt;p&gt;Depending on the project and blueprint, &lt;code&gt;harnessforge&lt;/code&gt; can generate files such as:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;AGENTS.md
SOUL.md
TOOLS.md
MEMORY.md
SKILLS/
.claude/CLAUDE.md
.cursor/rules
.continue/
.windsurf/rules
blueprint-specific validators
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;
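
&lt;p&gt;To see which of these files a repo already has, a small check along these lines may help. It is only a sketch: &lt;code&gt;STARTUP_PATHS&lt;/code&gt; and &lt;code&gt;missing_startup_paths&lt;/code&gt; are hypothetical names, the paths are copied from the list above, and nothing here is part of harnessforge itself.&lt;/p&gt;

```python
from pathlib import Path

# Paths copied from the list above; which ones a given repo needs is up to you.
STARTUP_PATHS = [
    "AGENTS.md",
    ".claude/CLAUDE.md",
    ".cursor/rules",
    ".windsurf/rules",
]


def missing_startup_paths(repo_root: Path) -> list[str]:
    """Return the listed paths that do not exist under repo_root."""
    return [p for p in STARTUP_PATHS if not (repo_root / p).exists()]


if __name__ == "__main__":
    for path in missing_startup_paths(Path(".")):
        print(f"missing: {path}")
```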



&lt;p&gt;The goal is simple:&lt;/p&gt;

&lt;p&gt;give the coding agent a stronger starting point.&lt;/p&gt;

&lt;h2&gt;
  
  
  Current blueprints
&lt;/h2&gt;

&lt;p&gt;The current version includes these blueprints:&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;code&gt;rag-agent&lt;/code&gt;
&lt;/h3&gt;

&lt;p&gt;For retrieval systems, knowledge-base agents, citation enforcement, and grounded responses.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;code&gt;finance-agent&lt;/code&gt;
&lt;/h3&gt;

&lt;p&gt;For finance or stock-related agents, including market-data handling and validation rules for safe trade execution.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;code&gt;support-agent&lt;/code&gt;
&lt;/h3&gt;

&lt;p&gt;For customer support flows such as intent detection, knowledge-base lookup, ticket creation, escalation, and ticket lineage.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;code&gt;workflow-agent&lt;/code&gt;
&lt;/h3&gt;

&lt;p&gt;For multi-step orchestration with tool logs, idempotency, and validation structure.&lt;/p&gt;

&lt;h3&gt;
  
  
  &lt;code&gt;python-cli-app&lt;/code&gt;
&lt;/h3&gt;

&lt;p&gt;A default blueprint for greenfield Python CLI projects.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why this matters
&lt;/h2&gt;

&lt;p&gt;The important idea is not the specific files.&lt;/p&gt;

&lt;p&gt;The important idea is that coding agents need a reliable project-specific operating context.&lt;/p&gt;

&lt;p&gt;Today, we manually maintain that context.&lt;/p&gt;

&lt;p&gt;Tomorrow, the model may generate it automatically.&lt;/p&gt;

&lt;p&gt;&lt;code&gt;harnessforge&lt;/code&gt; is meant to sit in the middle.&lt;/p&gt;

&lt;p&gt;A bridge, not a moat.&lt;/p&gt;

&lt;p&gt;Use it now.&lt;/p&gt;

&lt;p&gt;Throw it away when the models catch up.&lt;/p&gt;

&lt;h2&gt;
  
  
  Example workflow
&lt;/h2&gt;

&lt;p&gt;First, run the generator inside the repo:&lt;/p&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;uvx harnessforge init
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Then open Claude Code, Cursor, Codex, Gemini CLI, Aider, or another coding agent inside the repo.&lt;/p&gt;

&lt;p&gt;The agent now has project-specific context files to read before it starts work.&lt;/p&gt;

&lt;p&gt;Instead of starting cold, with no project context, the agent starts with:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;project rules&lt;/li&gt;
&lt;li&gt;tool definitions&lt;/li&gt;
&lt;li&gt;memory structure&lt;/li&gt;
&lt;li&gt;validation expectations&lt;/li&gt;
&lt;li&gt;blueprint-specific failure modes&lt;/li&gt;
&lt;li&gt;agent-specific startup files&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The coding agent still writes the code.&lt;/p&gt;

&lt;p&gt;The harness just gives it the right context.&lt;/p&gt;

&lt;h2&gt;
  
  
  The bet
&lt;/h2&gt;

&lt;p&gt;My bet is:&lt;/p&gt;

&lt;p&gt;2026 Q1: developers still build the agent harness.&lt;/p&gt;

&lt;p&gt;2026 Q3 / 2027: the LLM builds its own harness.&lt;/p&gt;

&lt;p&gt;Until that happens, a local, deterministic harness generator can make AI coding workflows more reliable.&lt;/p&gt;

&lt;p&gt;GitHub:&lt;br&gt;
&lt;a href="https://github.com/jcaiagent7143-ui/harnessforge" rel="noopener noreferrer"&gt;https://github.com/jcaiagent7143-ui/harnessforge&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;PyPI:&lt;br&gt;
&lt;a href="https://pypi.org/project/harnessforge/" rel="noopener noreferrer"&gt;https://pypi.org/project/harnessforge/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;I would love feedback from developers using Claude Code, Cursor, Codex, Gemini CLI, Aider, Continue, Windsurf, or other coding agents in real repos.&lt;/p&gt;

&lt;p&gt;How are you managing your agent harness today?&lt;/p&gt;

&lt;p&gt;Are you manually maintaining &lt;code&gt;AGENTS.md&lt;/code&gt;, &lt;code&gt;CLAUDE.md&lt;/code&gt;, &lt;code&gt;.cursor/rules&lt;/code&gt;, MCP configs, memory files, and validation rules?&lt;/p&gt;

&lt;p&gt;Or do you think the next generation of coding models will generate this layer automatically?&lt;/p&gt;

</description>
      <category>agents</category>
      <category>ai</category>
      <category>llm</category>
      <category>tooling</category>
    </item>
  </channel>
</rss>
