<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Bruce He</title>
    <description>The latest articles on DEV Community by Bruce He (@bruce_he).</description>
    <link>https://dev.to/bruce_he</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3860271%2F7100377c-e1ef-4190-84bf-09cf0c8ad15b.jpg</url>
      <title>DEV Community: Bruce He</title>
      <link>https://dev.to/bruce_he</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/bruce_he"/>
    <language>en</language>
    <item>
      <title>Sub-Agent Architecture for AI Coding Harnesses: When to Spawn, How to Route, What It Costs</title>
      <dc:creator>Bruce He</dc:creator>
      <pubDate>Sun, 12 Apr 2026 16:46:30 +0000</pubDate>
      <link>https://dev.to/bruce_he/sub-agent-architecture-for-ai-coding-harnesses-when-to-spawn-how-to-route-what-it-costs-1bno</link>
      <guid>https://dev.to/bruce_he/sub-agent-architecture-for-ai-coding-harnesses-when-to-spawn-how-to-route-what-it-costs-1bno</guid>
      <description>&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://www.heyuan110.com/posts/ai/2026-04-13-harness-subagent-architecture/" rel="noopener noreferrer"&gt;my blog&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Sub-agents are not a parallel speed hack. They are a context garbage collection mechanism.&lt;/strong&gt; The point is to throw noise away, not to split thinking.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Most engineering teams reach for sub-agents the first time they hit a context window limit, or the first time a task feels "big." They fan out, parallelize, marvel at how fast things go — then spend the next month debugging why outputs keep drifting from each other. The failure mode is predictable: sub-agents that should have stayed in the main thread got fired off, and a single decision that needed shared working memory got split across three cold-started processes that never saw each other's evidence.&lt;/p&gt;

&lt;p&gt;This article gives you a decision framework, a concrete routing table across Opus / Sonnet / Haiku, and a cost model — so you stop spawning sub-agents by instinct and start spawning them for reasons you can name.&lt;/p&gt;

&lt;h2&gt;
  
  
  Three Myths That Burn Money
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Myth 1: "More sub-agents means faster completion."&lt;/strong&gt; Every spawn carries cold-start overhead — system prompt re-tokenization, CLAUDE.md reload, tool schemas re-injected. If your sub-agent only does 2,000 tokens of real work, the overhead can exceed the work itself. Break-even sits at roughly &lt;strong&gt;10,000 input tokens per spawn&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Myth 2: "Sub-agents should always use the cheapest model."&lt;/strong&gt; Route by &lt;em&gt;decision complexity&lt;/em&gt;, not input volume. Haiku reading 100K tokens of logs to emit a 200-token classification is great. Haiku writing 2,000 lines of production code is malpractice.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Myth 3: "The orchestrator should be the smartest model."&lt;/strong&gt; The most expensive mistake. Orchestration is mostly routing and state tracking — Sonnet or even Haiku handles it fine. Save Opus for the final generation step where judgment compounds.&lt;/p&gt;

&lt;p&gt;I restructured my own blog pipeline from "Opus orchestrator + Sonnet workers" to "Sonnet orchestrator + Opus writer + Haiku searchers." End-to-end token cost dropped by ~60% and output quality measurably improved, because Opus was finally being used where it mattered.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Mental Model
&lt;/h2&gt;

&lt;p&gt;A sub-agent is a fresh heap. You fork an isolated process, let it churn through whatever mess it needs (reading 20 files, running grep, inspecting logs), extract a compact summary, and let the whole thing get garbage-collected. The parent agent never sees the mess.&lt;/p&gt;

&lt;p&gt;The primary use cases all share one shape: &lt;strong&gt;high input, low output, stateless&lt;/strong&gt;. Codebase search. Doc triage. Log analysis. The sub-agent's job is to summarize and throw away.&lt;/p&gt;

&lt;p&gt;Conversely, this framing tells you when &lt;em&gt;not&lt;/em&gt; to use a sub-agent. If the "waste" you would throw away is actually load-bearing context the main agent needs downstream, sub-agents cost you more than they save.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.heyuan110.com/posts/ai/2026-04-13-harness-subagent-architecture/" rel="noopener noreferrer"&gt;Read the full article →&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The full version covers:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Three architecture patterns&lt;/strong&gt; — Fan-Out / Gather, Scout-Then-Act, Specialist Delegation — with mermaid diagrams and concrete failure modes for each&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;A complete Opus / Sonnet / Haiku routing table&lt;/strong&gt; with worked examples by task type&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;A sub-agent cost formula&lt;/strong&gt; and the 0.3 overhead-threshold metric for when sub-agents actually pay for themselves&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cognition's Devin post-mortem&lt;/strong&gt; and what it teaches about multi-agent drift&lt;/li&gt;
&lt;li&gt;A real 60% cost reduction case study from rewiring my own writing pipeline&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is &lt;strong&gt;Part 3 of the &lt;a href="https://www.heyuan110.com/posts/ai/2026-03-30-harness-engineering-guide/" rel="noopener noreferrer"&gt;Harness Engineering series&lt;/a&gt;&lt;/strong&gt;. Part 1 framed the thesis (Agent = Model + Harness). &lt;a href="https://www.heyuan110.com/posts/ai/2026-03-31-harness-claudemd-guide/" rel="noopener noreferrer"&gt;Part 2&lt;/a&gt; went deep on &lt;code&gt;CLAUDE.md&lt;/code&gt;. If you found this useful, &lt;a href="https://www.heyuan110.com/" rel="noopener noreferrer"&gt;the blog&lt;/a&gt; has more AI engineering deep-dives.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>claude</category>
      <category>productivity</category>
      <category>agents</category>
    </item>
    <item>
      <title>Claude Pricing 2026: Complete Guide to Free, Pro, Max &amp; Team Plans</title>
      <dc:creator>Bruce He</dc:creator>
      <pubDate>Sat, 04 Apr 2026 03:33:13 +0000</pubDate>
      <link>https://dev.to/bruce_he/claude-pricing-2026-complete-guide-to-free-pro-max-team-plans-ghg</link>
      <guid>https://dev.to/bruce_he/claude-pricing-2026-complete-guide-to-free-pro-max-team-plans-ghg</guid>
      <description>&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-claude-pricing-complete-guide/" rel="noopener noreferrer"&gt;my blog&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Anthropic's Claude has become one of the most capable AI assistants available, but its pricing structure can be confusing. With five consumer tiers, two Max sub-tiers, a Team plan with mixed seat types, and a separate API — choosing the right plan takes real research.&lt;/p&gt;

&lt;p&gt;This guide breaks down every Claude pricing option available in April 2026, including exact costs, usage limits, and practical recommendations for different user types.&lt;/p&gt;

&lt;h2&gt;
  
  
  Quick Pricing Comparison
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Plan&lt;/th&gt;
&lt;th&gt;Price&lt;/th&gt;
&lt;th&gt;Usage vs Pro&lt;/th&gt;
&lt;th&gt;Best For&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Free&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;$0&lt;/td&gt;
&lt;td&gt;~0.2x&lt;/td&gt;
&lt;td&gt;Trying Claude, occasional questions&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Pro&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;$20/mo&lt;/td&gt;
&lt;td&gt;1x (baseline)&lt;/td&gt;
&lt;td&gt;Daily individual use&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Max 5x&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;$100/mo&lt;/td&gt;
&lt;td&gt;5x&lt;/td&gt;
&lt;td&gt;Power users, heavy Claude Code usage&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Max 20x&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;$200/mo&lt;/td&gt;
&lt;td&gt;20x&lt;/td&gt;
&lt;td&gt;Professional developers, near-unlimited&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Team&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;$25-30/seat/mo&lt;/td&gt;
&lt;td&gt;1x-6.25x&lt;/td&gt;
&lt;td&gt;Teams of 5-150&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Enterprise&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Custom&lt;/td&gt;
&lt;td&gt;Custom&lt;/td&gt;
&lt;td&gt;150+ seats, compliance needs&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-claude-pricing-complete-guide/" rel="noopener noreferrer"&gt;Read the full article →&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The full article covers:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Detailed breakdown of each plan's features and limitations&lt;/li&gt;
&lt;li&gt;Claude Code pricing (included plans vs API costs)&lt;/li&gt;
&lt;li&gt;API token pricing for Opus 4.6, Sonnet 4.6, and Haiku 4.5&lt;/li&gt;
&lt;li&gt;Cost optimization strategies for developers&lt;/li&gt;
&lt;li&gt;How to choose the right plan based on your usage&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If you found this useful, check out &lt;a href="https://www.heyuan110.com/" rel="noopener noreferrer"&gt;my blog&lt;/a&gt; for more AI engineering guides.&lt;/p&gt;

</description>
      <category>claude</category>
      <category>ai</category>
      <category>pricing</category>
      <category>programming</category>
    </item>
    <item>
      <title>Harness Engineering: Why the System Around Your AI Agent Matters More Than the Model</title>
      <dc:creator>Bruce He</dc:creator>
      <pubDate>Sat, 04 Apr 2026 01:45:58 +0000</pubDate>
      <link>https://dev.to/bruce_he/harness-engineering-why-the-system-around-your-ai-agent-matters-more-than-the-model-3kmk</link>
      <guid>https://dev.to/bruce_he/harness-engineering-why-the-system-around-your-ai-agent-matters-more-than-the-model-3kmk</guid>
      <description>&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-harness-engineering-guide/" rel="noopener noreferrer"&gt;heyuan110.com&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;In 2026, the AI engineering community discovered something counterintuitive: &lt;strong&gt;the model is the least important part of an AI agent&lt;/strong&gt;. What actually determines whether an agent succeeds or fails in production is everything &lt;em&gt;around&lt;/em&gt; the model — the tools it can access, the guardrails that keep it safe, the feedback loops that help it self-correct.&lt;/p&gt;

&lt;p&gt;This "everything around the model" now has a name: the &lt;strong&gt;harness&lt;/strong&gt;. And the discipline of building it is called &lt;strong&gt;harness engineering&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;OpenAI's Codex team used harness engineering to ship over 1 million lines of production code written entirely by AI agents. LangChain jumped from #30 to #5 on TerminalBench 2.0 by changing only their harness. A Stanford HAI study found harness-level changes improved output quality by 28-47%, while prompt refinement improved quality by less than 3%.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;This guide covers:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The three evolutions: Prompt → Context → Harness Engineering&lt;/li&gt;
&lt;li&gt;Core formula: Agent = Model + Harness&lt;/li&gt;
&lt;li&gt;Guides (feedforward) + Sensors (feedback) framework&lt;/li&gt;
&lt;li&gt;Real cases: OpenAI Codex, LangChain, Stripe Minions&lt;/li&gt;
&lt;li&gt;5-level practical implementation guide&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-harness-engineering-guide/" rel="noopener noreferrer"&gt;Read the full article →&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If you found this useful, check out &lt;a href="https://www.heyuan110.com/" rel="noopener noreferrer"&gt;my blog&lt;/a&gt; for more AI engineering guides.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>agents</category>
      <category>engineering</category>
      <category>productivity</category>
    </item>
    <item>
      <title>Seedance 2.0 Deep Dive: ByteDance AI Video That Tops Sora and Veo</title>
      <dc:creator>Bruce He</dc:creator>
      <pubDate>Sat, 04 Apr 2026 01:45:38 +0000</pubDate>
      <link>https://dev.to/bruce_he/seedance-20-deep-dive-bytedance-ai-video-that-tops-sora-and-veo-3hn9</link>
      <guid>https://dev.to/bruce_he/seedance-20-deep-dive-bytedance-ai-video-that-tops-sora-and-veo-3hn9</guid>
      <description>&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-seedance-2-bytedance-ai-video/" rel="noopener noreferrer"&gt;heyuan110.com&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;In February 2026, ByteDance released &lt;strong&gt;Seedance 2.0&lt;/strong&gt;. Within weeks, it hit &lt;strong&gt;#1 on the Artificial Analysis text-to-video leaderboard&lt;/strong&gt; — beating Google Veo 3, OpenAI Sora 2, and Runway Gen-4.5 in blind human evaluation.&lt;/p&gt;

&lt;p&gt;If you are reading this from outside China, you have probably heard the buzz but face a wall of confusion: What is Dreamina? What is VolcEngine? Can you even sign up without a Chinese phone number?&lt;/p&gt;

&lt;p&gt;This guide is written specifically for international users. It covers the technical architecture in depth (why joint audio-video generation is a real breakthrough), gives an honest assessment of what works and what does not, provides a step-by-step access guide, and explains the IP controversy.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key findings:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Joint audio-video generation produces the most natural lip sync of any model&lt;/li&gt;
&lt;li&gt;Multi-reference input (up to 12 files) enables director-level control&lt;/li&gt;
&lt;li&gt;2K max resolution is a limitation vs Kling 3.0's 4K@60fps&lt;/li&gt;
&lt;li&gt;~$0.14 per 15-second clip — 5-10x cheaper than competitors&lt;/li&gt;
&lt;li&gt;CapCut integration gives it the largest distribution platform of any AI video model&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-seedance-2-bytedance-ai-video/" rel="noopener noreferrer"&gt;Read the full article →&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If you found this useful, check out &lt;a href="https://www.heyuan110.com/" rel="noopener noreferrer"&gt;my blog&lt;/a&gt; for more AI engineering guides.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>video</category>
      <category>bytedance</category>
      <category>machinelearning</category>
    </item>
    <item>
      <title>Cursor Composer 2: The Kimi K2.5 Controversy and What It Means</title>
      <dc:creator>Bruce He</dc:creator>
      <pubDate>Sat, 04 Apr 2026 01:45:17 +0000</pubDate>
      <link>https://dev.to/bruce_he/cursor-composer-2-the-kimi-k25-controversy-and-what-it-means-2061</link>
      <guid>https://dev.to/bruce_he/cursor-composer-2-the-kimi-k25-controversy-and-what-it-means-2061</guid>
      <description>&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-cursor-composer-2-review/" rel="noopener noreferrer"&gt;heyuan110.com&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;On March 19, Cursor shipped Composer 2 with a triumphant blog post. Three days later, a developer found &lt;code&gt;kimi-k2p5-rl-0317-s515-fast&lt;/code&gt; in the API config. That single string unraveled a story about transparency, open-source ethics, and the global nature of AI infrastructure.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key findings:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Composer 2 is built on Moonshot AI's Kimi K2.5 (Chinese open-source MoE model)&lt;/li&gt;
&lt;li&gt;Cursor's "75% of compute was ours" defense doesn't hold up&lt;/li&gt;
&lt;li&gt;CursorBench scores (61.3) are home-field advantage; Terminal-Bench gap vs Claude is only 3.7 points&lt;/li&gt;
&lt;li&gt;At $0.50/M input tokens, Composer 2 is 30x cheaper than Opus 4.6&lt;/li&gt;
&lt;li&gt;Most productive devs use both: Cursor for 80% daily tasks, Claude Code for 20% complex work&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-cursor-composer-2-review/" rel="noopener noreferrer"&gt;Read the full article →&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If you found this useful, check out &lt;a href="https://www.heyuan110.com/" rel="noopener noreferrer"&gt;my blog&lt;/a&gt; for more AI engineering guides.&lt;/p&gt;

</description>
      <category>cursor</category>
      <category>ai</category>
      <category>coding</category>
      <category>opensource</category>
    </item>
    <item>
      <title>MCP vs Skills vs Hooks in Claude Code: Which Extension Do You Need?</title>
      <dc:creator>Bruce He</dc:creator>
      <pubDate>Sat, 04 Apr 2026 01:45:04 +0000</pubDate>
      <link>https://dev.to/bruce_he/mcp-vs-skills-vs-hooks-in-claude-code-which-extension-do-you-need-3b8i</link>
      <guid>https://dev.to/bruce_he/mcp-vs-skills-vs-hooks-in-claude-code-which-extension-do-you-need-3b8i</guid>
      <description>&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-mcp-vs-skills-claude-code/" rel="noopener noreferrer"&gt;heyuan110.com&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Claude Code has three distinct extension mechanisms: &lt;strong&gt;MCP&lt;/strong&gt; (Model Context Protocol), &lt;strong&gt;Skills&lt;/strong&gt;, and &lt;strong&gt;Hooks&lt;/strong&gt;. They look related on the surface, but they operate at fundamentally different layers:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Hooks&lt;/strong&gt; (bottom layer): Lifecycle event automation — "what must always happen"&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;MCP&lt;/strong&gt; (middle layer): External tool connections via open protocol — "what can be done"&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Skills&lt;/strong&gt; (top layer): Reusable workflows and domain knowledge — "how to do things well"&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;This guide covers:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Three-layer architecture diagram&lt;/li&gt;
&lt;li&gt;Side-by-side comparison across 8 dimensions&lt;/li&gt;
&lt;li&gt;Same task implemented three different ways&lt;/li&gt;
&lt;li&gt;Decision framework: when to use which&lt;/li&gt;
&lt;li&gt;Common mistakes and how to avoid them&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-mcp-vs-skills-claude-code/" rel="noopener noreferrer"&gt;Read the full article →&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If you found this useful, check out &lt;a href="https://www.heyuan110.com/" rel="noopener noreferrer"&gt;my blog&lt;/a&gt; for more AI engineering guides.&lt;/p&gt;

</description>
      <category>claudecode</category>
      <category>ai</category>
      <category>mcp</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>OpenClaw Multi-Agent Configuration: Architecture and Production Patterns</title>
      <dc:creator>Bruce He</dc:creator>
      <pubDate>Sat, 04 Apr 2026 01:44:28 +0000</pubDate>
      <link>https://dev.to/bruce_he/openclaw-multi-agent-configuration-architecture-and-production-patterns-pe9</link>
      <guid>https://dev.to/bruce_he/openclaw-multi-agent-configuration-architecture-and-production-patterns-pe9</guid>
      <description>&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-openclaw-multi-agent-setup-guide/" rel="noopener noreferrer"&gt;heyuan110.com&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Your single OpenClaw agent worked great for two weeks. Then it started hallucinating project context into unrelated conversations, confusing coding tasks with writing tasks, and taking 15 seconds to respond because its memory index had grown to 200MB.&lt;/p&gt;

&lt;p&gt;The problem is not the model. The problem is architectural: &lt;strong&gt;one agent cannot hold unlimited context domains without degradation&lt;/strong&gt;. The solution is multiple specialized agents with isolated workspaces.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;This guide covers:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Why multi-agent (the single-agent ceiling)&lt;/li&gt;
&lt;li&gt;Agent creation and model routing configuration&lt;/li&gt;
&lt;li&gt;Binding-based routing (most-specific-wins priority)&lt;/li&gt;
&lt;li&gt;Agent-to-agent communication via sessions_send&lt;/li&gt;
&lt;li&gt;Four production patterns: Supervisor, Router, Pipeline, Parallel&lt;/li&gt;
&lt;li&gt;Cost optimization strategies&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-openclaw-multi-agent-setup-guide/" rel="noopener noreferrer"&gt;Read the full article →&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If you found this useful, check out &lt;a href="https://www.heyuan110.com/" rel="noopener noreferrer"&gt;my blog&lt;/a&gt; for more AI engineering guides.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>agents</category>
      <category>opensource</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>How to Write CLAUDE.md Files That Actually Work (Harness Engineering #2)</title>
      <dc:creator>Bruce He</dc:creator>
      <pubDate>Sat, 04 Apr 2026 01:43:06 +0000</pubDate>
      <link>https://dev.to/bruce_he/how-to-write-claudemd-files-that-actually-work-harness-engineering-2-5a6m</link>
      <guid>https://dev.to/bruce_he/how-to-write-claudemd-files-that-actually-work-harness-engineering-2-5a6m</guid>
      <description>&lt;p&gt;&lt;em&gt;Originally published at &lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-harness-claudemd-guide/" rel="noopener noreferrer"&gt;heyuan110.com&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;This is &lt;strong&gt;Part 2&lt;/strong&gt; of the Harness Engineering series. Most CLAUDE.md files are bad — not because people don't try, but because they optimize for the wrong thing.&lt;/p&gt;

&lt;p&gt;ETH Zurich researchers tested 138 agentfiles across multiple AI coding agents. The results:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Human-written, concise (&amp;lt;60 lines): &lt;strong&gt;+4% success rate&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;LLM-generated, verbose (200+ lines): &lt;strong&gt;-3% success rate, +20% token cost&lt;/strong&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;LLM-generated files made agents &lt;em&gt;worse&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;This guide covers:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The 60-line principle: what to include, what to leave out&lt;/li&gt;
&lt;li&gt;Anti-pattern gallery (documentation dump, LLM manifesto, everything file)&lt;/li&gt;
&lt;li&gt;Progressive disclosure with Skills&lt;/li&gt;
&lt;li&gt;Templates for 3 project types (monorepo, API, frontend)&lt;/li&gt;
&lt;li&gt;How to measure if your CLAUDE.md is working&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.heyuan110.com/posts/ai/2026-04-04-harness-claudemd-guide/" rel="noopener noreferrer"&gt;Read the full article →&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If you found this useful, check out &lt;a href="https://www.heyuan110.com/" rel="noopener noreferrer"&gt;my blog&lt;/a&gt; for more AI engineering guides.&lt;/p&gt;

</description>
      <category>claudecode</category>
      <category>ai</category>
      <category>productivity</category>
      <category>engineering</category>
    </item>
  </channel>
</rss>
