<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: hideyoshi</title>
    <description>The latest articles on DEV Community by hideyoshi (@hideyoshi_th).</description>
    <link>https://dev.to/hideyoshi_th</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3835568%2Fea8ff786-bf7b-4f1f-986b-aa1d6d4c54d8.png</url>
      <title>DEV Community: hideyoshi</title>
      <link>https://dev.to/hideyoshi_th</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/hideyoshi_th"/>
    <language>en</language>
    <item>
      <title>I Built a Free CLI That Generates Production CLAUDE.md Configs in 30 Seconds</title>
      <dc:creator>hideyoshi</dc:creator>
      <pubDate>Tue, 24 Mar 2026 01:32:38 +0000</pubDate>
      <link>https://dev.to/hideyoshi_th/i-built-a-free-cli-that-generates-production-claudemd-configs-in-30-seconds-6g4</link>
      <guid>https://dev.to/hideyoshi_th/i-built-a-free-cli-that-generates-production-claudemd-configs-in-30-seconds-6g4</guid>
      <description>&lt;p&gt;Most CLAUDE.md files I see in the wild look like this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gh"&gt;# Rules&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; Use TypeScript
&lt;span class="p"&gt;-&lt;/span&gt; Write tests
&lt;span class="p"&gt;-&lt;/span&gt; Be helpful
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;That's not a configuration. That's a wish list.&lt;/p&gt;

&lt;p&gt;A real CLAUDE.md needs:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Autonomy boundaries&lt;/strong&gt; — what can the agent do without asking?&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Safety rules&lt;/strong&gt; — what must it never do?&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Skill modules&lt;/strong&gt; — how does it handle specific workflows?&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Multi-agent patterns&lt;/strong&gt; — how do parallel agents avoid conflicts?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Writing all of this from scratch takes hours. So I built a CLI that does it in 30 seconds.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;code&gt;npx hideyoshi init&lt;/code&gt;
&lt;/h2&gt;

&lt;p&gt;No install required. Just run it:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="nv"&gt;$ &lt;/span&gt;npx hideyoshi init

◆ Which AI coding tool?
│ ● Claude Code
│ ○ Cursor
│ ○ Windsurf
│ ○ Aider
│ ○ Multiple tools

◆ Primary framework?
│ ● Next.js

◆ Primary language?
│ ● TypeScript

◆ How many people work on this codebase?
│ ● 2-5 people

◆ How much autonomy should the agent have?
│ ● Moderate — commit freely, ask before deploy

◆ Will you run multiple AI agents &lt;span class="k"&gt;in &lt;/span&gt;parallel?
│ ● Yes

◆ Which skill modules &lt;span class="k"&gt;do &lt;/span&gt;you want?
│ ◻ Code Review
│ ◻ Bug Triage &lt;span class="o"&gt;(&lt;/span&gt;verify-before-act&lt;span class="o"&gt;)&lt;/span&gt;
│ ◻ Deploy Guard
│ ◻ Content Writer
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;It generates a complete set of files:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;├── CLAUDE.md              # Main agent config
├── AGENTS.md              # Multi-agent safety rules
└── .agents/skills/
    ├── code-review/SKILL.md
    ├── bug-triage/SKILL.md
    └── deploy-guard/SKILL.md
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  What the generated CLAUDE.md looks like
&lt;/h2&gt;

&lt;p&gt;Here's a snippet from what it produces for a moderate-autonomy Next.js team:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gh"&gt;# Project Agent Configuration&lt;/span&gt;

You are the development agent for this project.

&lt;span class="gu"&gt;## Autonomy Boundaries&lt;/span&gt;

&lt;span class="gu"&gt;### Do without asking&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; Run tests and linting
&lt;span class="p"&gt;-&lt;/span&gt; Format code
&lt;span class="p"&gt;-&lt;/span&gt; Create commits within scope
&lt;span class="p"&gt;-&lt;/span&gt; Install dependencies (retry once on failure)

&lt;span class="gu"&gt;### Ask before doing&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; Deploy to production
&lt;span class="p"&gt;-&lt;/span&gt; Modify CI/CD pipelines
&lt;span class="p"&gt;-&lt;/span&gt; Change environment variables
&lt;span class="p"&gt;-&lt;/span&gt; Delete files outside current task scope

&lt;span class="gu"&gt;## Coding Standards&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; TypeScript (ESM) is the standard
&lt;span class="p"&gt;-&lt;/span&gt; Next.js App Router conventions
&lt;span class="p"&gt;-&lt;/span&gt; Verify builds pass before pushing
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;It's tailored to your tool, framework, team size, and autonomy level. Not a generic template.&lt;/p&gt;

&lt;h2&gt;
  
  
  Multi-agent safety (the part most people skip)
&lt;/h2&gt;

&lt;p&gt;If you said "yes" to parallel agents, the CLI generates an &lt;code&gt;AGENTS.md&lt;/code&gt; with rules like:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gu"&gt;## Multi-Agent Safety&lt;/span&gt;
&lt;span class="p"&gt;
-&lt;/span&gt; Never use &lt;span class="sb"&gt;`git stash`&lt;/span&gt; (can destroy other agents' work)
&lt;span class="p"&gt;-&lt;/span&gt; Never switch branches unless explicitly instructed
&lt;span class="p"&gt;-&lt;/span&gt; Only stage YOUR changes — never &lt;span class="sb"&gt;`git add -A`&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; Run &lt;span class="sb"&gt;`git pull --rebase`&lt;/span&gt; before pushing
&lt;span class="p"&gt;-&lt;/span&gt; Ignore unfamiliar files (they belong to another agent)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;These rules exist because I learned each one from a real incident. Force pushes, corrupted indexes, deleted configs — all from agents stepping on each other.&lt;/p&gt;

&lt;h2&gt;
  
  
  Skill modules
&lt;/h2&gt;

&lt;p&gt;Skills are modular behavior files that teach your agent domain-specific workflows. The CLI generates them based on your selection:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Code Review skill&lt;/strong&gt; — teaches the agent to check for security issues, style violations, and test coverage before approving.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Bug Triage skill&lt;/strong&gt; — enforces "verify before act": reproduce the bug, identify the root cause (file + line), then fix. No guessing.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Deploy Guard skill&lt;/strong&gt; — pre-deploy checklist: tests pass, no console errors, build succeeds, env vars present.&lt;/p&gt;

&lt;p&gt;Each skill has a clear scope, specific do/don't rules, and an escalation path.&lt;/p&gt;

&lt;h2&gt;
  
  
  The free CLI vs. the full Playbook
&lt;/h2&gt;

&lt;p&gt;The CLI generates &lt;strong&gt;starter configs&lt;/strong&gt; — enough to get going. The &lt;a href="https://hideyoshi.app/playbook" rel="noopener noreferrer"&gt;full Playbook&lt;/a&gt; includes:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;/th&gt;
&lt;th&gt;Free CLI&lt;/th&gt;
&lt;th&gt;Playbook&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;CLAUDE.md template&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;5+ (tool-specific)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Skill modules&lt;/td&gt;
&lt;td&gt;Up to 4&lt;/td&gt;
&lt;td&gt;10+&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Multi-agent patterns&lt;/td&gt;
&lt;td&gt;Basic rules&lt;/td&gt;
&lt;td&gt;Full conflict resolution&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Safety checklists&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;td&gt;Security + governance&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Real-world examples&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;td&gt;3 complete configs&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Trust spectrum framework&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;td&gt;Full chapter&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The CLI is genuinely useful on its own. The Playbook is for teams that want the complete system.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Launch week: 50% off with code &lt;code&gt;LAUNCH50&lt;/code&gt; (expires March 28)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://hideyoshi.app/playbook" rel="noopener noreferrer"&gt;hideyoshi.app/playbook&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;Try it: &lt;code&gt;npx hideyoshi init&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;Questions? Find me at &lt;a href="https://twitter.com/hideyoshi_th" rel="noopener noreferrer"&gt;@hideyoshi_th&lt;/a&gt;.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>productivity</category>
      <category>cli</category>
      <category>programming</category>
    </item>
    <item>
      <title>What 20+ Production CLAUDE.md Files Actually Look Like</title>
      <dc:creator>hideyoshi</dc:creator>
      <pubDate>Sun, 22 Mar 2026 01:25:46 +0000</pubDate>
      <link>https://dev.to/hideyoshi_th/what-20-production-claudemd-files-actually-look-like-12od</link>
      <guid>https://dev.to/hideyoshi_th/what-20-production-claudemd-files-actually-look-like-12od</guid>
      <description>&lt;p&gt;Most CLAUDE.md advice online is vague. "Be specific." "Use TypeScript." "Follow best practices."&lt;/p&gt;

&lt;p&gt;None of that changes how your agent actually behaves. Here's what does: a structured system of files that work together.&lt;/p&gt;

&lt;p&gt;I'll walk you through the exact file structure we use in production.&lt;/p&gt;

&lt;h2&gt;
  
  
  The File Structure
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;playbook/
├── 01-foundations/
│   ├── constrained-autonomy-model.md
│   └── trust-levels.md
├── 02-configuration-templates/
│   ├── CLAUDE.md.template
│   ├── AGENTS.md.template
│   ├── cursorrules.template
│   ├── aider-conventions.template
│   └── windsurf-rules.template
├── 03-skill-system/
│   ├── SKILL-TEMPLATE.md
│   ├── skill-design-checklist.md
│   └── examples/
│       ├── code-review.md
│       ├── deploy-guard.md
│       ├── bug-triage.md
│       └── content-writer.md
├── 04-multi-agent-patterns/
│   ├── parallel-agents.md
│   ├── worktree-strategy.md
│   └── conflict-resolution.md
├── 05-safety-and-governance/
│   ├── security-checklist.md
│   ├── cost-control.md
│   └── audit-trail.md
└── 06-real-world-examples/
    ├── autonomous-business-agent.md
    ├── ci-cd-agent.md
    ├── customer-support-agent.md
    └── filled-claude-md.md
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;24 files. All Markdown. Every one of them exists because we hit a real problem without it.&lt;/p&gt;

&lt;h2&gt;
  
  
  1. The Constrained Autonomy Model
&lt;/h2&gt;

&lt;p&gt;This is the foundation. Everything else builds on it.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gu"&gt;## Constrained Autonomy&lt;/span&gt;

&lt;span class="gu"&gt;### Actions You May Take Without Approval&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; Run tests, format code, fix lint errors
&lt;span class="p"&gt;-&lt;/span&gt; Create commits within the current branch
&lt;span class="p"&gt;-&lt;/span&gt; Install dependencies from package.json
&lt;span class="p"&gt;-&lt;/span&gt; Read any file in the repository

&lt;span class="gu"&gt;### Actions That Require Approval&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; Push to main or release branches
&lt;span class="p"&gt;-&lt;/span&gt; Delete files or directories
&lt;span class="p"&gt;-&lt;/span&gt; Modify CI/CD pipeline configuration
&lt;span class="p"&gt;-&lt;/span&gt; Changes affecting billing or payments
&lt;span class="p"&gt;-&lt;/span&gt; Bulk operations affecting 5+ resources
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The key insight: you need two explicit lists. What the agent can do freely, and what needs a human check. Without this, agents either do nothing (waiting for permission) or do everything (breaking production).&lt;/p&gt;

&lt;p&gt;The "5+ resources" threshold came from an incident where an agent tried to close 50 GitHub issues in one operation. Now the rule is: if it touches more than 5 things, show the count and ask first.&lt;/p&gt;

&lt;h2&gt;
  
  
  2. Skill Modules
&lt;/h2&gt;

&lt;p&gt;Instead of one massive CLAUDE.md file, we split agent behaviors into separate skill files:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="nn"&gt;---&lt;/span&gt;
&lt;span class="na"&gt;name&lt;/span&gt;&lt;span class="pi"&gt;:&lt;/span&gt; &lt;span class="s"&gt;code-review&lt;/span&gt;
&lt;span class="na"&gt;description&lt;/span&gt;&lt;span class="pi"&gt;:&lt;/span&gt; &lt;span class="s2"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Automated&lt;/span&gt;&lt;span class="nv"&gt; &lt;/span&gt;&lt;span class="s"&gt;code&lt;/span&gt;&lt;span class="nv"&gt; &lt;/span&gt;&lt;span class="s"&gt;review&lt;/span&gt;&lt;span class="nv"&gt; &lt;/span&gt;&lt;span class="s"&gt;for&lt;/span&gt;&lt;span class="nv"&gt; &lt;/span&gt;&lt;span class="s"&gt;pull&lt;/span&gt;&lt;span class="nv"&gt; &lt;/span&gt;&lt;span class="s"&gt;requests"&lt;/span&gt;
&lt;span class="nn"&gt;---&lt;/span&gt;

&lt;span class="gh"&gt;# Code Review Skill&lt;/span&gt;

&lt;span class="gu"&gt;## When This Skill Applies&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; A pull request is open and awaiting review
&lt;span class="p"&gt;-&lt;/span&gt; You are asked to evaluate changes before merge

&lt;span class="gu"&gt;## When This Skill Does NOT Apply&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; During initial feature development
&lt;span class="p"&gt;-&lt;/span&gt; For documentation-only changes

&lt;span class="gu"&gt;## Rules&lt;/span&gt;

&lt;span class="gu"&gt;### Do&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; Review the diff systematically, file by file
&lt;span class="p"&gt;-&lt;/span&gt; Check that new code has corresponding tests
&lt;span class="p"&gt;-&lt;/span&gt; Flag security concerns immediately
&lt;span class="p"&gt;-&lt;/span&gt; Provide specific feedback with file paths and line numbers

&lt;span class="gu"&gt;### Do Not&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; Block a PR for style issues the linter doesn't catch
&lt;span class="p"&gt;-&lt;/span&gt; Suggest refactoring code not changed in this PR
&lt;span class="p"&gt;-&lt;/span&gt; Approve a PR with failing tests
&lt;span class="p"&gt;-&lt;/span&gt; Leave vague feedback without explaining how to fix
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The agent loads 2-3 relevant skills per task. A debugging task loads the bug-triage skill. A PR loads code-review. A deployment loads deploy-guard.&lt;/p&gt;

&lt;p&gt;This is dramatically more predictable than dumping everything into one giant configuration file.&lt;/p&gt;

&lt;h2&gt;
  
  
  3. Multi-Agent Safety
&lt;/h2&gt;

&lt;p&gt;If you run multiple Claude Code instances on the same repo (and you should — it's incredibly productive), you need explicit rules:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gu"&gt;## Multi-Agent Safety&lt;/span&gt;
&lt;span class="p"&gt;
-&lt;/span&gt; Never &lt;span class="sb"&gt;`git stash`&lt;/span&gt; — destroys other agents' work in progress
&lt;span class="p"&gt;-&lt;/span&gt; Never switch branches without explicit permission
&lt;span class="p"&gt;-&lt;/span&gt; Only stage your own files when committing
&lt;span class="p"&gt;-&lt;/span&gt; Run &lt;span class="sb"&gt;`git pull --rebase`&lt;/span&gt; before every push
&lt;span class="p"&gt;-&lt;/span&gt; If you see unfamiliar files, ignore them — another agent is working
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Every single rule exists because of a real incident. The &lt;code&gt;git stash&lt;/code&gt; one was particularly painful — one agent stashed another's work, then the stash got dropped during cleanup.&lt;/p&gt;

&lt;h2&gt;
  
  
  4. The Trust Spectrum
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Full Manual ────────────────────── Full Autopilot
  Ask    Read   Format  Test  Commit  Deploy  Business
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Most teams start on the far left (ask for everything) and never move. The goal is finding your team's comfort zone and encoding it in configuration.&lt;/p&gt;

&lt;p&gt;For a new project, we start at "Commit" — the agent can read, format, test, and commit freely. Deploys and business decisions need approval. As trust builds, the boundary shifts right.&lt;/p&gt;

&lt;h2&gt;
  
  
  5. Real-World Example: A Filled-In CLAUDE.md
&lt;/h2&gt;

&lt;p&gt;The playbook includes a complete, working CLAUDE.md for a fictional SaaS called "Acme SaaS":&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gh"&gt;# Acme SaaS — AI Agent Configuration&lt;/span&gt;

You are Atlas, the lead engineer for Acme SaaS.
You operate autonomously within the boundaries
defined in this document.

&lt;span class="gu"&gt;## Core Responsibilities&lt;/span&gt;
&lt;span class="p"&gt;1.&lt;/span&gt; Deliver working features that pass code review
&lt;span class="p"&gt;2.&lt;/span&gt; Maintain test coverage above 80%
&lt;span class="p"&gt;3.&lt;/span&gt; Keep the codebase clean — no lint errors, no type errors

&lt;span class="gu"&gt;## Guiding Principles&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; Act on your own judgment within the guardrails
&lt;span class="p"&gt;-&lt;/span&gt; Never guess. Read the code, check the data, then decide
&lt;span class="p"&gt;-&lt;/span&gt; 1 PR = 1 topic. Do not bundle unrelated changes
&lt;span class="p"&gt;-&lt;/span&gt; Ship working increments over perfect solutions
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This isn't a template with &lt;code&gt;[INSERT HERE]&lt;/code&gt; placeholders. It's a working configuration that shows how every section fits together.&lt;/p&gt;




&lt;h2&gt;
  
  
  Want all 24 files?
&lt;/h2&gt;

&lt;p&gt;The full playbook includes everything above plus configuration templates for 5 tools (Claude Code, Cursor, Windsurf, Aider, multi-agent setups), 4 example skills, safety checklists, cost controls, and 3 complete real-world agent configurations.&lt;/p&gt;

&lt;p&gt;All Markdown. Drop into your project. Start using immediately.&lt;/p&gt;

&lt;p&gt;50% off this week with code &lt;strong&gt;LAUNCH50&lt;/strong&gt;: &lt;a href="https://hideyoshi.app/playbook" rel="noopener noreferrer"&gt;hideyoshi.app/playbook&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Or grab the &lt;a href="https://hideyoshi.app/claude-md-starter.md" rel="noopener noreferrer"&gt;free CLAUDE.md starter template&lt;/a&gt; to get started.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>productivity</category>
      <category>programming</category>
      <category>claude</category>
    </item>
    <item>
      <title>I built an autonomous AI agent playbook — here are the patterns that actually work</title>
      <dc:creator>hideyoshi</dc:creator>
      <pubDate>Sun, 22 Mar 2026 01:24:29 +0000</pubDate>
      <link>https://dev.to/hideyoshi_th/i-built-an-autonomous-ai-agent-playbook-here-are-the-patterns-that-actually-work-3244</link>
      <guid>https://dev.to/hideyoshi_th/i-built-an-autonomous-ai-agent-playbook-here-are-the-patterns-that-actually-work-3244</guid>
      <description>&lt;p&gt;I've been running an autonomous AI agent (codename Hideyoshi) as a full business operator for the past few months. Not a chatbot. Not a copilot. An agent that makes decisions, writes code, deploys, and markets — with minimal human oversight.&lt;/p&gt;

&lt;p&gt;Along the way I discovered a set of patterns that actually hold up in production. These are battle-tested.&lt;/p&gt;

&lt;h2&gt;
  
  
  1. Constrained Autonomy &amp;gt; Full Autonomy
&lt;/h2&gt;

&lt;p&gt;Giving an agent unlimited freedom sounds cool until it burns your production database. The pattern that works:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Approve-free zone&lt;/strong&gt;: formatting, linting, test runs, commits within scope, research&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Human gate&lt;/strong&gt;: releases, billing changes, security-impacting changes, bulk operations (5+ items)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Define the boundary explicitly in a config file. The agent reads it on every session. No ambiguity.&lt;/p&gt;

&lt;h2&gt;
  
  
  2. Verify Before You Act
&lt;/h2&gt;

&lt;p&gt;The single most important rule. Agents love to hallucinate solutions.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Never guess. Read the code. Check the data.&lt;/li&gt;
&lt;li&gt;Bug fixes require: symptom evidence, root cause (file + line), a fix that addresses the cause, and a regression test.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This alone eliminated 80% of the "fixes" that broke other things.&lt;/p&gt;

&lt;h2&gt;
  
  
  3. Skill Modules for Domain Knowledge
&lt;/h2&gt;

&lt;p&gt;Split specialized behaviors into skill files. Each skill has:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A clear scope (when it applies, when it doesn't)&lt;/li&gt;
&lt;li&gt;Concrete do/don't rules&lt;/li&gt;
&lt;li&gt;An escalation path&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The agent loads relevant skills on demand. Like giving a new employee a procedures manual for each department.&lt;/p&gt;

&lt;h2&gt;
  
  
  4. Multi-Agent Safety Rules
&lt;/h2&gt;

&lt;p&gt;When multiple agents work on the same repo, things break fast without rules:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Never touch git stash (you'll destroy another agent's work)&lt;/li&gt;
&lt;li&gt;Never switch branches unless explicitly told&lt;/li&gt;
&lt;li&gt;Only stage your own changes&lt;/li&gt;
&lt;li&gt;Always git pull --rebase before push&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Every single one came from a real incident.&lt;/p&gt;

&lt;h2&gt;
  
  
  5. Design as a First-Class Constraint
&lt;/h2&gt;

&lt;p&gt;Without explicit design constraints, AI agents produce AI-looking output. The antidote:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Ban gratuitous gradients, emoji in UI, generic icon sets&lt;/li&gt;
&lt;li&gt;Limit palette: 2 colors max (base + accent)&lt;/li&gt;
&lt;li&gt;Mandatory screenshot review after every UI change&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  The Meta-Pattern
&lt;/h2&gt;

&lt;p&gt;All five patterns share one thing: &lt;strong&gt;they're encoded in the repo, not in prompts.&lt;/strong&gt; The agent reads project files, not chat history. This makes behavior reproducible across sessions and across agents.&lt;/p&gt;

&lt;p&gt;More details and real examples: &lt;a href="https://hideyoshi.app/playbook" rel="noopener noreferrer"&gt;hideyoshi.app/playbook&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;What patterns have you found? What breaks when you give agents more autonomy?&lt;/p&gt;

</description>
      <category>ai</category>
      <category>productivity</category>
      <category>devops</category>
      <category>programming</category>
    </item>
    <item>
      <title>5 CLAUDE.md Patterns That Actually Work in Production</title>
      <dc:creator>hideyoshi</dc:creator>
      <pubDate>Sat, 21 Mar 2026 08:12:52 +0000</pubDate>
      <link>https://dev.to/hideyoshi_th/5-claudemd-patterns-that-actually-work-in-production-5ho</link>
      <guid>https://dev.to/hideyoshi_th/5-claudemd-patterns-that-actually-work-in-production-5ho</guid>
      <description>&lt;p&gt;Everyone's talking about AI coding agents. Most people are still writing CLAUDE.md files that look like this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;Use TypeScript. Follow best practices. Be helpful.
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;That's a style guide, not a system prompt. Here are 5 patterns I've tested in production that actually change how the agent behaves.&lt;/p&gt;

&lt;h2&gt;
  
  
  1. Constrained Autonomy
&lt;/h2&gt;

&lt;p&gt;The biggest unlock wasn't giving the agent &lt;em&gt;more&lt;/em&gt; freedom. It was defining &lt;em&gt;exactly&lt;/em&gt; where the fence is.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gu"&gt;## Constrained Autonomy&lt;/span&gt;

&lt;span class="gu"&gt;### Do without asking:&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; Code formatting, lint fixes
&lt;span class="p"&gt;-&lt;/span&gt; Running tests
&lt;span class="p"&gt;-&lt;/span&gt; Commits and pushes (within scope)
&lt;span class="p"&gt;-&lt;/span&gt; Installing dependencies (one auto-retry on failure)
&lt;span class="p"&gt;-&lt;/span&gt; Research, analysis, reports
&lt;span class="p"&gt;-&lt;/span&gt; Drafting marketing content

&lt;span class="gu"&gt;### Ask first:&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; Releases, version changes
&lt;span class="p"&gt;-&lt;/span&gt; Anything that costs money
&lt;span class="p"&gt;-&lt;/span&gt; Security-impacting changes
&lt;span class="p"&gt;-&lt;/span&gt; Bulk operations (5+ PRs/Issues — show count, then confirm)
&lt;span class="p"&gt;-&lt;/span&gt; Direct production impact
&lt;span class="p"&gt;-&lt;/span&gt; Major strategy pivots
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Why this works: the agent stops asking permission for trivial stuff, but you still have a kill switch on anything expensive or irreversible. The &lt;code&gt;5+ items&lt;/code&gt; threshold is oddly specific — it came from an incident where an agent tried to close 47 issues at once.&lt;/p&gt;

&lt;h2&gt;
  
  
  2. Skill System (Modular Behavior)
&lt;/h2&gt;

&lt;p&gt;One massive CLAUDE.md doesn't scale. You end up with a 2000-line file that the agent half-reads and half-ignores.&lt;/p&gt;

&lt;p&gt;The fix: skills as separate Markdown files that load on demand.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gu"&gt;## Skill Extension&lt;/span&gt;

Specialized behaviors live in &lt;span class="sb"&gt;`.agents/skills/*/SKILL.md`&lt;/span&gt;.
Skills load as needed for specific domains
(PR management, releases, debugging, etc.).

Skill structure:
&lt;span class="p"&gt;-&lt;/span&gt; Frontmatter with &lt;span class="sb"&gt;`name`&lt;/span&gt; and &lt;span class="sb"&gt;`description`&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; Clear scope (when this skill applies / doesn't apply)
&lt;span class="p"&gt;-&lt;/span&gt; Specific do/don't rules
&lt;span class="p"&gt;-&lt;/span&gt; Escalation paths
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Each skill is a self-contained behavior module. A debugging skill forces root cause analysis before any fix attempt. A PR skill enforces single-topic commits. A release skill runs a full pre-flight checklist.&lt;/p&gt;

&lt;p&gt;The agent loads 2-3 relevant skills per task instead of processing your entire configuration every time. Context stays clean, behavior stays predictable.&lt;/p&gt;

&lt;h2&gt;
  
  
  3. Multi-Agent Safety
&lt;/h2&gt;

&lt;p&gt;Running two Claude Code instances on the same repo is incredibly productive. Running three without safety rules is how you lose a day of work.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gu"&gt;## Multi-Agent Safety&lt;/span&gt;

When multiple agents work in parallel:
&lt;span class="p"&gt;
-&lt;/span&gt; Never create, apply, or drop &lt;span class="sb"&gt;`git stash`&lt;/span&gt; (can destroy another agent's work)
&lt;span class="p"&gt;-&lt;/span&gt; Never switch branches unless explicitly told to
&lt;span class="p"&gt;-&lt;/span&gt; Never create/modify git worktrees unless explicitly told to
&lt;span class="p"&gt;-&lt;/span&gt; Only stage your own files when committing
&lt;span class="p"&gt;-&lt;/span&gt; &lt;span class="sb"&gt;`git pull --rebase`&lt;/span&gt; before pushing — never discard others' work
&lt;span class="p"&gt;-&lt;/span&gt; Unknown files in the repo? Ignore them and focus on your task
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Every single one of these rules exists because of a real incident. The &lt;code&gt;git stash&lt;/code&gt; one was particularly painful — Agent A stashed Agent B's uncommitted work, then Agent B couldn't find its changes.&lt;/p&gt;

&lt;p&gt;The "ignore unknown files" rule prevents a cascade where Agent A sees Agent B's files, decides they look wrong, and "helpfully" reverts them.&lt;/p&gt;

&lt;h2&gt;
  
  
  4. Design Guardrails (Kill the AI Aesthetic)
&lt;/h2&gt;

&lt;p&gt;This one surprised me. Without design constraints, AI agents converge on a specific "AI-generated" look. You know it when you see it: rainbow gradients, scattered emojis, neon glow effects, rounded-everything.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gu"&gt;## Design Principles&lt;/span&gt;

Design is a first-class priority. Working isn't enough — it must look good.

&lt;span class="gu"&gt;### Eliminate AI aesthetic:&lt;/span&gt;
&lt;span class="p"&gt;-&lt;/span&gt; No lazy gradients (rainbow, multi-color). Subtle single-color only
&lt;span class="p"&gt;-&lt;/span&gt; No emojis in UI (or text content, in principle)
&lt;span class="p"&gt;-&lt;/span&gt; Minimal icons. Avoid generic sets (rockets, lightbulbs, gears)
&lt;span class="p"&gt;-&lt;/span&gt; Avoid "AI-looking" defaults: neon, heavy drop shadows,
  over-rounded cards, meaningless animations
&lt;span class="p"&gt;-&lt;/span&gt; References: Linear, Stripe, Vercel, Notion —
  restrained colors, hierarchy through typography
&lt;span class="p"&gt;-&lt;/span&gt; Max 2 colors (base + 1 accent)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The reference list is critical. Without it, "make it look professional" is too vague. With "look at how Linear does it," the agent has a concrete visual target.&lt;/p&gt;

&lt;p&gt;The emoji ban alone improved output quality more than I expected. AI agents love emojis. Every heading gets a rocket, every feature gets a sparkle. Banning them forces the agent to create hierarchy through actual design decisions.&lt;/p&gt;

&lt;h2&gt;
  
  
  5. Verify-Then-Act (Evidence Before Fixes)
&lt;/h2&gt;

&lt;p&gt;The default AI behavior: you report a bug, it immediately proposes a fix. The fix is often wrong because it's guessing at the root cause.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gu"&gt;## Verify Before Acting&lt;/span&gt;
&lt;span class="p"&gt;
-&lt;/span&gt; Don't guess. Read the code, check the data, then decide.
&lt;span class="p"&gt;-&lt;/span&gt; Bug fixes require evidence:
&lt;span class="p"&gt;  1.&lt;/span&gt; Proof of symptom
&lt;span class="p"&gt;  2.&lt;/span&gt; Root cause identification (file/line)
&lt;span class="p"&gt;  3.&lt;/span&gt; Fix that addresses the root cause
&lt;span class="p"&gt;  4.&lt;/span&gt; Regression test
&lt;span class="p"&gt;-&lt;/span&gt; Read npm dependency source and local code before
  concluding where a bug lives
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This pattern forces a diagnostic workflow. The agent can't just pattern-match on the error message and apply the most common fix. It has to trace the actual execution path, find the actual broken line, and prove its fix addresses &lt;em&gt;that specific cause&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;The regression test requirement catches the "fix the symptom, not the cause" failure mode. If the agent can't write a test that would have caught the bug, its understanding of the root cause is probably wrong.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Compound Effect
&lt;/h2&gt;

&lt;p&gt;None of these patterns is revolutionary on its own. But together, they create something that behaves less like a chatbot and more like a disciplined engineer:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Constrained autonomy means it doesn't waste your time with trivial approvals&lt;/li&gt;
&lt;li&gt;Skills keep behavior focused and predictable&lt;/li&gt;
&lt;li&gt;Multi-agent safety lets you scale without chaos&lt;/li&gt;
&lt;li&gt;Design guardrails produce output you're not embarrassed to ship&lt;/li&gt;
&lt;li&gt;Verify-then-act catches bugs properly instead of cargo-culting fixes&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The difference between a useful AI agent and an impressive demo is almost entirely in the configuration layer.&lt;/p&gt;




&lt;p&gt;I've packaged these patterns (and about 15 more) into a complete playbook with ready-to-use templates. Free starter template at &lt;a href="https://hideyoshi.app" rel="noopener noreferrer"&gt;hideyoshi.app&lt;/a&gt;. Full playbook with 20+ files: 50% off this week with code &lt;code&gt;LAUNCH50&lt;/code&gt; at &lt;a href="https://hideyoshi.app/playbook" rel="noopener noreferrer"&gt;hideyoshi.app/playbook&lt;/a&gt;.&lt;/p&gt;

</description>
      <category>claude</category>
      <category>ai</category>
      <category>productivity</category>
      <category>programming</category>
    </item>
    <item>
      <title>My CLAUDE.md That Runs a Business (With Actual Config Snippets)</title>
      <dc:creator>hideyoshi</dc:creator>
      <pubDate>Sat, 21 Mar 2026 04:31:48 +0000</pubDate>
      <link>https://dev.to/hideyoshi_th/my-claudemd-that-runs-a-business-with-actual-config-snippets-36jg</link>
      <guid>https://dev.to/hideyoshi_th/my-claudemd-that-runs-a-business-with-actual-config-snippets-36jg</guid>
      <description>&lt;p&gt;Most CLAUDE.md files I've seen look like this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;Use TypeScript. Prefer functional components. Run prettier before committing.
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Mine looks like this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;You are responsible for this business's revenue, strategy, product
development, marketing, and operations. You make decisions autonomously.
You do not wait for instructions.
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Same file. Very different outcomes. Let me show you what happens when you treat CLAUDE.md not as a coding style guide, but as an operating manual for an autonomous agent.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Setup
&lt;/h2&gt;

&lt;p&gt;I run a project called Hideyoshi. It's a Claude Code agent configured to operate a business end-to-end: build the product, deploy it, write marketing copy, manage releases. My role has shifted from "person who writes code" to "person who reviews PRs and approves payments."&lt;/p&gt;

&lt;p&gt;The entire system runs on Markdown configuration files. No custom tooling, no wrapper scripts, no API integrations. Just CLAUDE.md, a handful of skill files, and Claude Code doing its thing.&lt;/p&gt;

&lt;p&gt;Here's how the configuration actually works.&lt;/p&gt;

&lt;h2&gt;
  
  
  Section 1: Identity and Responsibilities
&lt;/h2&gt;

&lt;p&gt;The first thing in my CLAUDE.md isn't a coding convention. It's a job description.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gu"&gt;## Your Responsibilities&lt;/span&gt;
&lt;span class="p"&gt;
1.&lt;/span&gt; Revenue: you own the P&amp;amp;L
&lt;span class="p"&gt;2.&lt;/span&gt; Strategy: market analysis, competitive research, planning
&lt;span class="p"&gt;3.&lt;/span&gt; Product: design, build, ship, iterate
&lt;span class="p"&gt;4.&lt;/span&gt; Marketing: acquisition, awareness, conversion
&lt;span class="p"&gt;5.&lt;/span&gt; Operations: everything else the business needs
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This sounds abstract, but it changes the agent's behavior in concrete ways. When I say "we need more traffic," the agent doesn't ask me what to do. It researches channels, drafts content, and opens a PR with a marketing page. When a test fails, it doesn't just report the failure -- it traces the root cause, fixes it, adds a regression test, and commits.&lt;/p&gt;

&lt;p&gt;The job description creates a default bias toward action. Without it, the agent defaults to "helpful assistant" mode: waiting for specific instructions, asking clarifying questions, hedging its answers. With it, the agent defaults to "I own this, let me figure it out."&lt;/p&gt;

&lt;h2&gt;
  
  
  Section 2: The Trust Boundary
&lt;/h2&gt;

&lt;p&gt;This is the part that took the most iteration. You need two explicit lists:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gu"&gt;## Do Without Asking&lt;/span&gt;
&lt;span class="p"&gt;
-&lt;/span&gt; Format, lint, fix code style
&lt;span class="p"&gt;-&lt;/span&gt; Run tests
&lt;span class="p"&gt;-&lt;/span&gt; Commit and push (within scope)
&lt;span class="p"&gt;-&lt;/span&gt; Install dependencies
&lt;span class="p"&gt;-&lt;/span&gt; Research and analysis
&lt;span class="p"&gt;-&lt;/span&gt; Draft marketing content

&lt;span class="gu"&gt;## Requires Human Approval&lt;/span&gt;
&lt;span class="p"&gt;
-&lt;/span&gt; Releases and version changes
&lt;span class="p"&gt;-&lt;/span&gt; Any operation that spends money
&lt;span class="p"&gt;-&lt;/span&gt; Security-impacting changes
&lt;span class="p"&gt;-&lt;/span&gt; Bulk operations (5+ items affected)
&lt;span class="p"&gt;-&lt;/span&gt; Direct production impact
&lt;span class="p"&gt;-&lt;/span&gt; Major strategy pivots
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The boundary between these two lists is where the entire system succeeds or fails. Too restrictive and the agent pings you every five minutes -- you're back to being a human in the loop for every decision. Too loose and you wake up to a surprise production deployment or an unexpected charge on your credit card.&lt;/p&gt;

&lt;p&gt;The sweet spot I landed on: &lt;strong&gt;the agent can complete an entire development cycle (code, test, commit, push) without interruption. Deployment and anything involving money are the human checkpoints.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This means the agent can build a complete feature, write tests, commit with a meaningful message, and push to a branch -- all without asking. But it can't merge to main, deploy, or sign up for a new SaaS tool.&lt;/p&gt;

&lt;h2&gt;
  
  
  Section 3: Design Guardrails (The Hard Lesson)
&lt;/h2&gt;

&lt;p&gt;I learned this one the painful way. Without design constraints, Claude Code produces UI that is technically functional and aesthetically... recognizable. You know the look: gradient backgrounds in four colors, icons on everything, rounded corners you could lose a marble in.&lt;/p&gt;

&lt;p&gt;My config now includes this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gu"&gt;## Design Principles&lt;/span&gt;
&lt;span class="p"&gt;
-&lt;/span&gt; Maximum 2 colors: base + one accent
&lt;span class="p"&gt;-&lt;/span&gt; No emojis in the UI
&lt;span class="p"&gt;-&lt;/span&gt; No multi-color gradients
&lt;span class="p"&gt;-&lt;/span&gt; Icons: minimal, only when they add meaning
&lt;span class="p"&gt;-&lt;/span&gt; Reference aesthetic: Linear, Stripe, Vercel
&lt;span class="p"&gt;-&lt;/span&gt; Typography creates hierarchy, not color
&lt;span class="p"&gt;-&lt;/span&gt; Verify every UI change with a screenshot
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;That last line matters. The agent takes a Playwright screenshot after every UI change and visually checks its own work. This single rule eliminated about 80% of the "it works but looks terrible" PRs.&lt;/p&gt;

&lt;p&gt;The reference list (Linear, Stripe, Vercel) is surprisingly effective. Instead of trying to describe good design in words -- which is like trying to describe music in a spreadsheet -- you give the agent concrete examples of the target aesthetic. It gets the point.&lt;/p&gt;

&lt;h2&gt;
  
  
  Section 4: Multi-Agent Safety
&lt;/h2&gt;

&lt;p&gt;If you run multiple Claude Code instances on the same repo (and you should -- it's wildly productive), you need coordination rules. Every rule below exists because of a real incident:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gu"&gt;## Multi-Agent Rules&lt;/span&gt;
&lt;span class="p"&gt;
-&lt;/span&gt; Never create, apply, or drop a git stash
&lt;span class="p"&gt;-&lt;/span&gt; Never switch branches unless explicitly told to
&lt;span class="p"&gt;-&lt;/span&gt; Only stage your own files when committing
&lt;span class="p"&gt;-&lt;/span&gt; Always git pull --rebase before pushing
&lt;span class="p"&gt;-&lt;/span&gt; Ignore unfamiliar files -- another agent is working on them
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The &lt;code&gt;git stash&lt;/code&gt; rule is the one that bit me hardest. Agent A stashes its work-in-progress, Agent B runs &lt;code&gt;git stash&lt;/code&gt; and now Agent A's stash is buried. Agent A pops what it thinks is its stash and gets Agent B's half-finished work. Chaos.&lt;/p&gt;

&lt;p&gt;The fix is simple: don't stash. Use branches. But the agent won't know that unless you tell it.&lt;/p&gt;

&lt;h2&gt;
  
  
  Section 5: Skills as Composable Behaviors
&lt;/h2&gt;

&lt;p&gt;One monolithic CLAUDE.md doesn't scale. My project splits specialized behaviors into skill files that live in &lt;code&gt;.agents/skills/&lt;/code&gt;:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;.agents/skills/
  copywriting/SKILL.md
  seo-audit/SKILL.md
  launch-strategy/SKILL.md
  pricing-strategy/SKILL.md
  content-strategy/SKILL.md
  ...
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Each skill has a clear scope (when it applies), concrete rules (do this, don't do that), and an escalation path (when to stop and ask a human). The agent loads the relevant skill based on the task at hand.&lt;/p&gt;

&lt;p&gt;This is what makes the system extensible. Adding a new capability doesn't mean editing a giant config file and hoping you don't break something. You add a new skill file. The agent discovers it and uses it when the context matches.&lt;/p&gt;

&lt;h2&gt;
  
  
  What This Actually Looks Like Day-to-Day
&lt;/h2&gt;

&lt;p&gt;Yesterday, I told the agent we were launching a product. It:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Built the landing page (Next.js, Tailwind, responsive)&lt;/li&gt;
&lt;li&gt;Set up structured data for SEO&lt;/li&gt;
&lt;li&gt;Created a free lead magnet (a starter CLAUDE.md template)&lt;/li&gt;
&lt;li&gt;Drafted marketing posts for four different channels&lt;/li&gt;
&lt;li&gt;Added analytics tracking&lt;/li&gt;
&lt;li&gt;Committed each piece as a separate, scoped PR&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;I reviewed the PRs, approved them, and the product was live. Total hands-on time for me: maybe 30 minutes of review across the day.&lt;/p&gt;

&lt;p&gt;Is this replacing senior engineering judgment? No. I still make the strategic calls, review the code, and approve anything that touches production. But the sheer volume of execution that happens between those checkpoints is something I couldn't do alone.&lt;/p&gt;

&lt;h2&gt;
  
  
  Try the Config Pattern Yourself
&lt;/h2&gt;

&lt;p&gt;If you want to experiment with this approach, I put together a free CLAUDE.md starter template you can drop into any project. It includes the trust boundary pattern, basic design guardrails, and a simple skill structure.&lt;/p&gt;

&lt;p&gt;Grab it at &lt;a href="https://hideyoshi.app" rel="noopener noreferrer"&gt;hideyoshi.app&lt;/a&gt; -- no signup required, just a direct download.&lt;/p&gt;

&lt;p&gt;And if you want the full production setup -- complete templates, 30+ skill files, multi-agent safety configs, and real-world agent configurations -- that's in &lt;a href="https://hideyoshi.app/playbook" rel="noopener noreferrer"&gt;The Autonomous AI Agent Playbook&lt;/a&gt;. $19, one-time, all Markdown.&lt;/p&gt;




&lt;p&gt;Questions about any of these patterns? I'm in the comments.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>productivity</category>
      <category>programming</category>
      <category>claude</category>
    </item>
    <item>
      <title>How I Configure AI Coding Agents for Autonomous Operation (With Real Examples)</title>
      <dc:creator>hideyoshi</dc:creator>
      <pubDate>Fri, 20 Mar 2026 13:59:33 +0000</pubDate>
      <link>https://dev.to/hideyoshi_th/how-i-configure-ai-coding-agents-for-autonomous-operation-with-real-examples-3377</link>
      <guid>https://dev.to/hideyoshi_th/how-i-configure-ai-coding-agents-for-autonomous-operation-with-real-examples-3377</guid>
      <description>&lt;p&gt;I've been running an experiment for the past few months: giving an AI coding agent enough configuration and trust to operate autonomously on real business tasks.&lt;/p&gt;

&lt;p&gt;Not "generate a React component." More like "you are responsible for this product — build it, ship it, market it."&lt;/p&gt;

&lt;p&gt;The agent is called Hideyoshi. It runs on Claude Code. Here's what I've learned about the configuration layer that makes autonomous operation possible.&lt;/p&gt;

&lt;h3&gt;
  
  
  The Configuration Layer Is Everything
&lt;/h3&gt;

&lt;p&gt;When most developers use AI coding tools, the interaction is transactional: you ask for code, it writes code, you review and edit.&lt;/p&gt;

&lt;p&gt;Autonomous operation requires a different approach. The agent needs:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Clear responsibilities&lt;/strong&gt; — What is it accountable for?&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Trust boundaries&lt;/strong&gt; — What can it do alone? What needs approval?&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Quality standards&lt;/strong&gt; — How should the output look and behave?&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Safety rails&lt;/strong&gt; — How do you prevent it from causing damage?&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;All of this lives in configuration files that the agent reads on startup.&lt;/p&gt;

&lt;h3&gt;
  
  
  Pattern 1: Constrained Autonomy
&lt;/h3&gt;

&lt;p&gt;The most important pattern. You define two explicit lists:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Can do without asking:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Run tests, lint, format code&lt;/li&gt;
&lt;li&gt;Commit and push within scope&lt;/li&gt;
&lt;li&gt;Install dependencies&lt;/li&gt;
&lt;li&gt;Draft content&lt;/li&gt;
&lt;li&gt;Run investigations and analysis&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Needs human approval:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Production deployments&lt;/li&gt;
&lt;li&gt;Purchases or billing changes&lt;/li&gt;
&lt;li&gt;Security-impacting changes&lt;/li&gt;
&lt;li&gt;Bulk operations (5+ items affected)&lt;/li&gt;
&lt;li&gt;Major strategy changes&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This sounds simple, but the boundaries require real thought. Too restrictive and the agent asks permission for everything (defeating the purpose). Too loose and you're debugging production incidents at 3am.&lt;/p&gt;

&lt;p&gt;The sweet spot: &lt;strong&gt;the agent should be able to complete a full development cycle (code → test → commit → push) without interruption.&lt;/strong&gt; Deployment and release are the human checkpoints.&lt;/p&gt;

&lt;h3&gt;
  
  
  Pattern 2: Modular Skill System
&lt;/h3&gt;

&lt;p&gt;One massive configuration file doesn't scale. Instead, break agent behavior into composable "skills" — each a separate Markdown file that activates in specific contexts.&lt;/p&gt;

&lt;p&gt;Example skills:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Debugging Skill&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gh"&gt;# When: Bug report or test failure&lt;/span&gt;
&lt;span class="p"&gt;
1.&lt;/span&gt; Reproduce the issue (show evidence)
&lt;span class="p"&gt;2.&lt;/span&gt; Identify root cause (file and line number)
&lt;span class="p"&gt;3.&lt;/span&gt; Explain WHY it happens
&lt;span class="p"&gt;4.&lt;/span&gt; Fix the cause, not the symptom
&lt;span class="p"&gt;5.&lt;/span&gt; Add regression test
&lt;span class="p"&gt;6.&lt;/span&gt; Never guess — read the code first
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;PR Review Skill&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gh"&gt;# When: Creating or reviewing pull requests&lt;/span&gt;
&lt;span class="p"&gt;
1.&lt;/span&gt; One PR = one topic (no bundled changes)
&lt;span class="p"&gt;2.&lt;/span&gt; Commit messages are action-oriented
&lt;span class="p"&gt;3.&lt;/span&gt; Only stage your own files
&lt;span class="p"&gt;4.&lt;/span&gt; Run tests before push
&lt;span class="p"&gt;5.&lt;/span&gt; Include before/after evidence for UI changes
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Release Skill&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gh"&gt;# When: Version bump or release&lt;/span&gt;
&lt;span class="p"&gt;
1.&lt;/span&gt; All tests pass
&lt;span class="p"&gt;2.&lt;/span&gt; CHANGELOG updated
&lt;span class="p"&gt;3.&lt;/span&gt; Version bumped in package.json
&lt;span class="p"&gt;4.&lt;/span&gt; Build succeeds
&lt;span class="p"&gt;5.&lt;/span&gt; Human approval obtained
&lt;span class="p"&gt;6.&lt;/span&gt; Tag and push
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The key insight: skills are composable. The agent loads whichever skills are relevant to its current task, keeping context focused and behavior predictable.&lt;/p&gt;

&lt;h3&gt;
  
  
  Pattern 3: Design Guardrails
&lt;/h3&gt;

&lt;p&gt;This was the hardest lesson. Without explicit design constraints, AI-generated UI converges on a recognizable aesthetic:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Rainbow or multi-color gradients&lt;/li&gt;
&lt;li&gt;Emojis scattered throughout the interface&lt;/li&gt;
&lt;li&gt;Excessive drop shadows and animations&lt;/li&gt;
&lt;li&gt;Generic icon sets (rockets, lightbulbs, gears)&lt;/li&gt;
&lt;li&gt;Overly rounded corners on everything&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;None of this is inherently bad, but it's instantly recognizable as "AI-generated." If you want production-quality output, you need constraints:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="gh"&gt;# Design Principles&lt;/span&gt;
&lt;span class="p"&gt;
-&lt;/span&gt; Maximum 2 colors (base + 1 accent)
&lt;span class="p"&gt;-&lt;/span&gt; No emojis in UI
&lt;span class="p"&gt;-&lt;/span&gt; No multi-color gradients
&lt;span class="p"&gt;-&lt;/span&gt; Icons: minimal, purposeful
&lt;span class="p"&gt;-&lt;/span&gt; Reference: Linear, Stripe, Vercel, Notion
&lt;span class="p"&gt;-&lt;/span&gt; Typography creates hierarchy, not color
&lt;span class="p"&gt;-&lt;/span&gt; Verify every UI change with a screenshot
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;After adding these constraints, the quality of generated interfaces improved dramatically. The agent stopped making "creative" choices and started making disciplined ones.&lt;/p&gt;

&lt;h3&gt;
  
  
  Pattern 4: Multi-Agent Safety
&lt;/h3&gt;

&lt;p&gt;Running multiple AI agents on the same repository is powerful — one can work on frontend while another handles backend. But it introduces real coordination problems.&lt;/p&gt;

&lt;p&gt;Rules I learned the hard way:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Never &lt;code&gt;git stash&lt;/code&gt;&lt;/strong&gt; — Agent A stashes work, Agent B's stash operation overwrites it. Use branches instead.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Never switch branches&lt;/strong&gt; unless explicitly told to. Agents should stay on their assigned branch.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Only stage your own files&lt;/strong&gt; when committing. &lt;code&gt;git add .&lt;/code&gt; in a multi-agent setup is dangerous.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Always &lt;code&gt;git pull --rebase&lt;/code&gt; before pushing.&lt;/strong&gt; Never overwrite another agent's commits.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Ignore unfamiliar files.&lt;/strong&gt; If an agent sees files it didn't create, it should leave them alone.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Every one of these rules exists because of a real incident where agents destroyed each other's work.&lt;/p&gt;

&lt;h3&gt;
  
  
  The Result
&lt;/h3&gt;

&lt;p&gt;With these patterns in place, Hideyoshi operates as something closer to a junior team member than a code generator:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;It builds complete features end-to-end&lt;/li&gt;
&lt;li&gt;It follows consistent quality standards&lt;/li&gt;
&lt;li&gt;It coordinates with other agents safely&lt;/li&gt;
&lt;li&gt;It escalates decisions that require human judgment&lt;/li&gt;
&lt;li&gt;It documents its work through meaningful commits&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Is it perfect? No. Does it replace senior engineering judgment? No. But it handles a remarkable amount of work that would otherwise require constant human direction.&lt;/p&gt;

&lt;h3&gt;
  
  
  Try It Yourself
&lt;/h3&gt;

&lt;p&gt;I've packaged all of these patterns into &lt;strong&gt;The Autonomous AI Agent Playbook&lt;/strong&gt; — a set of Markdown configuration files you can copy into your project and customize:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Complete CLAUDE.md and .cursorrules templates&lt;/li&gt;
&lt;li&gt;5+ ready-made skill files&lt;/li&gt;
&lt;li&gt;Multi-agent safety configuration&lt;/li&gt;
&lt;li&gt;Security and governance checklists&lt;/li&gt;
&lt;li&gt;3 real-world agent configurations (business agent, CI/CD agent, support agent)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Everything is in Markdown. Works with Claude Code, Cursor, Windsurf, or any tool that reads configuration files.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;$19, one-time purchase. No subscription.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://hideyoshi.app/playbook" rel="noopener noreferrer"&gt;Get the Playbook&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;If you have questions about any of these patterns, drop them in the comments. Happy to share more specific examples.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>productivity</category>
      <category>programming</category>
      <category>devops</category>
    </item>
  </channel>
</rss>
