<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Sharp Dev Eye</title>
    <description>The latest articles on DEV Community by Sharp Dev Eye (@sharpdeveye).</description>
    <link>https://dev.to/sharpdeveye</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3872031%2Fb40e0388-3e79-4243-add7-740eaddfae59.jpeg</url>
      <title>DEV Community: Sharp Dev Eye</title>
      <link>https://dev.to/sharpdeveye</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/sharpdeveye"/>
    <language>en</language>
    <item>
      <title>Your AI coding agent is winging it. Here's how to stop that.</title>
      <dc:creator>Sharp Dev Eye</dc:creator>
      <pubDate>Fri, 10 Apr 2026 15:22:39 +0000</pubDate>
      <link>https://dev.to/sharpdeveye/your-ai-coding-agent-is-winging-it-heres-how-to-stop-that-48h0</link>
      <guid>https://dev.to/sharpdeveye/your-ai-coding-agent-is-winging-it-heres-how-to-stop-that-48h0</guid>
      <description>&lt;p&gt;I spent months watching AI coding agents make the same mistakes across every project I threw at them:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Unstructured wall-of-text prompts&lt;/li&gt;
&lt;li&gt;Context windows stuffed until they overflow&lt;/li&gt;
&lt;li&gt;15+ tools exposed with vague one-line descriptions&lt;/li&gt;
&lt;li&gt;Zero error handling — happy path only&lt;/li&gt;
&lt;li&gt;Multi-agent orchestration for tasks a single agent handles fine&lt;/li&gt;
&lt;li&gt;"It seems to work" as the entire evaluation strategy&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I call this &lt;strong&gt;workflow slop&lt;/strong&gt;. And every AI coding tool ships with it by default.&lt;/p&gt;

&lt;p&gt;So I built &lt;a href="https://github.com/sharpdeveye/maestro" rel="noopener noreferrer"&gt;&lt;strong&gt;Maestro&lt;/strong&gt;&lt;/a&gt;: 21 skills (one core skill plus 20 commands) that inject workflow discipline into any AI coding agent. One install. Works with Cursor, Claude Code, Gemini CLI, Copilot, Codex, and 5 more.&lt;/p&gt;




&lt;h2&gt;What Does "Workflow Slop" Actually Look Like?&lt;/h2&gt;

&lt;p&gt;Run &lt;code&gt;/diagnose&lt;/code&gt; on any project. You'll get a scored audit across 5 dimensions:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;╔══════════════════════════════════════╗
║          MAESTRO DIAGNOSTIC         ║
╠══════════════════════════════════════╣
║ Prompt Quality       ████░  4/5     ║
║ Context Efficiency   ███░░  3/5     ║
║ Tool Health          ██░░░  2/5     ║
║ Architecture         ████░  4/5     ║
║ Safety &amp;amp; Reliability ██░░░  2/5     ║
╠══════════════════════════════════════╣
║ Overall Score:       15/25          ║
╚══════════════════════════════════════╝
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Each score maps to a specific remediation command:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Score&lt;/th&gt;
&lt;th&gt;Meaning&lt;/th&gt;
&lt;th&gt;Auto-prescribed action&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;5&lt;/td&gt;
&lt;td&gt;Excellent&lt;/td&gt;
&lt;td&gt;No action needed&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;4&lt;/td&gt;
&lt;td&gt;Minor gaps&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;/refine&lt;/code&gt; for polish&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;3&lt;/td&gt;
&lt;td&gt;Functional but risky&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;/fortify&lt;/code&gt; or &lt;code&gt;/streamline&lt;/code&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2&lt;/td&gt;
&lt;td&gt;Significant issues&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;/fortify&lt;/code&gt; + &lt;code&gt;/guard&lt;/code&gt; immediately&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;Broken&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;/onboard-agent&lt;/code&gt; — rebuild&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;No generic advice. No "consider adding tests." The agent tells you &lt;em&gt;exactly&lt;/em&gt; which command to run next.&lt;/p&gt;
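
&lt;p&gt;To make the table concrete, here is a minimal Python sketch of the score-to-command lookup and the overall total. It is an illustration only, not Maestro's actual implementation: the names &lt;code&gt;ACTIONS&lt;/code&gt; and &lt;code&gt;prescribe&lt;/code&gt; are hypothetical, and the scores are the ones from the sample audit above.&lt;/p&gt;

```python
# Hypothetical sketch of the score-to-command table; not Maestro's real code.
ACTIONS = {
    5: "No action needed",
    4: "/refine",
    3: "/fortify or /streamline",
    2: "/fortify + /guard",
    1: "/onboard-agent",
}


def prescribe(scores):
    """Return (overall, maximum, per-dimension actions) for 1-5 scores."""
    for name, score in scores.items():
        if score not in ACTIONS:
            raise ValueError(f"{name}: score must be an integer from 1 to 5")
    overall = sum(scores.values())
    maximum = 5 * len(scores)
    actions = {name: ACTIONS[score] for name, score in scores.items()}
    return overall, maximum, actions


# Scores copied from the sample diagnostic above.
report = {
    "Prompt Quality": 4,
    "Context Efficiency": 3,
    "Tool Health": 2,
    "Architecture": 4,
    "Safety and Reliability": 2,
}
overall, maximum, actions = prescribe(report)
print(f"Overall Score: {overall}/{maximum}")  # Overall Score: 15/25
for name, action in actions.items():
    print(f"{name}: {action}")
```

&lt;p&gt;With the sample scores, the two dimensions at 2/5 both resolve to &lt;code&gt;/fortify&lt;/code&gt; + &lt;code&gt;/guard&lt;/code&gt;, which is where you would start.&lt;/p&gt;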




&lt;h2&gt;The 20 Commands&lt;/h2&gt;

&lt;p&gt;Every command is a structured skill file with explicit instructions, checklists, anti-patterns, and a &lt;strong&gt;recommended next step&lt;/strong&gt; so the agent never leaves you hanging.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Analysis&lt;/strong&gt; — find the problems:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;code&gt;/diagnose&lt;/code&gt; — Full workflow health audit with scored dimensions&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;/evaluate&lt;/code&gt; — Test workflow quality against realistic scenarios&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Fix &amp;amp; Improve&lt;/strong&gt; — targeted repairs:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;code&gt;/fortify&lt;/code&gt; — Add error handling, retries, fallbacks&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;/streamline&lt;/code&gt; — Remove over-engineering and complexity&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;/calibrate&lt;/code&gt; — Align naming, formatting, conventions&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;/refine&lt;/code&gt; — Final quality pass before shipping&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Enhancement&lt;/strong&gt; — add new capabilities:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;code&gt;/amplify&lt;/code&gt; &lt;code&gt;/compose&lt;/code&gt; &lt;code&gt;/enrich&lt;/code&gt; &lt;code&gt;/accelerate&lt;/code&gt; &lt;code&gt;/chain&lt;/code&gt; &lt;code&gt;/guard&lt;/code&gt; &lt;code&gt;/iterate&lt;/code&gt; &lt;code&gt;/temper&lt;/code&gt; &lt;code&gt;/turbocharge&lt;/code&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Utility&lt;/strong&gt; — setup and adaptation:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;code&gt;/teach-maestro&lt;/code&gt; &lt;code&gt;/onboard-agent&lt;/code&gt; &lt;code&gt;/specialize&lt;/code&gt; &lt;code&gt;/adapt-workflow&lt;/code&gt; &lt;code&gt;/extract-pattern&lt;/code&gt;
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;Install in 30 Seconds&lt;/h2&gt;

&lt;h3&gt;Option A: Skill Files (any provider)&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;npx skills add sharpdeveye/maestro
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Works with Cursor, Claude Code, Gemini CLI, Codex CLI, VS Code Copilot / Antigravity, Kiro, Trae, OpenCode, and Pi.&lt;/p&gt;

&lt;h3&gt;Option B: MCP Server (any MCP client)&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight json"&gt;&lt;code&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"mcpServers"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"maestro"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"command"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"npx"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="nl"&gt;"args"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="s2"&gt;"-y"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"maestro-workflow-mcp"&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Drop that in your MCP config. Done. 20 prompts, 4 tools, 8 knowledge resources — instantly available.&lt;/p&gt;
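
&lt;p&gt;If you would rather script that step, a sketch like the following merges the entry into an existing config without touching other servers. The helper &lt;code&gt;add_maestro&lt;/code&gt; and the file name &lt;code&gt;mcp.json&lt;/code&gt; are assumptions for illustration; each MCP client keeps its config in its own location, so check your client's documentation for the real path.&lt;/p&gt;

```python
import json
import tempfile
from pathlib import Path

# The server entry from the JSON snippet above.
MAESTRO_ENTRY = {"command": "npx", "args": ["-y", "maestro-workflow-mcp"]}


def add_maestro(config_path):
    """Add the maestro server to an MCP config file, keeping other servers."""
    path = Path(config_path)
    config = json.loads(path.read_text()) if path.exists() else {}
    config.setdefault("mcpServers", {})["maestro"] = MAESTRO_ENTRY
    path.write_text(json.dumps(config, indent=2) + "\n")
    return config


# Demo against a throwaway file; "mcp.json" is a placeholder name.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = Path(tmp) / "mcp.json"
    demo_path.write_text(json.dumps({"mcpServers": {"other": {"command": "node"}}}))
    merged = add_maestro(demo_path)
    print(sorted(merged["mcpServers"]))  # ['maestro', 'other']
```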




&lt;h2&gt;Why This Isn't Just Another Prompt Collection&lt;/h2&gt;

&lt;p&gt;Most "AI skill" repos are prompt dumps. Maestro is an &lt;strong&gt;ecosystem&lt;/strong&gt;:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Feature&lt;/th&gt;
&lt;th&gt;Prompt dumps&lt;/th&gt;
&lt;th&gt;Maestro&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Structure&lt;/td&gt;
&lt;td&gt;Random .md files&lt;/td&gt;
&lt;td&gt;YAML frontmatter + versioned skills&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Flow&lt;/td&gt;
&lt;td&gt;Dead ends&lt;/td&gt;
&lt;td&gt;Every command recommends the next step&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Anti-patterns&lt;/td&gt;
&lt;td&gt;None&lt;/td&gt;
&lt;td&gt;Explicit "NEVER do X" in every skill&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Context&lt;/td&gt;
&lt;td&gt;Hope the AI figures it out&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;.maestro.md&lt;/code&gt; project context protocol&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Delivery&lt;/td&gt;
&lt;td&gt;Copy-paste files&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;npx skills add&lt;/code&gt; + MCP server + 10 providers&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Evaluation&lt;/td&gt;
&lt;td&gt;None&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;/diagnose&lt;/code&gt; scores 5 dimensions 1-5&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The ecosystem forms a loop:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;/teach-maestro → /diagnose → /fortify → /evaluate → /refine
       ↑                                                  |
       └──────────────── continuous improvement ──────────┘
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;






&lt;h2&gt;Real Example: What &lt;code&gt;/diagnose&lt;/code&gt; Found in My Project&lt;/h2&gt;

&lt;p&gt;I ran &lt;code&gt;/diagnose&lt;/code&gt; on my production app. It found:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Wallet service handling real money with zero test coverage.&lt;/strong&gt; Idempotency keys were implemented, but no tests verified they actually prevent double-credits. Score: Safety 2/5.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Two services using DB transactions without try/catch.&lt;/strong&gt; If a deadlock occurs, the exception bubbles unhandled and the user gets a raw 500 error.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Frontend deploying to Cloudflare Pages without &lt;code&gt;tsc --noEmit&lt;/code&gt;.&lt;/strong&gt; Type errors were reaching production undetected.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;
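
&lt;p&gt;For the first finding, the missing test is small. Here is a sketch of what "verify that idempotency keys prevent double-credits" can look like, using a toy in-memory &lt;code&gt;Wallet&lt;/code&gt; class. The class and its method names are hypothetical stand-ins, not code from my app; a real test would exercise the actual service and its database.&lt;/p&gt;

```python
class Wallet:
    """Toy in-memory wallet; a hypothetical stand-in for a real service."""

    def __init__(self):
        self.balance = 0
        self._seen_keys = set()

    def credit(self, amount, idempotency_key):
        """Apply a credit once per idempotency key; replays are no-ops."""
        if not amount > 0:
            raise ValueError("amount must be positive")
        if idempotency_key in self._seen_keys:
            return False  # duplicate request, balance unchanged
        self._seen_keys.add(idempotency_key)
        self.balance += amount
        return True


def test_replayed_credit_is_applied_once():
    wallet = Wallet()
    assert wallet.credit(100, "req-1") is True
    assert wallet.credit(100, "req-1") is False  # retry with the same key
    assert wallet.balance == 100


def test_distinct_keys_both_apply():
    wallet = Wallet()
    wallet.credit(100, "req-1")
    wallet.credit(50, "req-2")
    assert wallet.balance == 150


test_replayed_credit_is_applied_once()
test_distinct_keys_both_apply()
print("idempotency tests passed")
```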

&lt;p&gt;Each finding came with a specific command: &lt;code&gt;/fortify WalletService&lt;/code&gt;, &lt;code&gt;/guard financial-flows&lt;/code&gt;, &lt;code&gt;/fortify frontend-build&lt;/code&gt;.&lt;/p&gt;
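
&lt;p&gt;The second finding (transactions without try/catch) usually comes down to a retry wrapper. The following is a minimal sketch of that pattern, not the code &lt;code&gt;/fortify&lt;/code&gt; generated: &lt;code&gt;DeadlockError&lt;/code&gt; is a hypothetical stand-in for whatever exception your database driver raises, and it assumes the failed transaction has already been rolled back before the retry.&lt;/p&gt;

```python
import time


class DeadlockError(Exception):
    """Hypothetical stand-in for a driver-specific deadlock exception."""


def run_in_transaction(operation, retries=3, base_delay=0.05, sleep=time.sleep):
    """Run operation(), retrying on deadlock with exponential backoff."""
    for attempt in range(1, retries + 1):
        try:
            return operation()
        except DeadlockError as error:
            if attempt == retries:
                # Surface a clear failure instead of an unhandled exception.
                raise RuntimeError("transaction failed after retries") from error
            sleep(base_delay * 2 ** (attempt - 1))


# Demo: an operation that deadlocks twice, then commits.
calls = {"count": 0}


def flaky_transfer():
    calls["count"] += 1
    if calls["count"] in (1, 2):
        raise DeadlockError("deadlock detected")
    return "committed"


result = run_in_transaction(flaky_transfer, sleep=lambda seconds: None)
print(result, calls["count"])  # committed 3
```

&lt;p&gt;The caller can then catch the final &lt;code&gt;RuntimeError&lt;/code&gt; and return a proper error response instead of a raw 500.&lt;/p&gt;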

&lt;p&gt;That's the difference between "you should probably add tests" and "Run &lt;code&gt;/guard&lt;/code&gt; on your wallet service because your financial operations have zero test coverage and idempotency keys are unverified."&lt;/p&gt;




&lt;h2&gt;The Architecture&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;source/skills/           ← 21 skill definitions (source of truth)
├── agent-workflow/      ← Core skill + 7 reference docs
│   └── reference/       ← Prompt engineering, context mgmt, etc.
├── diagnose/            ← Analysis commands
├── fortify/             ← Fix commands
├── amplify/             ← Enhancement commands
└── teach-maestro/       ← Utility commands

scripts/
├── build.js             ← Copies to 10 provider directories
├── bundle-skills.js     ← Bundles into MCP server
└── validate.js          ← Validates frontmatter + references

mcp-server/              ← npm package: maestro-workflow-mcp
├── tools.ts             ← 4 MCP tools with template resolution
├── prompts.ts           ← 20 MCP prompts
└── resources.ts         ← 8 read-only knowledge resources
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;One source. 10 providers. One MCP server. Everything validated, bundled, and versioned.&lt;/p&gt;
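
&lt;p&gt;To give a feel for the validation step, here is a rough Python sketch of a frontmatter check. It is not a port of &lt;code&gt;validate.js&lt;/code&gt;: the required fields (&lt;code&gt;name&lt;/code&gt;, &lt;code&gt;description&lt;/code&gt;) are assumed for illustration, and the parser only handles flat &lt;code&gt;key: value&lt;/code&gt; lines rather than full YAML.&lt;/p&gt;

```python
# Assumed required fields; Maestro's real schema may differ.
REQUIRED_FIELDS = ("name", "description")


def parse_frontmatter(text):
    """Parse a leading '---' delimited block of flat 'key: value' lines."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        raise ValueError("missing opening '---'")
    fields = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return fields
        key, sep, value = line.partition(":")
        if sep and key.strip():
            fields[key.strip()] = value.strip()
    raise ValueError("missing closing '---'")


def validate_skill(text):
    """Return a list of problems; an empty list means the skill passes."""
    try:
        fields = parse_frontmatter(text)
    except ValueError as error:
        return [str(error)]
    return [f"missing field: {name}" for name in REQUIRED_FIELDS if not fields.get(name)]


good = "---\nname: diagnose\ndescription: Full workflow health audit\n---\n# Diagnose\n"
bad = "---\nname: fortify\n---\n# Fortify\n"
print(validate_skill(good))  # []
print(validate_skill(bad))   # ['missing field: description']
```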




&lt;h2&gt;What's Next&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;More references&lt;/strong&gt; — domain-specific guides for testing, deployment, observability&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Scoring trends&lt;/strong&gt; — track &lt;code&gt;/diagnose&lt;/code&gt; scores over time&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Community skills&lt;/strong&gt; — contribute your own commands via PR&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;Try It&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Install skills&lt;/span&gt;
npx skills add sharpdeveye/maestro

&lt;span class="c"&gt;# Or add the MCP server&lt;/span&gt;
&lt;span class="c"&gt;# → add to your mcp config: npx -y maestro-workflow-mcp&lt;/span&gt;

&lt;span class="c"&gt;# Then, inside your agent's chat, run your first diagnostic&lt;/span&gt;
/diagnose
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Will it find workflow slop? It will.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;GitHub:&lt;/strong&gt; &lt;a href="https://github.com/sharpdeveye/maestro" rel="noopener noreferrer"&gt;github.com/sharpdeveye/maestro&lt;/a&gt;&lt;br&gt;
&lt;strong&gt;npm:&lt;/strong&gt; &lt;a href="https://www.npmjs.com/package/maestro-workflow-mcp" rel="noopener noreferrer"&gt;maestro-workflow-mcp&lt;/a&gt;&lt;br&gt;
&lt;strong&gt;License:&lt;/strong&gt; MIT&lt;/p&gt;




&lt;p&gt;&lt;em&gt;If this saved you from one more "it seems to work" deployment, consider dropping a ⭐ on the repo. It helps more than you think.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>productivity</category>
      <category>claude</category>
    </item>
  </channel>
</rss>
