<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: gracefullight</title>
    <description>The latest articles on DEV Community by gracefullight (@gracefullight).</description>
    <link>https://dev.to/gracefullight</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1435105%2F42b0924f-ba4d-4504-b64b-07fc976c4a5e.jpg</url>
      <title>DEV Community: gracefullight</title>
      <link>https://dev.to/gracefullight</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/gracefullight"/>
    <language>en</language>
    <item>
      <title>oh-my-agent — A Production-Grade Multi AI IDE Agent Harness</title>
      <dc:creator>gracefullight</dc:creator>
      <pubDate>Wed, 25 Mar 2026 08:22:17 +0000</pubDate>
      <link>https://dev.to/gracefullight/oh-my-agent-a-production-grade-multi-ai-ide-agent-harness-4jh</link>
      <guid>https://dev.to/gracefullight/oh-my-agent-a-production-grade-multi-ai-ide-agent-harness-4jh</guid>
      <description>&lt;p&gt;When you tell an agent to "build a TODO app," it does build something. The problem is that it often builds the wrong thing, drifts out of scope, and repeats the same mistakes.&lt;/p&gt;

&lt;p&gt;To address this, structural approaches like AGENTS.md and, more recently, Skills have emerged. But looking at the skills actually being shared, a few recurring problems stand out:&lt;/p&gt;

&lt;p&gt;The most critical piece — library version information — is missing.&lt;br&gt;
Role descriptions end at hollow declarations like "You are a Senior engineer."&lt;br&gt;
Content that could be covered by a few keywords gets padded into lengthy prose, wasting tokens.&lt;/p&gt;

&lt;p&gt;As a result, these skills are poorly followed by models, burn context for nothing, and over time become dead code that nobody wants to open.&lt;/p&gt;

&lt;h2&gt;
  
  
  [Approach]
&lt;/h2&gt;

&lt;p&gt;With oh-my-agent, we wanted to solve this through process, not prompts. Instead of simply telling the agent to "redo it" when something goes wrong, we record why it went wrong and feed that back into the next run.&lt;/p&gt;

&lt;p&gt;The core mechanism is Clarification Debt (CD) Scoring. When the agent misinterprets a requirement or drifts out of scope, points accumulate:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;clarify: +10 — simple confirmation question&lt;/li&gt;
&lt;li&gt;correct: +25 — direction change due to misunderstood intent&lt;/li&gt;
&lt;li&gt;redo: +40 — rollback and restart due to scope deviation&lt;/li&gt;
&lt;li&gt;Starting work without checking the Charter: +15&lt;/li&gt;
&lt;li&gt;Modifying files outside the allowed scope: +20&lt;/li&gt;
&lt;li&gt;Repeating the same error: x1.5 multiplier&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Above 50 points, writing a Root Cause Analysis (RCA) is mandatory. Above 80, the session is halted. Lessons extracted are accumulated in lessons-learned.md and reflected from the very next session. Even with simple prompts, the process compensates.&lt;/p&gt;

&lt;p&gt;Beyond that, several common protocols keep the agent from going rogue:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Clarification Protocol&lt;/strong&gt; — Requirement ambiguity is classified as LOW / MEDIUM / HIGH. LOW means proceed, MEDIUM means present options, HIGH means stop and clarify first.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Difficulty Guide&lt;/strong&gt; — Tasks are categorized as Simple / Medium / Complex, adjusting the required protocol depth accordingly.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Context Budget&lt;/strong&gt; — Token budgets are set per model to reduce unnecessary context consumption.&lt;/p&gt;

&lt;p&gt;This approach aligns with the Harness Engineering concept discussed by OpenAI. Getting the most out of agents isn't a one-liner prompt problem — it's about what control structure you wrap around them.&lt;/p&gt;

&lt;h2&gt;
  
  
  [Project Structure]
&lt;/h2&gt;

&lt;p&gt;oh-my-agent manages all of this within the project directory.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;.agents/ = SSOT&lt;/strong&gt; — Skills, workflows, and configurations live under .agents/ as the single source of truth. No dependency on any specific IDE.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Role-based agent team&lt;/strong&gt; — Core roles include PM, QA, Frontend, Backend, Mobile, and Debug, with DB Agent and TF Infra Agent newly added.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;DB Agent: SQL / NoSQL / Vector DB modeling, including ISO 27001 security recommendations&lt;/li&gt;
&lt;li&gt;TF Infra Agent: Multi-cloud Terraform, OPA / Sentinel policies, ISO 42000 series control guidance&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Workflow-centric orchestration&lt;/strong&gt; — Planning, review, debug, and parallel execution form the default flow. The newly added /brainstorm workflow explores design before writing code: codebase analysis → clarification questions → approach proposal → user approval → design document saved, then followed by /plan → implementation.&lt;/p&gt;

&lt;h2&gt;
  
  
  [Two Orchestration Modes]
&lt;/h2&gt;

&lt;p&gt;&lt;code&gt;/coordinate&lt;/code&gt; is built for speed — iterate fast, fix problems as they surface. The PM breaks down tasks, dispatches agents, and QA runs a single review pass. If CRITICAL/HIGH issues appear, the affected task is re-run. It's a lightweight 7-step loop.&lt;/p&gt;

&lt;p&gt;&lt;code&gt;/ultrawork&lt;/code&gt; emphasizes quality gates. It's divided into five phases — PLAN → IMPL → VERIFY → REFINE → SHIP — each with a gate that blocks progression until passed. Of the 17 steps, 11 are reviews. The REFINE phase handles file splitting, deduplication, side-effect analysis, and dead code removal.&lt;/p&gt;

&lt;p&gt;It might seem like overkill, but as programming abstraction climbs from machine language to high-level languages and now to natural language, verification only becomes more critical — a point that's hard to argue with.&lt;/p&gt;

&lt;h2&gt;
  
  
  [Expansion Background]
&lt;/h2&gt;

&lt;p&gt;A month ago, this project launched as oh-my-ag, an orchestrator exclusive to Antigravity. Since then, multiple AI IDEs started adopting .agents/skills/ as the project skill path, and there was no longer a reason to keep it locked to a single IDE. So it was expanded into a universal harness format and became oh-my-agent.&lt;/p&gt;

&lt;h2&gt;
  
  
  [Getting Started]
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;curl &lt;span class="nt"&gt;-fsSL&lt;/span&gt; https://raw.githubusercontent.com/first-fluke/oh-my-agent/refs/heads/main/cli/install.sh | bash
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Supports all major AI IDEs: Antigravity, Claude Code, Codex CLI, Cursor, and more.&lt;/p&gt;

&lt;p&gt;If you're already using an AI IDE, give it a try. At the end of the day, the developer's goal is to hit QCD (Quality, Cost, Delivery) all at once. Agent-driven development is no exception — and that's the mindset behind this project.&lt;/p&gt;

&lt;p&gt;🔗 GitHub: &lt;a href="https://github.com/first-fluke/oh-my-agent" rel="noopener noreferrer"&gt;first-fluke/oh-my-agent&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>programming</category>
      <category>agents</category>
      <category>productivity</category>
    </item>
    <item>
      <title>Stop Micro-Managing Prompts: Meet oh-my-ag, the Multi-Agent Orchestrator for Antigravity</title>
      <dc:creator>gracefullight</dc:creator>
      <pubDate>Tue, 10 Feb 2026 05:08:59 +0000</pubDate>
      <link>https://dev.to/gracefullight/stop-micro-managing-prompts-meet-oh-my-ag-the-multi-agent-orchestrator-for-antigravity-4k0g</link>
      <guid>https://dev.to/gracefullight/stop-micro-managing-prompts-meet-oh-my-ag-the-multi-agent-orchestrator-for-antigravity-4k0g</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;According to &lt;strong&gt;Claude's recent 2026 Agentic Coding Trends Report&lt;/strong&gt;, software development is no longer just about "writing code"—it's about how well you orchestrate and supervise AI agents.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;AI is an incredible collaborator, but ensuring quality and security requires human oversight. We are moving toward a structure of &lt;strong&gt;collaboration rather than total delegation&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;The problem? Setting up this collaboration structure manually via prompts every single time is exhausting.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Problem with Single Agents
&lt;/h2&gt;

&lt;p&gt;If you've used &lt;strong&gt;Antigravity&lt;/strong&gt; or other CLI-based agents in real-world projects, you’ve likely faced these frustrations:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Ignored Skills:&lt;/strong&gt; The agent ignores its defined "Skills."&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Amnesia:&lt;/strong&gt; "Rules" are forgotten halfway through a long conversation.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Infinite Loops:&lt;/strong&gt; The agent enters an infinite loop of uncontrolled responses.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Busy but Useless:&lt;/strong&gt; It works hard, but doesn't actually do what you asked.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;We often blame the model, but in reality, this is an &lt;strong&gt;orchestration problem&lt;/strong&gt;. We didn't need to "write better prompts." We needed to provide a default agent collaboration structure.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;That question started &lt;code&gt;oh-my-ag&lt;/code&gt;.&lt;/strong&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  What is oh-my-ag?
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;oh-my-ag&lt;/strong&gt; is a role-based agent orchestration layer for Antigravity.&lt;/p&gt;

&lt;p&gt;Instead of shoving every bit of context into a single agent, it separates responsibilities clearly. The &lt;strong&gt;Orchestrator&lt;/strong&gt; controls the flow, &lt;strong&gt;sub-agents&lt;/strong&gt; focus on their specific domains, and context is shared via a common memory stream.&lt;/p&gt;

&lt;h3&gt;
  
  
  One-line Installation
&lt;/h3&gt;

&lt;p&gt;You can get a full agent team running in your project instantly:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;bunx oh-my-ag
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This command automatically configures a &lt;strong&gt;Role-Based Agent Set&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;PM (Product Manager):&lt;/strong&gt; Clarifies requirements and breaks down tasks.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Frontend &amp;amp; Backend:&lt;/strong&gt; Handle implementation in their respective domains.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Mobile:&lt;/strong&gt; Handles Flutter-based mobile implementation.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;QA:&lt;/strong&gt; Verifies implementation against requirements and checks edge cases.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Debug:&lt;/strong&gt; Analyzes failure causes and suggests fixes.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Each agent comes with optimized Skills and prompt structures for its role, all tied together by the Orchestrator.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why You Need This
&lt;/h2&gt;

&lt;h3&gt;
  
  
  1. Mitigating Model Volatility
&lt;/h3&gt;

&lt;p&gt;Model performance (like the recent fluctuations in Gemini 3 Pro) can change overnight due to updates or server loads. A single-agent structure with a massive prompt is incredibly fragile to these changes.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Solution:&lt;/strong&gt; &lt;code&gt;oh-my-ag&lt;/code&gt; splits roles and responsibilities. By using &lt;strong&gt;Serena Memory&lt;/strong&gt; as a shared brain, decisions and intermediate results are persisted. Even if a specific model response wobbles, the overall context remains intact.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  2. Clear Human Oversight Points
&lt;/h3&gt;

&lt;p&gt;The goal isn't to remove humans from the loop, but to clarify &lt;strong&gt;where the human needs to be&lt;/strong&gt;.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Solution:&lt;/strong&gt; Because roles like PM, QA, and Debug are separated, it becomes structurally obvious where AI has finished its job and where human judgment is required.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  3. Stop Reinventing the Prompt
&lt;/h3&gt;

&lt;p&gt;Typing out role descriptions, rules, and approval processes every time is unproductive.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Solution:&lt;/strong&gt; &lt;code&gt;oh-my-ag&lt;/code&gt; automates the prompt settings and connection structures via role-specific Skill bundles and a parallel-execution-ready orchestrator.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Key Features
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Role-Based Specialist Agent Sets&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Parallel Execution Orchestration Layer&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Multi-CLI Support&lt;/strong&gt; (Gemini CLI / Claude CLI / Codex CLI)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Serena Memory Integration&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;MCP-specific Tool Scope Control&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Automated Conventional Commits&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;In 2026, AI-Native development isn't about "how to use a tool"; it's about &lt;strong&gt;"how to manage the workflow."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This orchestration layer is designed with the &lt;a href="https://bit.ly/3L1frc0" rel="noopener noreferrer"&gt;first-fluke/fullstack-starter&lt;/a&gt; structure in mind and currently handles over &lt;strong&gt;50 commits a day&lt;/strong&gt; in full-stack web/mobile applications.&lt;/p&gt;

&lt;p&gt;If you are already using Antigravity, stop wrestling with prompts. Use &lt;code&gt;oh-my-ag&lt;/code&gt; and AI Pro to melt through those tokens across your 6 accounts efficiently.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Check it out on GitHub:&lt;/strong&gt; &lt;a href="https://bit.ly/4kbPpQh" rel="noopener noreferrer"&gt;first-fluke/oh-my-ag&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>productivity</category>
      <category>antigravity</category>
      <category>agentaichallenge</category>
    </item>
    <item>
      <title>A Production-Ready Monorepo for AI-Native Full-Stack Development</title>
      <dc:creator>gracefullight</dc:creator>
      <pubDate>Tue, 13 Jan 2026 06:38:31 +0000</pubDate>
      <link>https://dev.to/gracefullight/a-production-ready-monorepo-for-ai-native-full-stack-development-55ma</link>
      <guid>https://dev.to/gracefullight/a-production-ready-monorepo-for-ai-native-full-stack-development-55ma</guid>
      <description>&lt;p&gt;Andrej Karpathy recently wrote: “If you properly connect the things that emerged over the past year, they could easily become 10× more powerful.” &lt;br&gt;
At the same time, we’re being handed agents, sub-agents, prompts, context, MCP, workflows—tools dropped on us without a manual.&lt;/p&gt;

&lt;p&gt;Ashok, Tesla’s CTO, also said: “Everyone is a CEO now.”&lt;br&gt;
In other words, we’ve entered an era where a single developer can build a full-stack product end to end.&lt;/p&gt;

&lt;p&gt;Both statements are true. Spinning up a service with a few clicks has become easy. But building software that humans and AI can review together—and that remains maintainable over time—is still hard. To truly leverage these tools, you need a solid foundation first.&lt;/p&gt;

&lt;p&gt;I’m open-sourcing the foundation I built while developing AI-driven services as an AI SWE. Compared to starting from scratch, it should save you roughly two weeks.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Tech Stack&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Web: Next.js 16, React 19, TailwindCSS v4&lt;/li&gt;
&lt;li&gt;API: FastAPI, async SQLAlchemy, PostgreSQL&lt;/li&gt;
&lt;li&gt;Mobile: Flutter 3.38, Riverpod&lt;/li&gt;
&lt;li&gt;Infra: Terraform, GCP (Cloud Run, Cloud SQL)&lt;/li&gt;
&lt;li&gt;CI/CD: GitHub Actions + Workload Identity Federation (keyless)&lt;/li&gt;
&lt;li&gt;Observability: OpenTelemetry&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Why This Matters&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The quality of AI-generated code varies widely by model. &lt;/li&gt;
&lt;li&gt;A well-designed template provides clear patterns for AI to follow, while strict linting and CI act as guardrails.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Key Features&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;mise-based monorepo: unified toolchains for Node, Python, and Flutter&lt;/li&gt;
&lt;li&gt;Single-source i18n: shared across web and mobile&lt;/li&gt;
&lt;li&gt;Automatic API client generation: Orval (web), swagger_parser (mobile)&lt;/li&gt;
&lt;li&gt;Rust-based toolchain: Biome, uv, Turbopack&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Production patterns and troubleshooting that rarely make it into docs, encoded directly in the codebase&lt;/p&gt;

&lt;p&gt;If you see room for improvement, feel free to open an issue.&lt;/p&gt;

&lt;p&gt;GitHub: &lt;a href="https://bit.ly/3L1frc0" rel="noopener noreferrer"&gt;https://bit.ly/3L1frc0&lt;/a&gt;&lt;/p&gt;

</description>
      <category>vibecoding</category>
      <category>programming</category>
      <category>webdev</category>
      <category>ai</category>
    </item>
  </channel>
</rss>
