Ahmed Ibrahim

Posted on May 20

Why AI Coding Agents Fail Senior Engineers (And What I Built to Fix It)

#ai #opensource #claude #openai

Why AI Coding Agents Fail Senior Engineers (And What I Built to Fix It)

AI coding agents are impressive. They write syntax-correct code. They know a thousand libraries.
They can implement anything you describe in seconds.

And that's exactly the problem.

The Speed Trap

You describe a feature. The agent produces code. You ship it. It works.

For a while.

Then the requirements change. The team grows. The system scales. The codebase becomes
unmaintainable in ways that feel mysterious but are entirely predictable.

The code worked. The architecture was never designed.

I've been there. I gave Claude a task, it jumped to code, I shipped it, and three months later I was completely rewriting it because there was no architecture.

What Senior Engineers Actually Do

Watch a Staff Engineer approach a new task. They don't open their editor.

They ask questions. They read existing code. They draw diagrams. They think about what happens in eighteen months when someone else maintains this. They document decisions — not to cover themselves, but because they know that future context is as valuable as present code.

By the time they write the first line of implementation, they know:

Why this architecture was chosen over alternatives
What the failure modes are and how they're handled
Where the system will need to change as it grows
What a new developer will need to understand to work on this

AI agents, left ungoverned, skip all of this. They optimize for the immediate request, not for the long-term health of the system.

The Gap

This isn't a tool problem. It's a process problem.

I built Kavro to close that gap.

Kavro is an open-source framework that enforces a 7-phase Staff-level engineering
workflow on top of any AI coding agent.

The 7 Phases (No Shortcuts)

Phase 1: Deep Research & Understanding
Understand the business goal, technical context, domain patterns, and risks.
Produce a Research Summary.

Code? No.

Phase 2: System Design & Architecture
Produce a complete Technical Blueprint with API contracts, data design, error handling
strategy, and a decision log for every major choice.

Code? No.

Phase 3: Task Decomposition
Break the project into atomic, independently-verifiable tasks with explicit dependencies,
scope, and validation steps.

Code? No.

Phase 4: Documentation
Establish the documentation baseline before implementation.

Code? No.

Phase 5: Prompt Orchestration
Generate precise, context-injected prompts for each task based on Phases 1-4.

Code? No.

Phase 6: Agent Selection
Decide which agent (or model tier) handles each task based on complexity.

Code? Not yet.

Phase 7: Governance
Execute continuously while validating output against the blueprint. Catch drift.

NOW code can be written. And it's good code.

Why This Works

The best engineers don't jump to solutions. They understand the problem deeply.

Kavro automates that discipline. The AI agent can't skip ahead. Can't vibe-code.
Can't punt decisions to later. The workflow is enforced.

What It Looks Like

You: "Build me a notification service"
Without Kavro:
→ Claude writes code immediately
With Kavro:
→ Phase 1: Asks about business goals, risks, technical context
→ Produces Research Summary
→ Phase 2: Technical Blueprint with full decision documentation
→ Phase 3: Task decomposition with dependencies
→ Phase 4: Documentation structure
→ Phase 5-7: Orchestrated execution with continuous validation
→ Result: Ship with confidence

Open Source

Kavro is MIT licensed and works on:

Claude Code
Claude.ai
Codex CLI
Cursor
Windsurf

Built on the agentskills.io open standard so it travels with developers, not with tools.

GitHub: https://github.com/a7medalyapany/kavro

Installation

git clone https://github.com/a7medalyapany/kavro.git
cd kavro
bash scripts/install.sh

For Claude.ai upload:

bash scripts/build.sh --claude
# Upload dist/kavro-claude.zip → Settings → Skills

The Long Vision

Kavro v1.0 is a skill — instructions that wrap around an AI agent.

The longer vision is a governance service that tracks architectural decisions across
sessions, detects drift over time, and gives engineering teams visibility into how
their AI agents are building systems.

Not surveillance. Accountability.

For Teams

If your team uses different AI tools (some use Claude, some use Cursor, some use Codex),
Kavro enforces the same discipline across all of them. One framework. Multiple agents.

The core belief behind Kavro is simple: A problem understood deeply is already half-solved.

Code written without architecture is technical debt by design.

Kavro makes AI agents think like senior engineers. It's the difference between
"works now" and "works forever."

If you've ever had to rewrite AI-generated code, give it a star.

Think before you build.

Top comments (1)

Harjot Singh • May 29

senior eng's complaint is usually 'the agent doesnt know what i wouldn't do' - taste gap, not knowledge gap. ur taking the fix at the IDE-context layer which is the right move there. at the gen-pipeline layer we encode senior-instincts as hard constraints (forbid-pattern phase, enforce-types phase) - similar idea different surface. $3 per shipped saas. happy to swap notes on encoding-senior-judgment-as-rules.