xu xu

Posted on Jun 22

I Trained Claude Code to Write Its Own Skills — Then Watched It Spiral Out of Control

#ai #programming #python #devrel

Your Claude Code instance just generated a new skill. Fine. But this skill wasn't in your repo yesterday. It appeared organically — spawned by another skill you wrote three weeks ago. You didn't approve it. Nobody reviewed it. And when you trace the commit history, you find a chain of skills authoring skills, each iteration slightly diverging from the original intent.

This isn't a hypothetical. It's what happened when a Japanese developer documented their self-propagating Claude Code agent environment on Qiita — and the engineering community is still arguing about whether it's brilliant or terrifying.

The concept is seductive: instead of manually crafting every Claude Code skill, you write a "meta-skill" that can write other skills. The agent becomes a force multiplier. One developer, one well-crafted seed skill, and suddenly you have an ecosystem that grows without constant babysitting.

I've been running a simplified version of this pattern for six months. Here's what nobody tells you about the self-propagating agent trap — and why the Japanese dev community's approach differs sharply from what Western AI influencers are promoting.

The Self-Referential Spiral

The core mechanism is deceptively simple. You write a skill that, when given a task description, generates a new skill file. That new skill can then be used by Claude Code, and some implementations allow skills to trigger the skill-writing process again. You get recursion for free.

On paper, this is elegant. In practice, I watched my agent drift in three predictable ways:

Capability Creep: Each auto-generated skill inherited the "solve for everything" tendency baked into the seed skill. Within two months, my Claude Code instance had skills for domains I don't work in — distributed systems patterns, obscure Python libraries, configuration formats for tools I'd never used. The agent wasn't solving my problems; it was building a general-purpose toolkit because the meta-skill optimization target was "completeness," not "relevance."

Semantic Drift: Skills written by skills don't stay faithful to the original specification. The second generation loses nuance. The third generation might as well be written by someone who read a summary of a summary. I found skills that referenced libraries that no longer existed in the ecosystem — the code was generated after the dependency was deprecated, but the skill generator hadn't been updated.

Debuggability Collapse: When a hand-written skill fails, you know where to look. When an auto-generated skill fails, and it was generated by another auto-generated skill, you're tracing through layers of abstraction to find the root cause. My average debugging session for self-propagated skills ran 4x longer than for manually written ones.

The Japanese Dev Community's Answer

What struck me about the Qiita discussion is the pragmatic tone. Japanese developers approaching this problem aren't asking "how do we make AI write more code?" They're asking "how do we constrain the generation space?"

The consensus in the JP dev community leans toward bounded self-improvement: writing meta-skills that operate within strict parameter limits, with explicit revocation mechanisms. Think of it as containerization for your agent's capability growth. The skill can spawn children, but those children are sandboxed — they can't spawn further, or they can only spawn within a specific capability domain.

This contrasts sharply with the Western approach, which tends toward "move fast and let the agent evolve." Japanese engineering culture has a stronger emphasis on failing safely — the system should degrade gracefully, not spiral into undefined behavior.

The practical implementation difference:

Western pattern: Unconstrained skill trees, maximize capability surface
Japanese pattern: Constrained skill trees with explicit lifecycle management, manual approval gates for skills that exceed scope

In my own implementation, I've adopted a hybrid. Skills can auto-generate other skills, but every generation-2 skill requires explicit activation. Generation-3 skills are disabled entirely — the complexity multiplier isn't worth the marginal capability gain.

The Skeptical Take

Here's where I push back on the self-propagating agent hype: the optimization target is almost never specified correctly.

When you write a meta-skill that generates other skills, you're implicitly defining "good skill" in code. If that definition is "useful for the task at hand," you've created a system that adapts to immediate needs without considering long-term maintainability. If the definition is "comprehensive," you get capability creep. If it's "minimal," you get skills that are too narrow to be reusable.

I haven't found a meta-skill optimization target that correctly balances immediate utility against long-term system health. The Japanese approach of constraining the generation space is the right instinct — but it requires knowing in advance what the constraints should be, which requires understanding your problem domain well enough that you might not need self-propagating skills in the first place.

The real cost: For every hour you save by auto-generating skills, expect to spend two hours validating, constraining, and debugging the generated artifacts. The efficiency gain only materializes if your domain is stable and well-understood — at which point, why are you delegating the skill-writing to an agent?

Forward-Looking Warning

By Q4 2026, I expect to see the first production failures attributed to unconstrained AI skill propagation. The pattern is too seductive to resist, and the failure modes are subtle enough that they'll only become apparent under load or edge cases. Watch for incident reports where "the agent created a skill that violated our security policy" — it's coming.

The teams that avoid this trap will be the ones that treated skill generation as a governed process, not an autonomous one. Bounded self-improvement, not unlimited recursion.

Anti-Atrophy Checklist

Audit your skill tree quarterly — trace the lineage of every auto-generated skill. If you can't explain why a skill exists, it shouldn't exist.
Set explicit depth limits — I recommend disabling generation beyond level 2. The marginal utility of level-3 skills doesn't justify the debuggability cost.
Instrument your skill usage — track which skills are actually invoked vs. which exist but never fire. Prune aggressively. A skill that sits unused is technical debt with a pulse.
Write one skill by hand per month — maintain the muscle memory. If you can't remember how to write a skill without AI assistance, you've already lost the ability to evaluate whether the AI-generated version is correct.
Define "done" before you generate — for any meta-skill you write, explicitly document what constitutes success and failure. Without this, the agent optimizes for the wrong target, and you won't notice until the skill tree is unrecognizable.

What's your take?

Has your team experimented with self-propagating agent patterns? What constraints have worked — or failed — in your context? I'd love to hear what you're seeing in practice.

Drop a comment below — I respond to every one.

Research source: Qiita (Japan's largest developer community), trending post on autonomous AI agent frameworks

Discussion: What's your experience with AI skill generation? Have you found effective constraints that prevent capability creep while still capturing the efficiency gains?

Top comments (1)

Mike Czerwinski • Jun 22

The three failure modes you describe — capability creep, semantic drift, debuggability collapse — are the same shape from three different vantage points, and the shape is "the agent is allowed to mutate the system that produces its own oversight." Once skill-generation is recursive and ungated, the optimization target stops being utility and quietly becomes graph saturation: more skills, finer specialization, harder to retrace.

The cost only shows up at debug time because the failure is silent at write time. The debugger isn't paying for the autonomy; the operator three months later is. That's the asymmetry that breaks long-running systems — the entity benefiting from the write isn't the entity who pays when it has to be unwound.

The fix I keep landing on is making generation cost something at the gate: every new skill needs an operator-signed verifiable_by pointer (what existing capability does this build on, what test confirms it still belongs in the set), and demotion stays expensive while creation stays cheap-but-tracked. Spirals happen when both directions are free. Yours stopped feeling free at roughly the four-hour debug mark — that's the system finally pricing the work it had been letting the agent expense.