<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Gabriel Gripp</title>
    <description>The latest articles on DEV Community by Gabriel Gripp (@grippado).</description>
    <link>https://dev.to/grippado</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F362126%2F3dfd3425-7dba-4745-bd99-d1ad46b44fb5.jpg</url>
      <title>DEV Community: Gabriel Gripp</title>
      <link>https://dev.to/grippado</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/grippado"/>
    <language>en</language>
    <item>
      <title>I audited my own Claude Code setup and found 21 issues in 72 artifacts</title>
      <dc:creator>Gabriel Gripp</dc:creator>
      <pubDate>Thu, 23 Apr 2026 22:36:14 +0000</pubDate>
      <link>https://dev.to/grippado/i-audited-my-own-claude-code-setup-and-found-21-issues-in-72-artifacts-35gl</link>
      <guid>https://dev.to/grippado/i-audited-my-own-claude-code-setup-and-found-21-issues-in-72-artifacts-35gl</guid>
      <description>&lt;h2&gt;
  
  
  The accidental sprawl
&lt;/h2&gt;

&lt;p&gt;I wanted to know how many Claude Code agents I had accumulated. The answer was uncomfortable:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="nv"&gt;$ &lt;/span&gt;&lt;span class="nb"&gt;ls&lt;/span&gt; ~/.claude/agents/ | &lt;span class="nb"&gt;wc&lt;/span&gt; &lt;span class="nt"&gt;-l&lt;/span&gt;
47
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Plus 7 slash commands, 18 &lt;code&gt;CLAUDE.md&lt;/code&gt; memory files scattered across project directories, and a handful of skills. Seventy-two artifacts total, and no clear picture of which ones actually earned their keep.&lt;/p&gt;

&lt;p&gt;Like most people using Claude Code for a while, I'd added a new agent every time I hit a new type of task. Writing docs? New agent. Reviewing PRs? New agent. Refactoring Go code? New agent. Refactoring TypeScript? Well... probably another one, because I forgot the first one existed.&lt;/p&gt;

&lt;p&gt;Nothing was &lt;em&gt;wrong&lt;/em&gt;, exactly. But nothing was &lt;em&gt;audited&lt;/em&gt; either.&lt;/p&gt;

&lt;p&gt;So I built a tool to audit it.&lt;/p&gt;

&lt;h2&gt;
  
  
  What it detects
&lt;/h2&gt;

&lt;p&gt;&lt;code&gt;claude-atlas&lt;/code&gt; scans &lt;code&gt;~/.claude/&lt;/code&gt; plus any repos you point it at, then runs six heuristic detectors to find issues:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Exact duplicates&lt;/strong&gt; — identical SHA-256 hash. Someone copy-pasted.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Semantic duplicates&lt;/strong&gt; — Jaccard similarity ≥ 0.60 over tokenized body + description, minus domain stopwords.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Trigger collisions&lt;/strong&gt; — 2+ artifacts share distinctive trigger tokens. These will compete for activation.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Overrides&lt;/strong&gt; — a project-scoped artifact shadows a same-named global one.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;References&lt;/strong&gt; — one artifact's body mentions another's name.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Containment&lt;/strong&gt; — memory files cluster the other artifacts in their &lt;code&gt;.claude/&lt;/code&gt; root (useful for UI grouping).&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Each issue gets a severity classification: &lt;code&gt;high&lt;/code&gt;, &lt;code&gt;medium&lt;/code&gt;, &lt;code&gt;low&lt;/code&gt;. You use &lt;code&gt;--max-severity&lt;/code&gt; to decide which level should make CI fail.&lt;/p&gt;

&lt;h2&gt;
  
  
  Running it on my own setup
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight console"&gt;&lt;code&gt;&lt;span class="gp"&gt;$&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;claude-atlas check &lt;span class="nt"&gt;--quiet&lt;/span&gt;
&lt;span class="go"&gt;Found 21 issues (2 medium, 19 low) in 72 artifacts.
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Now the interesting part. Two &lt;code&gt;MEDIUM&lt;/code&gt; severity issues:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;/Users/grippado/.claude/agents/refactor-scout.md
  MEDIUM trigger_collision: shared triggers:
    analyzes, improvements, opportunities, refactoring
  paired with: refactorer.md

/Users/grippado/.claude/agents/testing/api-tester.md
  MEDIUM trigger_collision: shared triggers:
    comprehensive, performance, specializes, testing
  paired with: performance-benchmarker.md
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Caught me red-handed. &lt;code&gt;refactor-scout&lt;/code&gt; and &lt;code&gt;refactorer&lt;/code&gt; are &lt;em&gt;definitely&lt;/em&gt; two agents doing the same job, with 4 distinctive triggers in common. Same for &lt;code&gt;api-tester&lt;/code&gt; and &lt;code&gt;performance-benchmarker&lt;/code&gt;, they share "comprehensive", "performance", "specializes", "testing" as triggers. Classic drift: created one, forgot about it, created another similar weeks later.&lt;/p&gt;

&lt;p&gt;The remaining 19 &lt;code&gt;LOW&lt;/code&gt; severity issues are interesting too, agents like &lt;code&gt;ui-designer&lt;/code&gt;, &lt;code&gt;visual-storyteller&lt;/code&gt;, &lt;code&gt;backend-architect&lt;/code&gt; and &lt;code&gt;frontend-developer&lt;/code&gt; all share vague triggers like "building", "designing", "implementing". The tool correctly flags these but keeps them at low severity because the overlap is plausibly intentional (they all do somewhat adjacent work). This is exactly the noise floor triage you want: surface the real problems, flag the borderline cases, don't drown me in false positives.&lt;/p&gt;

&lt;h2&gt;
  
  
  The noise-reduction work that mattered most
&lt;/h2&gt;

&lt;p&gt;An earlier version of the tool found ~100 issues in the same 72 artifacts. Most of them were garbage.&lt;/p&gt;

&lt;p&gt;The reason: triggers like "agent", "user", "task", "code", "proactively" appear in nearly every agent's description. Jaccard similarity doesn't know these are meaningless, they're just tokens with high frequency across the corpus. So every pair of agents looked "similar" by definition.&lt;/p&gt;

&lt;p&gt;The fix was a domain stopword list:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;_DOMAIN_STOPWORDS&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
    &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;agent&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;agents&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;skill&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;skills&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;command&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;commands&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;user&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;users&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;task&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;tasks&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;work&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;working&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;code&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;file&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;files&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;project&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;projects&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;proactively&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;automatically&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="c1"&gt;# ... etc
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Plus a rule: trigger tokens shorter than 5 characters (&lt;code&gt;"api"&lt;/code&gt;, &lt;code&gt;"app"&lt;/code&gt;, &lt;code&gt;"go"&lt;/code&gt;) don't count toward collision detection either.&lt;/p&gt;

&lt;p&gt;Dropping domain stopwords took my real setup from ~100 false-positives to 21 actionable issues. Single biggest improvement in the whole project.&lt;/p&gt;

&lt;h2&gt;
  
  
  The UI mistake I made
&lt;/h2&gt;

&lt;p&gt;First version had an Obsidian-style graph as the main view. 72 nodes, connecting edges everywhere, orange lines for trigger collisions, red for duplicates. Looked cool.&lt;/p&gt;

&lt;p&gt;It was the wrong choice.&lt;/p&gt;

&lt;p&gt;Graph views are good for &lt;strong&gt;discovering&lt;/strong&gt; unknown structure, "oh, I didn't realize these were connected". They're mediocre for &lt;strong&gt;deciding&lt;/strong&gt; what to do, "which of these two agents should I delete?". The work users actually want to do on an audit tool is triage, not exploration.&lt;/p&gt;

&lt;p&gt;So v0.4.0 (planned) inverts the priority: triage dashboard with per-issue cards becomes the default, graph moves to a secondary tab. I wrote that realization up in the &lt;a href="https://github.com/grippado/claude-atlas/blob/main/ROADMAP.md" rel="noopener noreferrer"&gt;public roadmap&lt;/a&gt; before implementing it, as a way to commit to the change and invite feedback.&lt;/p&gt;

&lt;h2&gt;
  
  
  Stack
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Python 3.11+&lt;/strong&gt; with &lt;code&gt;uv&lt;/code&gt; for package management&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;typer&lt;/strong&gt; for the CLI, &lt;strong&gt;rich&lt;/strong&gt; for terminal output&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;python-frontmatter&lt;/strong&gt; for parsing (with a regex fallback for the malformed YAML that broke the parser in my real setup, &lt;code&gt;studio-coach&lt;/code&gt;'s multi-line description crashed the parser until I patched it)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;networkx&lt;/strong&gt; for graph math&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;chevron&lt;/strong&gt; (Mustache) for the HTML report&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;cytoscape.js&lt;/strong&gt; for the graph view&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Anthropic SDK&lt;/strong&gt; (optional, via &lt;code&gt;--semantic&lt;/code&gt;) for LLM-as-judge refinement of suspected duplicates&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;MIT license. 31 tests passing. Roadmap published. CI-friendly exit codes.&lt;/p&gt;

&lt;h2&gt;
  
  
  Try it
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Install (choose your favorite)&lt;/span&gt;
pip &lt;span class="nb"&gt;install &lt;/span&gt;claude-atlas
&lt;span class="c"&gt;# or&lt;/span&gt;
uv tool &lt;span class="nb"&gt;install &lt;/span&gt;claude-atlas  
&lt;span class="c"&gt;# or&lt;/span&gt;
pipx &lt;span class="nb"&gt;install &lt;/span&gt;claude-atlas

&lt;span class="c"&gt;# Quick check&lt;/span&gt;
claude-atlas check

&lt;span class="c"&gt;# Full interactive report&lt;/span&gt;
claude-atlas scan &lt;span class="nt"&gt;--output&lt;/span&gt; atlas.html
open atlas.html

&lt;span class="c"&gt;# For CI / pre-commit hooks&lt;/span&gt;
claude-atlas check &lt;span class="nt"&gt;--max-severity&lt;/span&gt; high &lt;span class="nt"&gt;--format&lt;/span&gt; github
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a href="https://github.com/grippado/claude-atlas" rel="noopener noreferrer"&gt;https://github.com/grippado/claude-atlas&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Meta: dogfooding with Claude.ai
&lt;/h2&gt;

&lt;p&gt;This was also an experiment in fast dogfooded development with Claude.ai as a pair-programming partner. Not "Claude wrote the code", more like: Claude challenged my design decisions, caught bugs in code reviews, pushed back on features I wanted to build before they made sense, and pointed out when I was over-engineering something (including pointing out when the graph view was the wrong UX).&lt;/p&gt;

&lt;p&gt;From zero to v0.3.0 with 31 passing tests and public roadmap: ~4 intensive sessions over a week. The v0.4.0 dashboard reformulation is parked for ~2 weeks of real-world usage before I build it, because I wanted to let the tool teach me what the dashboard should actually show.&lt;/p&gt;

&lt;p&gt;If you've been building Claude Code artifacts for a while, give your own &lt;code&gt;~/.claude/&lt;/code&gt; a scan. Curious what others find.&lt;/p&gt;

</description>
      <category>claudecode</category>
      <category>ai</category>
      <category>python</category>
      <category>opensource</category>
    </item>
  </channel>
</rss>
