DEV Community

pponali
pponali

Posted on

How I Cut Claude Code Token Consumption by 96% with the Skill Vault Pattern

The Problem: 181 Skills Burning 1.18 Million Tokens Per Session

I'm a power user of Claude Code — Anthropic's CLI for AI-assisted development. Over weeks of installing skill packs from GitHub repos, I accumulated 181 skills from multiple sources:

  • gstack agents (QA, design, deploy, monitoring)
  • marketingskills by Corey Haines (36 CRO/SEO/copywriting skills)
  • wondelai/skills (42 product/engineering frameworks)
  • superpowers by obra (14 dev methodology skills)
  • claude-mem (persistent memory plugin)
  • Deep-Research-skills (structured research workflows)
  • Custom skills I built myself

Sounds great, right? More skills = more capability.

Wrong. Every new conversation, Claude Code injects ALL skill descriptions into the system prompt. With 181 skills, that's approximately 1.18 million tokens of overhead per session — before I even type my first message.

Tokens were burning like a wildfire. Every task was expensive. Context windows were filling up fast.

The Discovery: Measuring the Actual Cost

I ran a simple audit to see exactly what was happening:

# Count total SKILL.md bytes across all skills
total=0
for f in ~/.claude/skills/*/SKILL.md; do
  size=$(wc -c < "$f" 2>/dev/null)
  total=$((total + size))
done
echo "Total bytes: $total"
echo "Approx tokens: $((total / 4))"
Enter fullscreen mode Exit fullscreen mode

Result: 4.7MB of SKILL.md files = ~1.18 million tokens.

The top offenders were massive:

Skill Size Tokens
ship (gstack) 130KB ~32K
plan-ceo-review 112KB ~28K
office-hours 101KB ~25K
seedance-15-real-estate 95KB ~24K
plan-devex-review 93KB ~23K

15 Seedance video skills alone consumed ~265K tokens. I used them maybe once a month.

The Solution: The Skill Vault Pattern

The idea is simple:

  1. Move rarely-used skills to a vault directory (out of auto-load)
  2. Keep a lightweight index skill that tells Claude what's available
  3. Claude restores skills on-demand when your request matches one
  4. Optionally move it back after the task

Step 1: Create the Vault

mkdir -p ~/.claude/skills-vault
Enter fullscreen mode Exit fullscreen mode

Step 2: Audit Your Skills by Usage Frequency

Sort all skills by file size to find the biggest offenders:

for f in ~/.claude/skills/*/SKILL.md; do
  dir=$(dirname "$f")
  name=$(basename "$dir")
  size=$(wc -c < "$f")
  echo "$size $name"
done | sort -rn | head -30
Enter fullscreen mode Exit fullscreen mode

Then categorize them:

  • Tier 1 (Daily use): Keep active — your core dev workflow skills
  • Tier 2 (Project-specific): Keep active — skills tied to current projects
  • Tier 3 (Occasional): Vault — heavy skills used weekly/monthly
  • Tier 4 (Rare): Vault — frameworks and reference skills

Step 3: Move Skills to the Vault

# Move video generation skills (rarely used)
mv ~/.claude/skills/seedance-* ~/.claude/skills-vault/

# Move heavy plan review skills
mv ~/.claude/skills/{plan-ceo-review,plan-eng-review,plan-design-review} \
   ~/.claude/skills-vault/

# Move framework/reference skills
mv ~/.claude/skills/{clean-architecture,domain-driven-design,system-design} \
   ~/.claude/skills-vault/

# Move marketing skills you don't use daily
mv ~/.claude/skills/{copywriting,seo-audit,paid-ads,cold-email} \
   ~/.claude/skills-vault/
Enter fullscreen mode Exit fullscreen mode

Step 4: Create the Index Skill (The Key Ingredient)

This is what makes the pattern work. Create a lightweight skill that acts as a lookup table for Claude:

mkdir -p ~/.claude/skills/skill-vault
Enter fullscreen mode Exit fullscreen mode

Create ~/.claude/skills/skill-vault/SKILL.md:

---
name: skill-vault
description: Index of vaulted skills at ~/.claude/skills-vault/.
  When the user's request matches a vaulted skill, restore it with
  mv ~/.claude/skills-vault/<name> ~/.claude/skills/ then use it.
  Use when user asks about design, SEO, CRO, copywriting,
  architecture patterns, security audit, deployment, video
  generation, or any topic not covered by active skills.
---

## Skill Vault

Skills stored in `~/.claude/skills-vault/` to save tokens.
When a user request matches one, restore it:

mv ~/.claude/skills-vault/<skill-name> ~/.claude/skills/

Then invoke it normally. After the task, optionally move back:

mv ~/.claude/skills/<skill-name> ~/.claude/skills-vault/

## Vaulted Skills Index

### Design & UI
- `design-review` — Visual QA, spacing/hierarchy fixes
- `design-html` — Production HTML/CSS from mockups
- `design-shotgun` — Generate multiple design variants
- `top-design` — Awwwards-quality web experiences

### Engineering Frameworks
- `clean-architecture` — Dependency rule, ports/adapters
- `domain-driven-design` — Bounded contexts, aggregates
- `system-design` — Distributed systems, scaling
- `refactoring-patterns` — Extract method, code smells

### DevOps & Monitoring
- `land-and-deploy` — Merge, CI, verify production
- `canary` — Post-deploy monitoring
- `cso` — Security audit (OWASP, STRIDE)

### Marketing
- `copywriting` — Marketing copy for any page
- `seo-audit` — Technical SEO audit
- `paid-ads` — Google/Meta/LinkedIn campaigns

(... add all your vaulted skills here ...)
Enter fullscreen mode Exit fullscreen mode

This index costs only ~1.5K tokens but gives Claude awareness of all 139 vaulted skills.

The Results

Metric Before After Reduction
Active skills 181 43 76% fewer
Tokens per session ~1.18M ~51K 96%
Vaulted (on-demand) 0 139
Capability lost None 0%

96% token reduction with zero capability loss.

How It Works at Runtime

You: "audit my design for visual issues"

Claude thinks:
  1. No active skill matches "design audit"
  2. skill-vault index matches: design-review
  3. Run: mv ~/.claude/skills-vault/design-review ~/.claude/skills/
  4. Now use design-review skill normally
  5. Task complete
Enter fullscreen mode Exit fullscreen mode

Claude reads the vault index, finds the right skill, restores it with a single mv command, and proceeds normally. The user experience is seamless.

My Final Active Skill Set (43 Skills)

Here's what I kept always-active:

Core Dev Workflow (Superpowers):
ship, qa, browse, review, investigate, writing-plans, executing-plans, brainstorming, subagent-driven-development, systematic-debugging, test-driven-development, verification-before-completion

Project-Specific:
ecommerce-architect, staff-engineer-interview, claude-api-patterns, claude-code-mastery, mcp-server-development

Marketing (Khetisahayak):
cmo, mkt-content, mkt-seo, mkt-social, mkt-growth, mkt-email, mkt-pr, mkt-review

Research:
research, research-deep, research-report

The Vault Index:
skill-vault (the 1.5K token index that knows about 139 other skills)

Tips for Your Own Vault

  1. Audit first. Run the size audit script before moving anything. You might be surprised which skills are the heaviest.

  2. Keep your daily drivers active. Don't vault skills you use every session. The restore step adds a small delay.

  3. Group by category. Makes it easy to restore a whole category:

   mv ~/.claude/skills-vault/seedance-* ~/.claude/skills/
Enter fullscreen mode Exit fullscreen mode
  1. Update the index when you add new skills to the vault.

  2. Restore everything if you need full power for a complex session:

   mv ~/.claude/skills-vault/* ~/.claude/skills/
Enter fullscreen mode Exit fullscreen mode

Where to Find Great Skills

Here are the skill repos I installed from:

Installing is simple — symlink into ~/.claude/skills/:

git clone https://github.com/coreyhaines31/marketingskills.git
for skill_dir in marketingskills/skills/*/; do
  ln -s "$(pwd)/$skill_dir" ~/.claude/skills/$(basename "$skill_dir")
done
Enter fullscreen mode Exit fullscreen mode

Then vault what you don't need daily.

Conclusion

The Skill Vault pattern is dead simple:

  • ~/.claude/skills/ = active skills (loaded every turn)
  • ~/.claude/skills-vault/ = dormant skills (restored on demand)
  • skill-vault/SKILL.md = lightweight index that bridges the two

If you're running more than 50 skills in Claude Code, you're probably burning hundreds of thousands of unnecessary tokens per session. The vault pattern gives you the full arsenal without the cost.

96% fewer tokens. Zero lost capability. One mv command away.


I'm Prakash Ponali, a Staff Engineer with 16+ years in enterprise eCommerce. Currently building Khetisahayak — a farming helper app for Telugu-speaking farmers in Andhra Pradesh. Find me on LinkedIn.

Top comments (0)