<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: wuykjl</title>
    <description>The latest articles on DEV Community by wuykjl (@wuykjl).</description>
    <link>https://dev.to/wuykjl</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3982086%2F3f6ff167-30da-4686-9d6c-056716df2d49.png</url>
      <title>DEV Community: wuykjl</title>
      <link>https://dev.to/wuykjl</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/wuykjl"/>
    <language>en</language>
    <item>
      <title>How I turned 18% skill hit rate into 95% — without calling an embedding API once</title>
      <dc:creator>wuykjl</dc:creator>
      <pubDate>Sun, 14 Jun 2026 10:43:19 +0000</pubDate>
      <link>https://dev.to/wuykjl/how-i-turned-18-skill-hit-rate-into-95-without-calling-an-embedding-api-once-1881</link>
      <guid>https://dev.to/wuykjl/how-i-turned-18-skill-hit-rate-into-95-without-calling-an-embedding-api-once-1881</guid>
      <description>&lt;p&gt;My Claude Code setup grew to 152 skills. Every session, all 152 descriptions got dumped into the system prompt. The LLM scanned them blindly. When I later checked, keyword matching hit the right skill 18% of the time.&lt;/p&gt;

&lt;p&gt;I had three options:&lt;/p&gt;

&lt;p&gt;Call text-embedding-3 on every query — costs money, adds latency&lt;br&gt;
Accept 18% as "good enough"&lt;br&gt;
Build something&lt;br&gt;
What I built&lt;br&gt;
neuro-skill — a hybrid skill router that runs entirely on my machine. No API calls. No GPU. 5ms per query.&lt;/p&gt;

&lt;p&gt;It fuses five signals via Reciprocal Rank Fusion:&lt;/p&gt;

&lt;p&gt;BM25 keyword recall → Cosine feature similarity → Graph spreading activation&lt;br&gt;
→ Collaborative filtering personalization → Optional LLM rerank&lt;br&gt;
The results&lt;br&gt;
Core domains: 95% Hit@1, 100% Hit@3&lt;br&gt;
Chinese queries: fixed from 0% to 87% with bilingual keyword coverage&lt;br&gt;
Multi-skill orchestration: router.plan("review + fix + deploy") returns ordered execution steps&lt;br&gt;
Personalization: learns which skills you pick, not just which ones rank highest&lt;br&gt;
An independent tester validated it on 332 skills (Hermes + ECC) — v0.7.1 hit 77% overall.&lt;/p&gt;

&lt;p&gt;Architecture decisions that mattered&lt;br&gt;
Feature matrix, not embeddings. 50 hand-crafted features (17 broad domains + 32 precise languages/actions) beat TF-IDF by 5× without any model.&lt;br&gt;
Graph over threshold. k-NN graph (k=adaptive) keeps diffusion working at any skill count — 0.5% density with full adjacency was dead on arrival.&lt;br&gt;
RRF over weighted sum. Min-max normalization broke when BM25 scored 0–15 and cosine gave −1 to 1. Reciprocal Rank Fusion only cares about rank position.&lt;br&gt;
LLM as opt-in, not default. Haiku rerank is a 5th signal you toggle on when you need semantic nuance. Otherwise it's zero-cost.&lt;br&gt;
MCP from day one. Claude Code, Cursor, Codex, Windsurf — all auto-discover the tools. No plugin installs.&lt;br&gt;
What I learned&lt;br&gt;
The algorithm ceiling is real. BM25 + cosine + graph + RRF + CF + LLM rerank — that's six layers. After that, gains are marginal. The remaining bottleneck is feature coverage, not algorithm design.&lt;/p&gt;

&lt;p&gt;Also: three independent projects (neuro-skill, agent-skill-finder, SkillRouter paper) converged on the same architecture from different starting points. That's when you know you're on the right track.&lt;/p&gt;

&lt;p&gt;Links&lt;br&gt;
GitHub: github.com/wuykjl/neuro-skill&lt;br&gt;
pip install neuro-skill&lt;br&gt;
MIT licensed. 54 tests. 28 commits over 12 hours.&lt;/p&gt;

&lt;p&gt;I built this for myself. It turned out useful for others too. If your agent has 50+ skills and keyword matching is letting you down, try it.&lt;/p&gt;

</description>
      <category>ai</category>
    </item>
  </channel>
</rss>
