<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Jason Zhu</title>
    <description>The latest articles on DEV Community by Jason Zhu (@gosailglobal).</description>
    <link>https://dev.to/gosailglobal</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3903785%2Ffc528f6f-9f7f-457c-9730-45be49a60c09.jpg</url>
      <title>DEV Community: Jason Zhu</title>
      <link>https://dev.to/gosailglobal</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/gosailglobal"/>
    <language>en</language>
    <item>
      <title>I Indexed 67,000 Open-Source AI Agent Projects. Here's What's Actually Inside.</title>
      <dc:creator>Jason Zhu</dc:creator>
      <pubDate>Wed, 29 Apr 2026 07:38:52 +0000</pubDate>
      <link>https://dev.to/gosailglobal/i-indexed-67000-open-source-ai-agent-projects-heres-whats-actually-inside-edg</link>
      <guid>https://dev.to/gosailglobal/i-indexed-67000-open-source-ai-agent-projects-heres-whats-actually-inside-edg</guid>
      <description>&lt;p&gt;A few months ago I started a side project called &lt;a href="https://agentskillshub.top" rel="noopener noreferrer"&gt;AgentSkillsHub&lt;/a&gt; — a directory that indexes every meaningful open-source AI agent project on GitHub: MCP servers, Claude Skills, Codex Skills, agent tools, the works.&lt;/p&gt;

&lt;p&gt;Six months in, the database has 67,196 projects, refreshed every 8 hours.&lt;/p&gt;

&lt;p&gt;I expected to find a healthy ecosystem with a long tail. What I actually found was so lopsided I had to stop and write &lt;a href="https://agentskillshub.top/book/" rel="noopener noreferrer"&gt;a 12-chapter book about it&lt;/a&gt; (free, CC BY-NC-SA, &lt;a href="https://github.com/zhuyansen/skill-blue-book/releases/tag/v1.0" rel="noopener noreferrer"&gt;PDF&lt;/a&gt;).&lt;/p&gt;

&lt;p&gt;This post is a 1,500-word version. If any of it surprises you, the source data is open and reproducible.&lt;/p&gt;




&lt;h2&gt;
  
  
  TL;DR (5 findings)
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;The Gini coefficient of star distribution is 0.983&lt;/strong&gt; — more lopsided than the App Store (0.95), npm (0.93), or YouTube (0.87)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;54% of all projects have 0 stars.&lt;/strong&gt; Not "few stars." Zero.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The top 1% of projects own 83% of all stars&lt;/strong&gt; in the entire ecosystem&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Monthly new project creation 45×'d&lt;/strong&gt; between January 2025 and March 2026&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The best engineering pattern I found&lt;/strong&gt; isn't more stars or better code — it's &lt;code&gt;MISTAKES.md&lt;/code&gt; files (only 2.8% of top projects have one)&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;I'll unpack each below, plus three things my own data &lt;strong&gt;proved I was wrong about&lt;/strong&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  Finding 1: A new world record for inequality
&lt;/h2&gt;

&lt;p&gt;The Gini coefficient measures distribution inequality on a 0–1 scale. 0 = perfect equality (everyone has the same), 1 = one person owns everything.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;AI Agent Projects (2026):  0.983
App Store (2024):          0.95
npm packages (2022):       0.93
YouTube videos (2020):     0.87
US wealth (2023):          0.40
China wealth (2023):       0.47
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;I checked the math three times. The 67K project dataset really does have a Gini of 0.983.&lt;/p&gt;

&lt;p&gt;For context: &lt;strong&gt;the bottom 50% of all open-source AI agent projects own 0.4% of the stars&lt;/strong&gt;. The top 0.1% (about 67 projects) own about half of all the stars on the platform.&lt;/p&gt;

&lt;p&gt;This isn't a "long tail." This is a needle and a desert.&lt;/p&gt;

&lt;h2&gt;
  
  
  Finding 2: 54% have zero stars
&lt;/h2&gt;

&lt;p&gt;Of 67,196 projects:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;36,346 have exactly 0 stars&lt;/strong&gt; (54.1%)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;47,381 have ≤ 5 stars&lt;/strong&gt; (70.5%)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Only 1,693 have ≥ 100 stars&lt;/strong&gt; (2.5%)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Only 403 have ≥ 1,000 stars&lt;/strong&gt; (0.6%)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The 36K zero-star projects represent a real human behavior: someone wrote a Skill or MCP server, pushed it to GitHub, and &lt;strong&gt;literally no one&lt;/strong&gt; — not even themselves on a different account — clicked the star button.&lt;/p&gt;

&lt;p&gt;Most of these aren't spam. They're earnest first attempts. Someone learned about Skills, wrote one in a weekend, pushed it, and then never came back.&lt;/p&gt;

&lt;h2&gt;
  
  
  Finding 3: 1% own 83%
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Top 1% of projects:    83.2% of all stars
Top 10%:               96.8% of all stars
Bottom 90%:            3.2% of all stars
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;If you're an open-source agent author and you're not in the top 10%, &lt;strong&gt;mathematically you are competing for 3% of the visible attention&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;The cliff between #1,000 and #10,000 in star rank is steeper than between #100 and #1,000.&lt;/p&gt;

&lt;h2&gt;
  
  
  Finding 4: The 2026 supply explosion
&lt;/h2&gt;

&lt;p&gt;Monthly new agent projects, by year:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Month&lt;/th&gt;
&lt;th&gt;Count&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;2024 Jan&lt;/td&gt;
&lt;td&gt;~50&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2024 Dec&lt;/td&gt;
&lt;td&gt;~280&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2025 Jun&lt;/td&gt;
&lt;td&gt;~620&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2025 Dec&lt;/td&gt;
&lt;td&gt;~1,400&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;2026 Mar&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;~27,720&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;That's a 45× jump from 2024's monthly average to one month in 2026.&lt;/p&gt;

&lt;p&gt;What changed? Three things compounded:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Anthropic published the Skill Spec&lt;/strong&gt; (October 2025), giving creators a concrete format&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Claude Code shipped &lt;code&gt;~/.claude/skills/&lt;/code&gt;&lt;/strong&gt; (February 2026), making installation one-step&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cursor + Codex CLI added Skill loading&lt;/strong&gt; in the same quarter&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;When the format went from "5 commands and one config file" to "drop a folder, done," the supply curve broke.&lt;/p&gt;

&lt;p&gt;The demand curve hasn't kept up. Hence the 54% zero-star rate.&lt;/p&gt;

&lt;h2&gt;
  
  
  Finding 5: The 2.8% who write &lt;code&gt;MISTAKES.md&lt;/code&gt;
&lt;/h2&gt;

&lt;p&gt;Of the top 500 projects (≥500 stars), I checked which had files like &lt;code&gt;MISTAKES.md&lt;/code&gt;, &lt;code&gt;LESSONS.md&lt;/code&gt;, or &lt;code&gt;POSTMORTEM.md&lt;/code&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Only 14 do (2.8%).&lt;/strong&gt; Of those, 8 are forks/templates. Real authors actively recording mistakes: &lt;strong&gt;6&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;Those 6 projects' mean quality score is &lt;strong&gt;55.8 vs 47.2&lt;/strong&gt; for the Top 500 average. A +8.6 delta on a 100-point scale.&lt;/p&gt;

&lt;p&gt;Sample size is small, but the signal is loud: &lt;strong&gt;the engineers who write down their mistakes ship better Skills&lt;/strong&gt;. This isn't a Skill design rule. It's a personality trait that leaks into the artifact.&lt;/p&gt;

&lt;p&gt;If I had to pick one signal to predict whether a Skill will still be alive in 6 months, it'd be "does the author maintain a &lt;code&gt;MISTAKES.md&lt;/code&gt;?" — beating star count, commit frequency, and quality score combined.&lt;/p&gt;




&lt;h2&gt;
  
  
  Three things I was wrong about
&lt;/h2&gt;

&lt;p&gt;This is the part I almost didn't write.&lt;/p&gt;

&lt;h3&gt;
  
  
  Wrong #1: "Quality score will surface hidden gems"
&lt;/h3&gt;

&lt;p&gt;I built a 6-dimension quality score (completeness, clarity, specificity, examples, README structure, agent readiness). It's open-source: &lt;a href="https://github.com/zhuyansen/agent-skills-hub/blob/main/backend/app/services/quality_analyzer.py" rel="noopener noreferrer"&gt;quality_analyzer.py&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;I assumed: &lt;em&gt;if I rank by quality instead of stars, the underrated stuff will float up.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Reality: &lt;strong&gt;quality and stars correlate at r = 0.71&lt;/strong&gt;. The hidden gems mostly aren't hidden — they're below the noise floor. Quality scoring helps within tiers (separating B from A), but it doesn't move pages from rank 5,000 to rank 50.&lt;/p&gt;

&lt;h3&gt;
  
  
  Wrong #2: "Categories will help users find what they need"
&lt;/h3&gt;

&lt;p&gt;I categorized everything into 7 buckets (mcp-server, claude-skill, codex-skill, agent-tool, etc.). 9.5% of projects ended up in &lt;code&gt;uncategorized&lt;/code&gt; — too generic to classify.&lt;/p&gt;

&lt;p&gt;Bigger problem: &lt;strong&gt;users don't search by category.&lt;/strong&gt; They search by use case ("PDF parsing", "code review", "Slack integration"). Category is an artifact of how I think, not how anyone uses the site.&lt;/p&gt;

&lt;p&gt;I had to build 58 separate &lt;code&gt;/best/{scenario}/&lt;/code&gt; landing pages to fix this.&lt;/p&gt;

&lt;h3&gt;
  
  
  Wrong #3: "Verified Creator badges will reward real authors"
&lt;/h3&gt;

&lt;p&gt;I designed a Verified Creator program with strict criteria. I even pre-filled the founding member list with 4 well-known names from the ecosystem.&lt;/p&gt;

&lt;p&gt;I forgot to ask them.&lt;/p&gt;

&lt;p&gt;One of them politely said "I haven't actually joined." I pulled the entire list within 4 hours. The lesson — &lt;strong&gt;never pre-announce someone else's name without consent, even when it makes your launch look better&lt;/strong&gt; — is in chapter 10 of the book if you want the full postmortem.&lt;/p&gt;




&lt;h2&gt;
  
  
  The chart that explains everything
&lt;/h2&gt;

&lt;p&gt;If you take one image away, take this one:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;       LOG SCALE — stars distribution (67K projects)

10K █▍
 5K █████▎
 1K ████████████████▎
500 ███████████████████████████████▎
100 ████████████████████████████████████████████████▎
 10 █████████████████████████████████████████████████████████████████████████████▎
  0 ██████████████████████████████████████████████████████████████████████████████████████████████████ (54%)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Every healthy ecosystem looks like a Pareto curve. This one looks like a wall.&lt;/p&gt;




&lt;h2&gt;
  
  
  What this means if you build agent stuff
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;For consumers&lt;/strong&gt;: don't grade Skills by stars alone. The signal stops being useful below the top 1%. Look at &lt;code&gt;MISTAKES.md&lt;/code&gt;, recent commits, and whether the README has decision rules vs. prose.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;For authors&lt;/strong&gt;: you're competing in a market where 99% of attention goes to 1% of projects. Either invest in becoming top 1% (months of consistent shipping) or pick a niche where the top 1% doesn't exist yet (most domain-specific MCPs).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;For platform builders&lt;/strong&gt;: the constraint isn't supply. It's discovery. Whoever solves "how do I find the right Skill in 30 seconds" wins more than whoever ships the next 1,000 Skills.&lt;/p&gt;




&lt;h2&gt;
  
  
  Where to dig deeper
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Source data&lt;/strong&gt;: &lt;a href="https://agentskillshub.top" rel="noopener noreferrer"&gt;agentskillshub.top&lt;/a&gt; — same data, browsable + searchable&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The book&lt;/strong&gt; (free): &lt;a href="https://agentskillshub.top/book/" rel="noopener noreferrer"&gt;Skill Blue Book 2026&lt;/a&gt; — 12 chapters with full methodology, charts, scripts. PDF on &lt;a href="https://github.com/zhuyansen/skill-blue-book/releases/tag/v1.0" rel="noopener noreferrer"&gt;GitHub Releases&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Repo&lt;/strong&gt;: &lt;a href="https://github.com/zhuyansen/agent-skills-hub" rel="noopener noreferrer"&gt;github.com/zhuyansen/agent-skills-hub&lt;/a&gt; — quality scoring algorithm, sync pipeline, all open&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Find me&lt;/strong&gt;: &lt;a href="https://x.com/GoSailGlobal" rel="noopener noreferrer"&gt;@GoSailGlobal&lt;/a&gt; on X&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If you found this useful and want the daily Top 10 picks of newly-indexed projects, AgentSkillsHub has a &lt;a href="https://agentskillshub.top/#newsletter" rel="noopener noreferrer"&gt;free newsletter&lt;/a&gt; (Mondays only).&lt;/p&gt;

&lt;p&gt;If you have data that contradicts any of this, please post it. The point of indexing 67K projects publicly is so the conclusions can be checked.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Cover image and methodology from&lt;/em&gt; &lt;a href="https://agentskillshub.top/book/ch03-market-landscape/" rel="noopener noreferrer"&gt;Skill 蓝皮书 2026&lt;/a&gt; (Chapter 3). &lt;em&gt;All data snapshots are reproducible — pull the script from&lt;/em&gt; &lt;a href="https://github.com/zhuyansen/skill-blue-book/blob/main/data/ch03_analysis.py" rel="noopener noreferrer"&gt;data/ch03_analysis.py&lt;/a&gt;.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>mcp</category>
      <category>claude</category>
    </item>
  </channel>
</rss>
