<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Yugant Hadiyal</title>
    <description>The latest articles on DEV Community by Yugant Hadiyal (@yugantm).</description>
    <link>https://dev.to/yugantm</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3954556%2F828f9e8c-eba9-4fe0-99af-c84b52d00b36.png</url>
      <title>DEV Community: Yugant Hadiyal</title>
      <link>https://dev.to/yugantm</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/yugantm"/>
    <language>en</language>
    <item>
      <title>I Ranked 171 AI Agents by Trust — Here's What I Found About Safety and Transparency</title>
      <dc:creator>Yugant Hadiyal</dc:creator>
      <pubDate>Wed, 27 May 2026 15:34:37 +0000</pubDate>
      <link>https://dev.to/yugantm/i-ranked-171-ai-agents-by-trust-heres-what-i-found-about-safety-and-transparency-2imk</link>
      <guid>https://dev.to/yugantm/i-ranked-171-ai-agents-by-trust-heres-what-i-found-about-safety-and-transparency-2imk</guid>
      <description>&lt;p&gt;AI agents are everywhere. LangChain, AutoGPT, CrewAI, Dify, n8n — there are hundreds of open-source agent frameworks now, and the list keeps growing.&lt;/p&gt;

&lt;p&gt;But here's the question nobody is asking: &lt;strong&gt;which ones can you actually trust?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I spent the last month building &lt;a href="https://hvtracker.net" rel="noopener noreferrer"&gt;HVTracker&lt;/a&gt;, an open trust registry that scores 171 AI agents across five dimensions: Activity, Adoption, Transparency, Safety, and Identity.&lt;/p&gt;

&lt;p&gt;This post shares what I found.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Problem
&lt;/h2&gt;

&lt;p&gt;Most developers pick an AI agent framework based on GitHub stars and vibes. Stars measure popularity, not trustworthiness. A repo with 100K stars can still have:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;No security policy&lt;/li&gt;
&lt;li&gt;No signed commits&lt;/li&gt;
&lt;li&gt;No supply chain provenance&lt;/li&gt;
&lt;li&gt;A failing OSSF Scorecard&lt;/li&gt;
&lt;li&gt;No license at all&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Stars tell you what's trendy. They don't tell you what's safe to deploy in production.&lt;/p&gt;

&lt;h2&gt;
  
  
  How HVTrust Scoring Works
&lt;/h2&gt;

&lt;p&gt;Every agent gets a composite trust score from 0–100 across five dimensions:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Dimension&lt;/th&gt;
&lt;th&gt;Max Points&lt;/th&gt;
&lt;th&gt;What It Measures&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Activity&lt;/td&gt;
&lt;td&gt;25&lt;/td&gt;
&lt;td&gt;Recent commits, release freshness&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Adoption&lt;/td&gt;
&lt;td&gt;20&lt;/td&gt;
&lt;td&gt;GitHub stars, npm/PyPI downloads&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Transparency&lt;/td&gt;
&lt;td&gt;20&lt;/td&gt;
&lt;td&gt;License, docs, OSSF Scorecard&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Safety&lt;/td&gt;
&lt;td&gt;20&lt;/td&gt;
&lt;td&gt;OSSF score, provenance, signed commits&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Identity&lt;/td&gt;
&lt;td&gt;15&lt;/td&gt;
&lt;td&gt;Verification status, evidence coverage&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Each agent also gets an &lt;strong&gt;Evidence Grade&lt;/strong&gt; (A through D) based on how many independent signal types we could verify:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Grade A&lt;/strong&gt;: 4+ signal types (GitHub + downloads + scorecard + provenance)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Grade B&lt;/strong&gt;: 3 signal types&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Grade C&lt;/strong&gt;: 2 signal types&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Grade D&lt;/strong&gt;: GitHub only&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Surprising Findings
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;High stars doesn't mean high trust.&lt;/strong&gt; Several agents with 100K+ stars scored below 50/100 on trust because they lack basic security hygiene.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Transparency is the weakest dimension across the board.&lt;/strong&gt; Most agents have a license and README, but very few have OSSF Scorecards, signed commits, or provenance attestations.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Smaller projects sometimes score higher on safety.&lt;/strong&gt; Projects that adopted Sigstore, SLSA provenance, or GitHub's artifact attestations early tend to outperform larger projects that grew before these tools existed.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Only a handful of agents achieve Grade A evidence.&lt;/strong&gt; Most sit at Grade B or C — meaning we can only partially verify their trust signals from independent sources.&lt;/p&gt;

&lt;h2&gt;
  
  
  What Signals We Track
&lt;/h2&gt;

&lt;p&gt;HVTracker pulls data from multiple independent sources every 4 hours:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;GitHub API&lt;/strong&gt; — stars, forks, commits, license, last push date&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;npm / PyPI&lt;/strong&gt; — weekly downloads, provenance attestations&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;OSSF Scorecard&lt;/strong&gt; (via deps.dev) — security practices score&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;GitHub Search API&lt;/strong&gt; — fingerprint-based public actions&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Algolia HN API&lt;/strong&gt; — Hacker News mentions in the last 30 days&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;All signals refresh automatically via staggered GitHub Actions cron jobs — 6 batches per day, full cycle in 24 hours.&lt;/p&gt;

&lt;h2&gt;
  
  
  It's Fully Open
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;The full dataset is CC BY 4.0: &lt;a href="https://hvtracker.net/data/latest.json" rel="noopener noreferrer"&gt;hvtracker.net/data/latest.json&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;The scoring methodology is documented: &lt;a href="https://hvtracker.net/methodology" rel="noopener noreferrer"&gt;hvtracker.net/methodology&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;The source code is on GitHub: &lt;a href="https://github.com/YugantM/hvtracker" rel="noopener noreferrer"&gt;github.com/YugantM/hvtracker&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Every agent has an individual profile page with all raw signals&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;There's no login, no tracking, no backend — it's a static site on GitHub Pages.&lt;/p&gt;

&lt;h2&gt;
  
  
  Embeddable Trust Badges
&lt;/h2&gt;

&lt;p&gt;Example badges for LangChain: &lt;code&gt;HVTrust: 85.0&lt;/code&gt; &lt;code&gt;Grade: B&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;See them live: &lt;a href="https://hvtracker.net/badge/langchain.svg" rel="noopener noreferrer"&gt;hvtracker.net/badge/langchain.svg&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Embed them in your README:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight markdown"&gt;&lt;code&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="nv"&gt;![HVTrust&lt;/span&gt;&lt;span class="p"&gt;](&lt;/span&gt;&lt;span class="sx"&gt;https://hvtracker.net/badge/YOUR-AGENT.svg&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;](https://hvtracker.net/agents/YOUR-AGENT)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  What's Next
&lt;/h2&gt;

&lt;p&gt;I'm working on:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Agent comparison tool (compare 2–3 agents side by side)&lt;/li&gt;
&lt;li&gt;7-day trust trend indicators&lt;/li&gt;
&lt;li&gt;Agent submission via GitHub Issues&lt;/li&gt;
&lt;li&gt;Reputation event history (track trust changes over time)&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Try It
&lt;/h2&gt;

&lt;p&gt;Browse the registry: &lt;a href="https://hvtracker.net" rel="noopener noreferrer"&gt;hvtracker.net&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Find your favorite agent. Check its trust score. You might be surprised.&lt;/p&gt;

&lt;p&gt;I'd love feedback on the scoring methodology — especially whether the dimension weights feel right to you. Drop a comment or open an issue on GitHub.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Built solo as an open-source project. If you find it useful, a star on &lt;a href="https://github.com/YugantM/hvtracker" rel="noopener noreferrer"&gt;GitHub&lt;/a&gt; would mean a lot.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>security</category>
      <category>showdev</category>
    </item>
  </channel>
</rss>
