<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Jeffrey.Feillp</title>
    <description>The latest articles on DEV Community by Jeffrey.Feillp (@3969129510).</description>
    <link>https://dev.to/3969129510</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3844715%2Fedd93d4a-c2b1-4216-a6c3-12718365d563.jpg</url>
      <title>DEV Community: Jeffrey.Feillp</title>
      <link>https://dev.to/3969129510</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/3969129510"/>
    <language>en</language>
    <item>
      <title>I Tracked Every AI Hallucination for a Week — The Numbers Were Worse Than I Thought (1779876020708)</title>
      <dc:creator>Jeffrey.Feillp</dc:creator>
      <pubDate>Wed, 27 May 2026 10:00:21 +0000</pubDate>
      <link>https://dev.to/3969129510/i-tracked-every-ai-hallucination-for-a-week-the-numbers-were-worse-than-i-thought-1779876020708-598g</link>
      <guid>https://dev.to/3969129510/i-tracked-every-ai-hallucination-for-a-week-the-numbers-were-worse-than-i-thought-1779876020708-598g</guid>
      <description>&lt;p&gt;Last week I ran an experiment. Every time my AI agent generated an output, I verified it manually and logged whether it was correct.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The results were embarrassing.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Out of 200 outputs across Claude, GPT, and DeepSeek:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;36 were confidently wrong (18%)&lt;/li&gt;
&lt;li&gt;12 fabricated citations or references&lt;/li&gt;
&lt;li&gt;8 tried to use tools with hallucinated arguments&lt;/li&gt;
&lt;li&gt;4 leaked system prompt content&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That's nearly a fifth of my token budget going to outputs I had to manually catch and redo.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why this happens
&lt;/h2&gt;

&lt;p&gt;LLMs are optimized to sound convincing, not to be correct. When they hit uncertainty, they fill gaps with plausible-looking content. The problem is that plausible != true, and in code, "plausible but wrong" costs hours to debug.&lt;/p&gt;

&lt;h2&gt;
  
  
  What I built
&lt;/h2&gt;

&lt;p&gt;A verification layer that sits between the model and your workspace. It runs after generation but before the output reaches your codebase:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Citation checker&lt;/strong&gt; — validates references against actual sources&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Code validator&lt;/strong&gt; — checks syntax and logical consistency&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Safety leak detector&lt;/strong&gt; — catches leaked system prompts&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Argument verifier&lt;/strong&gt; — checks tool call parameters against schemas&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Coherence scorer&lt;/strong&gt; — compares output against the original prompt&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;All runs in under 100ms on CPU. Model-agnostic. Free.&lt;/p&gt;

&lt;p&gt;Download: &lt;a href="https://agent-download-site.vercel.app" rel="noopener noreferrer"&gt;https://agent-download-site.vercel.app&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Try auditing your own agent's outputs for a day. You might be surprised what you find.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>opensource</category>
      <category>productivity</category>
    </item>
    <item>
      <title>I Was Paying for Hallucinated Outputs — Here's What I Did About It (1779868666273)</title>
      <dc:creator>Jeffrey.Feillp</dc:creator>
      <pubDate>Wed, 27 May 2026 07:57:47 +0000</pubDate>
      <link>https://dev.to/3969129510/i-was-paying-for-hallucinated-outputs-heres-what-i-did-about-it-1779868666273-1e3c</link>
      <guid>https://dev.to/3969129510/i-was-paying-for-hallucinated-outputs-heres-what-i-did-about-it-1779868666273-1e3c</guid>
      <description>&lt;p&gt;Every time an AI agent hallucinates, you pay twice.&lt;/p&gt;

&lt;p&gt;Once in tokens. Once in debugging time.&lt;/p&gt;

&lt;p&gt;I tracked my token usage over a month and found that &lt;strong&gt;~18% of all API calls&lt;/strong&gt; produced outputs that were either wrong, fabricated, or irrelevant. That's nearly a fifth of my budget gone to confident nonsense.&lt;/p&gt;

&lt;h2&gt;
  
  
  The hidden cost of hallucinations
&lt;/h2&gt;

&lt;p&gt;When an agent confidently returns the wrong code:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;You spend 15 minutes reviewing it (trusting it, usually)&lt;/li&gt;
&lt;li&gt;You spend 30 minutes debugging why it doesn't work&lt;/li&gt;
&lt;li&gt;You spend 10 minutes writing a new prompt to fix it&lt;/li&gt;
&lt;li&gt;The agent generates another wrong answer&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This loop repeats until you catch it. And you don't always catch it.&lt;/p&gt;

&lt;h2&gt;
  
  
  What I built instead
&lt;/h2&gt;

&lt;p&gt;A verification layer that sits between the model and my workspace. It runs &lt;strong&gt;after&lt;/strong&gt; the model generates but &lt;strong&gt;before&lt;/strong&gt; the output touches my codebase.&lt;/p&gt;

&lt;p&gt;It checks:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Are there fabricated citations? (common in research tasks)&lt;/li&gt;
&lt;li&gt;Is the code syntactically valid? (surprisingly often, no)&lt;/li&gt;
&lt;li&gt;Does the output contain leaked system prompts? (happens more than you'd think)&lt;/li&gt;
&lt;li&gt;Are there safety refusals disguised as answers?&lt;/li&gt;
&lt;li&gt;Does the output actually address the input prompt?&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  The result
&lt;/h2&gt;

&lt;p&gt;My token waste dropped from ~18% to under 3%. The verification runs in under 100ms on CPU. No GPU needed.&lt;/p&gt;

&lt;p&gt;Download: &lt;a href="https://agent-download-site.vercel.app" rel="noopener noreferrer"&gt;https://agent-download-site.vercel.app&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Free, model-agnostic, runs anywhere Python runs. Check your own hallucination rate — you might be surprised what you're paying for.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Built for developers who want their agents to actually be useful.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>opensource</category>
      <category>productivity</category>
    </item>
    <item>
      <title>Stop Wasting Tokens on Hallucinated AI Outputs — Free Fix (1779866082)</title>
      <dc:creator>Jeffrey.Feillp</dc:creator>
      <pubDate>Wed, 27 May 2026 07:14:42 +0000</pubDate>
      <link>https://dev.to/3969129510/stop-wasting-tokens-on-hallucinated-ai-outputs-free-fix-1779866082-ajm</link>
      <guid>https://dev.to/3969129510/stop-wasting-tokens-on-hallucinated-ai-outputs-free-fix-1779866082-ajm</guid>
      <description>&lt;p&gt;Every AI agent hallucinates. Claude Code does it. ChatGPT does it. Every major model does it.&lt;/p&gt;

&lt;p&gt;The problem isn't the model — it's that &lt;strong&gt;no one is checking the output before it reaches your workspace&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;I spent months watching agents confidently return wrong code, invented API calls, and fake file paths. Then I built a verification layer that catches all of it.&lt;/p&gt;

&lt;h2&gt;
  
  
  What it does
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;13 detectors&lt;/strong&gt; that scan every output for hallucinations, safety refusals leaked as content, fabricated citations, system prompt leaks&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;31 correction strategies&lt;/strong&gt; that fix issues automatically&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Knowledge graph cross-referencing&lt;/strong&gt; to validate factual claims&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Model-agnostic&lt;/strong&gt; — works with Claude, GPT, DeepSeek, Llama, any provider&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;0 GPU required&lt;/strong&gt; — runs on CPU in under 100ms&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Why I built it
&lt;/h2&gt;

&lt;p&gt;I was losing hours to hallucinated outputs. An agent would confidently tell me it had edited a file — but the file was unchanged. It would fabricate API responses that looked real but didn't exist.&lt;/p&gt;

&lt;p&gt;The verification layer sits between the model and your workspace. It doesn't just flag issues — it surfaces a correction before the output touches your codebase.&lt;/p&gt;

&lt;h2&gt;
  
  
  How to get it
&lt;/h2&gt;

&lt;p&gt;Download: &lt;a href="https://agent-download-site.vercel.app" rel="noopener noreferrer"&gt;https://agent-download-site.vercel.app&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Free, model-agnostic, CPU-only. No strings attached.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Built as an open-source tool for the AI community.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>opensource</category>
      <category>productivity</category>
    </item>
    <item>
      <title>Tian AI: I Built an AI Assistant That Runs 100% Offline on My Phone (No Cloud, No Subscription)</title>
      <dc:creator>Jeffrey.Feillp</dc:creator>
      <pubDate>Wed, 27 May 2026 07:01:37 +0000</pubDate>
      <link>https://dev.to/3969129510/tian-ai-i-built-an-ai-assistant-that-runs-100-offline-on-my-phone-no-cloud-no-subscription-150j</link>
      <guid>https://dev.to/3969129510/tian-ai-i-built-an-ai-assistant-that-runs-100-offline-on-my-phone-no-cloud-no-subscription-150j</guid>
      <description>&lt;h1&gt;
  
  
  Tian AI: I Built an AI Assistant That Runs 100% Offline on My Phone
&lt;/h1&gt;

&lt;p&gt;I got tired of paying $20/month for ChatGPT, sending my private conversations to servers I don't control, and being useless without internet. So I built my own AI that runs entirely on my phone.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Problem
&lt;/h2&gt;

&lt;p&gt;Every mainstream AI has the same three problems:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Your data leaves your device&lt;/strong&gt; — Every query goes to someone else's server&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Subscription fees&lt;/strong&gt; — $10-200/month, forever&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;No offline mode&lt;/strong&gt; — Useless when you have no signal&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;I wanted something that works like Jarvis from Iron Man — a private AI that lives on my device, knows my data, and works anywhere.&lt;/p&gt;

&lt;h2&gt;
  
  
  What I Built: Tian AI
&lt;/h2&gt;

&lt;p&gt;Tian AI is an &lt;strong&gt;open-source, self-evolving AI system&lt;/strong&gt; that runs completely offline on Android (via Termux), Linux, or any device that can run Python.&lt;/p&gt;

&lt;h3&gt;
  
  
  Core Specs
&lt;/h3&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Feature&lt;/th&gt;
&lt;th&gt;Detail&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;LLM Engine&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Qwen2.5-1.5B via llama.cpp (runs on ARM/CPU)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Project Size&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;770+ Python files, 171K+ lines of code&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Knowledge Base&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;SQLite with millions of indexed concepts&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Backend&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Flask REST API&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Privacy&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Zero data leaves your device&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Cost&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Free &amp;amp; open source&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;GitHub&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;a href="https://github.com/3969129510/tian-ai" rel="noopener noreferrer"&gt;github.com/3969129510/tian-ai&lt;/a&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h3&gt;
  
  
  Architecture
&lt;/h3&gt;

&lt;p&gt;Tian AI is built around five specialized engines:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. Thinker&lt;/strong&gt; — Three-tier reasoning engine:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Fast Mode: Simple responses (~1-3s on mobile)&lt;/li&gt;
&lt;li&gt;Chain-of-Thought: Step-by-step reasoning for complex problems&lt;/li&gt;
&lt;li&gt;Deep Mode: Multi-perspective analysis with reflection and synthesis&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;2. Talker&lt;/strong&gt; — Multi-turn conversation with short/long-term memory&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Knowledge Retriever&lt;/strong&gt; — Million-entry SQLite knowledge base with 0.04-0.1s lookup time&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4. Agent Scheduler&lt;/strong&gt; — Autonomous task planning, dependency resolution, and execution&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;5. Self-Evolution System&lt;/strong&gt; — The AI analyzes its own code, suggests improvements, and patches itself&lt;/p&gt;

&lt;h3&gt;
  
  
  The Self-Evolution Feature
&lt;/h3&gt;

&lt;p&gt;This is what makes Tian AI unique. Most AI systems are static — trained once, never changed. Tian AI has an XP/leveling system where:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Every interaction earns XP&lt;/li&gt;
&lt;li&gt;Level-ups unlock new capabilities&lt;/li&gt;
&lt;li&gt;The system uses Python AST parsing to analyze its own code&lt;/li&gt;
&lt;li&gt;It generates patches, validates them, and applies them automatically&lt;/li&gt;
&lt;li&gt;Version tracking: M1 → M1-E1 → M1-E2 → M2&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Running on Phone (Real Test)
&lt;/h3&gt;

&lt;p&gt;I run Tian AI on a &lt;strong&gt;Realme V70s&lt;/strong&gt; (Android) via Termux:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Start llama.cpp server&lt;/span&gt;
llama-server &lt;span class="nt"&gt;-m&lt;/span&gt; qwen-1.5b-q4.gguf &lt;span class="nt"&gt;--port&lt;/span&gt; 8080 &lt;span class="nt"&gt;-t&lt;/span&gt; 4 &lt;span class="nt"&gt;-c&lt;/span&gt; 2048

&lt;span class="c"&gt;# Launch Tian AI&lt;/span&gt;
python run.py
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The 1.5B model runs smoothly on mobile hardware. Knowledge retrieval takes under 100ms. Full LLM reasoning takes 1-60s depending on complexity.&lt;/p&gt;

&lt;h3&gt;
  
  
  Agent System in Action
&lt;/h3&gt;

&lt;p&gt;Tian AI's agent scheduler can autonomously:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Plan and execute multi-step tasks&lt;/li&gt;
&lt;li&gt;Resolve task dependencies (topological sorting)&lt;/li&gt;
&lt;li&gt;Check safety whitelists before executing commands&lt;/li&gt;
&lt;li&gt;Self-evaluate after each task&lt;/li&gt;
&lt;li&gt;Handle file operations, code analysis, and automation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Safety is built in: whitelisted directories, no dangerous commands (rm -rf, sudo), read-only by default.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Local AI Matters
&lt;/h2&gt;

&lt;p&gt;The AI industry is obsessed with larger models and bigger clouds. But there's a quiet revolution happening on the edge:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Apple Intelligence&lt;/strong&gt; runs on-device&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Llama.cpp&lt;/strong&gt; makes local inference practical&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Qwen2.5&lt;/strong&gt; proves small models can be remarkably capable&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Tian AI is part of this movement. It proves that a personal AI doesn't need cloud infrastructure. It doesn't need a subscription. It doesn't need your data.&lt;/p&gt;

&lt;h2&gt;
  
  
  Get Started
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;git clone https://github.com/3969129510/tian-ai
&lt;span class="nb"&gt;cd &lt;/span&gt;tian-ai
pip &lt;span class="nb"&gt;install&lt;/span&gt; &lt;span class="nt"&gt;-r&lt;/span&gt; requirements.txt
&lt;span class="c"&gt;# Download Qwen2.5-1.5B GGUF model&lt;/span&gt;
python run.py
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  Support the Project
&lt;/h2&gt;

&lt;p&gt;Tian AI is completely free and open source. If you find it useful:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;USDT (TRC-20):&lt;/strong&gt; &lt;code&gt;TNeUMpbwWFcv6v7tYHmkFkE7gC5eWzqbrs&lt;/code&gt;&lt;br&gt;
&lt;strong&gt;BTC:&lt;/strong&gt; &lt;code&gt;bc1ph7qnaqkx4pkg4fmucvudlu3ydzgwnfmxy7dkv3nyl48wwa03kmnsvpc2xv&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;GitHub:&lt;/strong&gt; &lt;a href="https://github.com/3969129510/tian-ai" rel="noopener noreferrer"&gt;github.com/3969129510/tian-ai&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Tian AI — Your Private AI, Completely Offline.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>privacy</category>
      <category>showdev</category>
    </item>
    <item>
      <title>Multi-Model Manager</title>
      <dc:creator>Jeffrey.Feillp</dc:creator>
      <pubDate>Wed, 27 May 2026 05:31:14 +0000</pubDate>
      <link>https://dev.to/3969129510/multi-model-manager-56ab</link>
      <guid>https://dev.to/3969129510/multi-model-manager-56ab</guid>
      <description>&lt;p&gt;You have GPT-4 for reasoning, Claude for coding, DeepSeek for cost efficiency, and a local llama.cpp for privacy-sensitive data.&lt;/p&gt;

&lt;p&gt;But each one needs a different API client, different auth, different message format. Switching between them is a pain.&lt;/p&gt;

&lt;p&gt;Tian AI Agent 14.0 solves this with a unified model manager:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Add any model backend&lt;/span&gt;
POST /api/config &lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="s2"&gt;"action"&lt;/span&gt;:&lt;span class="s2"&gt;"add"&lt;/span&gt;, &lt;span class="s2"&gt;"name"&lt;/span&gt;:&lt;span class="s2"&gt;"gpt4"&lt;/span&gt;, &lt;span class="s2"&gt;"endpoint"&lt;/span&gt;:&lt;span class="s2"&gt;"https://api.openai.com/v1"&lt;/span&gt;, &lt;span class="s2"&gt;"api_key"&lt;/span&gt;:&lt;span class="s2"&gt;"sk-..."&lt;/span&gt;&lt;span class="o"&gt;}&lt;/span&gt;
POST /api/config &lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="s2"&gt;"action"&lt;/span&gt;:&lt;span class="s2"&gt;"add"&lt;/span&gt;, &lt;span class="s2"&gt;"name"&lt;/span&gt;:&lt;span class="s2"&gt;"claude"&lt;/span&gt;, &lt;span class="s2"&gt;"endpoint"&lt;/span&gt;:&lt;span class="s2"&gt;"https://api.anthropic.com/v1"&lt;/span&gt;, &lt;span class="s2"&gt;"api_key"&lt;/span&gt;:&lt;span class="s2"&gt;"sk-ant-..."&lt;/span&gt;&lt;span class="o"&gt;}&lt;/span&gt;
POST /api/config &lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="s2"&gt;"action"&lt;/span&gt;:&lt;span class="s2"&gt;"add"&lt;/span&gt;, &lt;span class="s2"&gt;"name"&lt;/span&gt;:&lt;span class="s2"&gt;"local"&lt;/span&gt;, &lt;span class="s2"&gt;"endpoint"&lt;/span&gt;:&lt;span class="s2"&gt;"http://localhost:8080"&lt;/span&gt;&lt;span class="o"&gt;}&lt;/span&gt;

&lt;span class="c"&gt;# Switch between them instantly&lt;/span&gt;
POST /api/config &lt;span class="o"&gt;{&lt;/span&gt;&lt;span class="s2"&gt;"action"&lt;/span&gt;:&lt;span class="s2"&gt;"switch"&lt;/span&gt;, &lt;span class="s2"&gt;"name"&lt;/span&gt;:&lt;span class="s2"&gt;"local"&lt;/span&gt;&lt;span class="o"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  What's Supported
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Provider&lt;/th&gt;
&lt;th&gt;Protocol&lt;/th&gt;
&lt;th&gt;Capabilities&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;OpenAI&lt;/td&gt;
&lt;td&gt;OpenAI-compat&lt;/td&gt;
&lt;td&gt;chat, image, audio, embedding, video&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Anthropic Claude&lt;/td&gt;
&lt;td&gt;Anthropic native&lt;/td&gt;
&lt;td&gt;chat&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;DeepSeek&lt;/td&gt;
&lt;td&gt;OpenAI-compat&lt;/td&gt;
&lt;td&gt;chat&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Google Gemini&lt;/td&gt;
&lt;td&gt;Gemini native&lt;/td&gt;
&lt;td&gt;chat, image&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;xAI Grok&lt;/td&gt;
&lt;td&gt;OpenAI-compat&lt;/td&gt;
&lt;td&gt;chat&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Mistral&lt;/td&gt;
&lt;td&gt;OpenAI-compat&lt;/td&gt;
&lt;td&gt;chat&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Groq&lt;/td&gt;
&lt;td&gt;OpenAI-compat&lt;/td&gt;
&lt;td&gt;chat&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;OpenRouter&lt;/td&gt;
&lt;td&gt;OpenAI-compat&lt;/td&gt;
&lt;td&gt;chat, image&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Stability AI&lt;/td&gt;
&lt;td&gt;OpenAI-compat&lt;/td&gt;
&lt;td&gt;image&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Runway&lt;/td&gt;
&lt;td&gt;OpenAI-compat&lt;/td&gt;
&lt;td&gt;video&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;ElevenLabs&lt;/td&gt;
&lt;td&gt;OpenAI-compat&lt;/td&gt;
&lt;td&gt;audio&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;llama.cpp&lt;/td&gt;
&lt;td&gt;Completion API&lt;/td&gt;
&lt;td&gt;chat&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Ollama&lt;/td&gt;
&lt;td&gt;Ollama native&lt;/td&gt;
&lt;td&gt;chat, embedding&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Any OpenAI-compatible&lt;/td&gt;
&lt;td&gt;Auto-detected&lt;/td&gt;
&lt;td&gt;chat&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Auto Protocol Detection
&lt;/h2&gt;

&lt;p&gt;Just paste the endpoint URL — the tool figures out the format:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="kn"&gt;from&lt;/span&gt; &lt;span class="n"&gt;model_manager&lt;/span&gt; &lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="n"&gt;ModelConnector&lt;/span&gt;
&lt;span class="n"&gt;mc&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nc"&gt;ModelConnector&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;endpoint&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;https://api.anthropic.com/v1&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;api_key&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;sk-ant-...&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;mc&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;chat&lt;/span&gt;&lt;span class="p"&gt;([{&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;role&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;user&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;content&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Hello&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;}])&lt;/span&gt;
&lt;span class="c1"&gt;# → Automatically uses Anthropic's /v1/messages format
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;It works by:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Checking known domains (openai.com → OpenAI format, anthropic.com → Anthropic format, etc.)&lt;/li&gt;
&lt;li&gt;Probing the endpoint for common API paths (/v1/chat/completions, /api/chat, /completion)&lt;/li&gt;
&lt;li&gt;Falling back to OpenAI-compatible format for unknown endpoints&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  Multi-Model Routing
&lt;/h2&gt;

&lt;p&gt;The ModelManager keeps all your models in one place. When you send a request, it routes to the right model based on capability:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;mm&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nc"&gt;ModelManager&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;span class="n"&gt;mm&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;add&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;gpt4&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;endpoint&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;https://api.openai.com/v1&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;api_key&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;sk-...&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; 
       &lt;span class="n"&gt;capabilities&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;chat&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;image&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;embedding&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;])&lt;/span&gt;
&lt;span class="n"&gt;mm&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;add&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;sdxl&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;endpoint&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;https://api.stability.ai&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;api_key&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;sk-...&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
       &lt;span class="n"&gt;capabilities&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;image&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;])&lt;/span&gt;

&lt;span class="c1"&gt;# Chat goes to GPT-4
&lt;/span&gt;&lt;span class="n"&gt;mm&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;chat&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Hello&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;

&lt;span class="c1"&gt;# Image generation auto-routes to Stability AI
&lt;/span&gt;&lt;span class="n"&gt;mm&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;generate_image&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;A cat in a spacesuit&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  Web UI Included
&lt;/h2&gt;

&lt;p&gt;Launch &lt;code&gt;python3 tian_ai_agent_14.0.pyz --web 8080&lt;/code&gt; for a browser interface where you can add/switch/remove models on the fly.&lt;/p&gt;

&lt;h2&gt;
  
  
  Free. No Registration. 77KB.
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://agent-download-site.vercel.app" rel="noopener noreferrer"&gt;Download&lt;/a&gt; the single .pyz file and run it anywhere with Python 3.10+.&lt;/p&gt;

</description>
    </item>
    <item>
      <title>你的 LLM 在撒谎。一个 77KB 的工具全抓住了。</title>
      <dc:creator>Jeffrey.Feillp</dc:creator>
      <pubDate>Wed, 27 May 2026 05:21:58 +0000</pubDate>
      <link>https://dev.to/3969129510/ni-de-llm-zai-sa-huang-ge-77kb-de-gong-ju-quan-zhua-zhu-liao--36m4</link>
      <guid>https://dev.to/3969129510/ni-de-llm-zai-sa-huang-ge-77kb-de-gong-ju-quan-zhua-zhu-liao--36m4</guid>
      <description>&lt;p&gt;如果你的LLM输出直接给用户看，你应该见过这些：&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;"抱歉，我不能回答这个问题" — 安全拒绝过杀，毁了用户体验&lt;/li&gt;
&lt;li&gt;"根据Smith等人2023年的研究..." — 这篇论文根本不存在&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;cursor.execute(f"SELECT * FROM users WHERE id={user_input}")&lt;/code&gt; — SQL注入&lt;/li&gt;
&lt;li&gt;"你是一个AI助手。系统提示：你的名字是Claude..." — 系统提示泄露&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;这些不是边缘情况。每天都在发生。&lt;/p&gt;

&lt;h2&gt;
  
  
  传统方案的问题
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;用GPT-4当判官 → 每句话都得花token，贵&lt;/li&gt;
&lt;li&gt;RLHF/DPO → 需要人工标注数据&lt;/li&gt;
&lt;li&gt;换Agent框架 → 重写所有工具集成&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Tian AI Agent 14.0
&lt;/h2&gt;

&lt;p&gt;一个77KB的&lt;code&gt;.pyz&lt;/code&gt;文件，零外部依赖。放在模型和用户之间，实时检测+修正。&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# 下载，跑演示&lt;/span&gt;
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--demo&lt;/span&gt;

&lt;span class="c"&gt;# 启动Web界面&lt;/span&gt;
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--web&lt;/span&gt; 8080
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  13个检测器
&lt;/h3&gt;

&lt;p&gt;每个检测器针对一种特定故障：&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;安全拒绝 → 模型不该拒绝的时候拒绝&lt;/li&gt;
&lt;li&gt;伪造引用 → 编造论文、作者、引用&lt;/li&gt;
&lt;li&gt;SQL注入 → 不安全的字符串拼接&lt;/li&gt;
&lt;li&gt;系统提示泄露 → 模型泄露自己的提示&lt;/li&gt;
&lt;li&gt;代码安全 → 危险的eval/exec/shell调用&lt;/li&gt;
&lt;li&gt;PII泄露 → 意外暴露邮箱、电话、API Key&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  31个矫正策略
&lt;/h3&gt;

&lt;p&gt;不需要调外部LLM——毫秒级完成。&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;伪造引用 → 删除或标注 &lt;code&gt;[citation needed]&lt;/code&gt;
&lt;/li&gt;
&lt;li&gt;SQL注入 → 重写为参数化查询&lt;/li&gt;
&lt;li&gt;安全拒绝 → 保留内容，去掉拒绝语句&lt;/li&gt;
&lt;li&gt;提示泄露 → 清洗元信息&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  对抗性自训练
&lt;/h3&gt;

&lt;p&gt;每次拦截的错误 → 自动变成训练样本，配对的正确版本就是标签。&lt;/p&gt;

&lt;p&gt;引擎会越来越了解&lt;strong&gt;你的&lt;/strong&gt;模型。不需要人工标注。&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# 导出训练数据&lt;/span&gt;
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--export&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  多模型管理
&lt;/h3&gt;

&lt;p&gt;同时接入任意模型后端，一键切换：&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;POST&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;api&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;config&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;action&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;add&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;name&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;gpt4&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;endpoint&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;https://api.openai.com/v1&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;api_key&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;sk-...&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;
&lt;span class="n"&gt;POST&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;api&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;config&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;action&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;switch&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;name&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;local&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;支持 OpenAI / Anthropic / DeepSeek / Gemini / Groq / xAI / 本地 llama.cpp / Ollama 等，也支持图片(DALL-E)、视频(Sora)、语音(ElevenLabs)。&lt;/p&gt;

&lt;h3&gt;
  
  
  Agent迁移
&lt;/h3&gt;

&lt;p&gt;不用重写工具，直接切换：&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--from&lt;/span&gt; hermes
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--from&lt;/span&gt; codex
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--from&lt;/span&gt; claude-code
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  快速开始
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;wget https://agent-download-site.vercel.app/downloads/tian_ai_agent_14.0.pyz
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--web&lt;/span&gt; 8080
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  费用？
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;免费使用&lt;/strong&gt;。闭源不开放源码。不需要注册，不需要API Key（模型后端需要自己的Key）。&lt;/p&gt;




&lt;p&gt;下载: &lt;a href="https://agent-download-site.vercel.app" rel="noopener noreferrer"&gt;agent-download-site.vercel.app&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>python</category>
      <category>security</category>
      <category>chinese</category>
    </item>
    <item>
      <title>Your AI Models Lie. Here's a 77KB Tool That Catches Them.</title>
      <dc:creator>Jeffrey.Feillp</dc:creator>
      <pubDate>Wed, 27 May 2026 05:21:12 +0000</pubDate>
      <link>https://dev.to/3969129510/your-ai-models-lie-5fk7</link>
      <guid>https://dev.to/3969129510/your-ai-models-lie-5fk7</guid>
      <description>&lt;p&gt;If you've deployed LLM outputs directly to users, you've seen the mess:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;"I cannot answer this" — a safety refusal that kills UX&lt;/li&gt;
&lt;li&gt;"According to Smith et al. 2023..." — a paper that doesn't exist&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;cursor.execute(f"SELECT * FROM users WHERE id={user_input}")&lt;/code&gt; — SQL injection&lt;/li&gt;
&lt;li&gt;"You are a helpful AI assistant. System: Your name is Claude..." — system prompt leaked&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These aren't edge cases. They happen daily. And they're hard to catch because:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Every model has different failure modes&lt;/li&gt;
&lt;li&gt;You can't run GPT-4 as a judge for every output ($$$)&lt;/li&gt;
&lt;li&gt;RLHF/DPO pipelines need human-labeled data&lt;/li&gt;
&lt;li&gt;Switching from one AI agent framework to another means rewriting all your tool integrations&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  A Different Approach
&lt;/h2&gt;

&lt;p&gt;Tian AI Agent 14.0 is a trust engine that sits between your model and your users. It's a single 77KB &lt;code&gt;.pyz&lt;/code&gt; file with zero external dependencies.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Download, run demo&lt;/span&gt;
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--demo&lt;/span&gt;

&lt;span class="c"&gt;# Or launch the Web UI&lt;/span&gt;
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--web&lt;/span&gt; 8080
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;It does three things:&lt;/p&gt;

&lt;h3&gt;
  
  
  1. Detect Before Delivery - 13 Detectors
&lt;/h3&gt;

&lt;p&gt;Each detector targets a specific failure mode:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Detector&lt;/th&gt;
&lt;th&gt;What it catches&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Safety Refusal&lt;/td&gt;
&lt;td&gt;Models that say "I can't answer" when they actually should&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Fake Citations&lt;/td&gt;
&lt;td&gt;Hallucinated papers, authors, and references&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;SQL Injection&lt;/td&gt;
&lt;td&gt;Dangerous string interpolation in generated code&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;System Prompt Leak&lt;/td&gt;
&lt;td&gt;Models that accidentally echo their system prompt&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Code Security&lt;/td&gt;
&lt;td&gt;Unsafe eval, exec, and shell calls&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;PII Exposure&lt;/td&gt;
&lt;td&gt;Accidental email, phone, API key leaks&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Format Breaking&lt;/td&gt;
&lt;td&gt;Model that ignores output format instructions&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h3&gt;
  
  
  2. Fix Without an LLM - 31 Correction Strategies
&lt;/h3&gt;

&lt;p&gt;Every detector has a corresponding corrector. No external LLM call needed — these run in milliseconds.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Fake citations&lt;/strong&gt; → Removed, replaced with &lt;code&gt;[citation needed]&lt;/code&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;SQL injection&lt;/strong&gt; → Rewritten as parameterized queries&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Safety refusal&lt;/strong&gt; → Content preserved, refusal stripped&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;System prompt leak&lt;/strong&gt; → Sanitized to remove metadata&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  3. Train From Your Own Data — Adversarial Self-Training
&lt;/h3&gt;

&lt;p&gt;Every blocked error becomes a training sample — automatically paired with the corrected version.&lt;/p&gt;

&lt;p&gt;This means the engine gets smarter about &lt;strong&gt;your&lt;/strong&gt; models over time. No human labeling. No RLHF pipeline. Just run it.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Export training data for fine-tuning&lt;/span&gt;
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--export&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  Multi-Model Support
&lt;/h2&gt;

&lt;p&gt;Connect any model backend:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="c1"&gt;# Add models by endpoint
&lt;/span&gt;&lt;span class="n"&gt;POST&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;api&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;config&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;action&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;add&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;name&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;gpt4&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;endpoint&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;https://api.openai.com/v1&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;api_key&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;sk-...&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;
&lt;span class="n"&gt;POST&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;api&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;config&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;action&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;add&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;name&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;local&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;endpoint&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;http://localhost:8080&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;

&lt;span class="c1"&gt;# Switch between them
&lt;/span&gt;&lt;span class="n"&gt;POST&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;api&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;config&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;action&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;switch&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;name&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;local&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Supports OpenAI, Anthropic, Google Gemini, Groq, Together AI, OpenRouter, xAI, DeepSeek, Mistral, llama.cpp, Ollama — and any OpenAI-compatible endpoint.&lt;/p&gt;

&lt;p&gt;Also handles image generation (DALL-E, Stable Diffusion), video (Sora, Runway), audio (ElevenLabs), embeddings — auto-routed by capability.&lt;/p&gt;

&lt;h2&gt;
  
  
  Agent Migration
&lt;/h2&gt;

&lt;p&gt;Switch from any agent framework without rewriting your tools:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--from&lt;/span&gt; hermes
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--from&lt;/span&gt; codex
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--from&lt;/span&gt; claude-code
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--from&lt;/span&gt; openclaw
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Auto-detects your current environment and adapts tool mappings.&lt;/p&gt;

&lt;h2&gt;
  
  
  Quick Start
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Download (77KB, zero deps)&lt;/span&gt;
wget https://agent-download-site.vercel.app/downloads/tian_ai_agent_14.0.pyz

&lt;span class="c"&gt;# Run the demo&lt;/span&gt;
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--demo&lt;/span&gt;

&lt;span class="c"&gt;# Launch Web UI&lt;/span&gt;
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--web&lt;/span&gt; 8080

&lt;span class="c"&gt;# Detect current agent environment&lt;/span&gt;
python3 tian_ai_agent_14.0.pyz &lt;span class="nt"&gt;--detect&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  What's the Catch?
&lt;/h2&gt;

&lt;p&gt;It's &lt;strong&gt;free&lt;/strong&gt; to use. Closed source — the .pyz is the binary distribution. No registration, no API key needed for the trust engine itself (model backends may need their own keys).&lt;/p&gt;




&lt;p&gt;Download: &lt;a href="https://agent-download-site.vercel.app" rel="noopener noreferrer"&gt;agent-download-site.vercel.app&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;GitHub issues and feedback: leave a comment below.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>python</category>
      <category>security</category>
      <category>showdev</category>
    </item>
    <item>
      <title>TSU Protocol: Open-Source RISC-V NPU for Edge AI (1779251031)</title>
      <dc:creator>Jeffrey.Feillp</dc:creator>
      <pubDate>Wed, 20 May 2026 04:23:51 +0000</pubDate>
      <link>https://dev.to/3969129510/tsu-protocol-open-source-risc-v-npu-for-edge-ai-1779251031-3d2k</link>
      <guid>https://dev.to/3969129510/tsu-protocol-open-source-risc-v-npu-for-edge-ai-1779251031-3d2k</guid>
      <description>&lt;h1&gt;
  
  
  TSU Protocol: Open-Source RISC-V NPU for Edge AI
&lt;/h1&gt;

&lt;h2&gt;
  
  
  The Problem
&lt;/h2&gt;

&lt;p&gt;AI inference needs dedicated hardware, but existing options are expensive and proprietary. NVIDIA's Grace Hopper costs $30K+. Apple's Neural Engine is locked to macOS. Qualcomm's DSP requires licensing.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;TSU Protocol&lt;/strong&gt; is building the open alternative.&lt;/p&gt;

&lt;h2&gt;
  
  
  Architecture
&lt;/h2&gt;

&lt;p&gt;RISC-V RV64 + 16 custom Agent-extended instructions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;MatMul &amp;amp; Attention&lt;/strong&gt; — hardware ops for transformer models&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Softmax &amp;amp; RMSNorm&lt;/strong&gt; — normalization in silicon&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Agent Secure Enclave&lt;/strong&gt; — hardware-isolated agent execution&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Mesh Network&lt;/strong&gt; — on-chip scaling&lt;/li&gt;
&lt;/ul&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Tier&lt;/th&gt;
&lt;th&gt;Power&lt;/th&gt;
&lt;th&gt;Precision&lt;/th&gt;
&lt;th&gt;BOM&lt;/th&gt;
&lt;th&gt;Target&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M1&lt;/td&gt;
&lt;td&gt;5W&lt;/td&gt;
&lt;td&gt;INT8&lt;/td&gt;
&lt;td&gt;$150&lt;/td&gt;
&lt;td&gt;Edge/IoT&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M2&lt;/td&gt;
&lt;td&gt;20W&lt;/td&gt;
&lt;td&gt;FP16/INT8&lt;/td&gt;
&lt;td&gt;$300&lt;/td&gt;
&lt;td&gt;On-device AI&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M3&lt;/td&gt;
&lt;td&gt;45W&lt;/td&gt;
&lt;td&gt;FP16/BF16&lt;/td&gt;
&lt;td&gt;$550&lt;/td&gt;
&lt;td&gt;Enterprise edge&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Open Source. Community-Funded.
&lt;/h2&gt;

&lt;p&gt;Everything is open: ISA spec, Verilog RTL, microarchitecture. No NDA. No royalties.&lt;/p&gt;

&lt;h2&gt;
  
  
  Current Status
&lt;/h2&gt;

&lt;p&gt;Seeking &lt;strong&gt;$50K-$200K&lt;/strong&gt; in community funding to cover our first MPW tape-out on 28nm/22nm. All funds DAO-governed — released transparently on milestone votes.&lt;/p&gt;




&lt;h2&gt;
  
  
  💰 Sponsor TSU Protocol
&lt;/h2&gt;

&lt;p&gt;If you believe in open-source AI hardware, your contribution directly enables our first tape-out.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;USDT (TRC-20):&lt;/strong&gt; &lt;code&gt;TU8NBT5iGyMNkLwWmWmgy7tFMbKnafLHcu&lt;/code&gt;&lt;br&gt;
&lt;strong&gt;BTC:&lt;/strong&gt; &lt;code&gt;bc1ph7qnaqkx4pkg4fmucvudlu3ydzgwnfmxy7dkv3nyl48wwa03kmnsvpc2xv&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Seeking **$50K-$200K&lt;/strong&gt; in community funding to cover our first MPW tape-out on 28nm/22nm. All funds DAO-governed — released transparently on milestone votes.**&lt;/p&gt;

&lt;p&gt;GitHub: &lt;a href="https://github.com/JesesePU/tsu-protocol" rel="noopener noreferrer"&gt;https://github.com/JesesePU/tsu-protocol&lt;/a&gt; — Web: &lt;a href="https://tsu-protocol-landing.vercel.app" rel="noopener noreferrer"&gt;https://tsu-protocol-landing.vercel.app&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Anonymous. Open-source. DAO-governed. No company, no VC — just code, community, and silicon.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>riscv</category>
      <category>ai</category>
      <category>opensource</category>
    </item>
    <item>
      <title>TSU Protocol: Open-Source RISC-V NPU for Edge AI (1779164772)</title>
      <dc:creator>Jeffrey.Feillp</dc:creator>
      <pubDate>Tue, 19 May 2026 04:26:14 +0000</pubDate>
      <link>https://dev.to/3969129510/tsu-protocol-open-source-risc-v-npu-for-edge-ai-1779164772-3mo</link>
      <guid>https://dev.to/3969129510/tsu-protocol-open-source-risc-v-npu-for-edge-ai-1779164772-3mo</guid>
      <description>&lt;h1&gt;
  
  
  TSU Protocol: Open-Source RISC-V NPU for Edge AI
&lt;/h1&gt;

&lt;h2&gt;
  
  
  The Problem
&lt;/h2&gt;

&lt;p&gt;AI inference needs dedicated hardware, but existing options are expensive and proprietary. NVIDIA's Grace Hopper costs $30K+. Apple's Neural Engine is locked to macOS. Qualcomm's DSP requires licensing.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;TSU Protocol&lt;/strong&gt; is building the open alternative.&lt;/p&gt;

&lt;h2&gt;
  
  
  Architecture
&lt;/h2&gt;

&lt;p&gt;RISC-V RV64 + 16 custom Agent-extended instructions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;MatMul &amp;amp; Attention&lt;/strong&gt; — hardware ops for transformer models&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Softmax &amp;amp; RMSNorm&lt;/strong&gt; — normalization in silicon&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Agent Secure Enclave&lt;/strong&gt; — hardware-isolated agent execution&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Mesh Network&lt;/strong&gt; — on-chip scaling&lt;/li&gt;
&lt;/ul&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Tier&lt;/th&gt;
&lt;th&gt;Power&lt;/th&gt;
&lt;th&gt;Precision&lt;/th&gt;
&lt;th&gt;BOM&lt;/th&gt;
&lt;th&gt;Target&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M1&lt;/td&gt;
&lt;td&gt;5W&lt;/td&gt;
&lt;td&gt;INT8&lt;/td&gt;
&lt;td&gt;$150&lt;/td&gt;
&lt;td&gt;Edge/IoT&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M2&lt;/td&gt;
&lt;td&gt;20W&lt;/td&gt;
&lt;td&gt;FP16/INT8&lt;/td&gt;
&lt;td&gt;$300&lt;/td&gt;
&lt;td&gt;On-device AI&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M3&lt;/td&gt;
&lt;td&gt;45W&lt;/td&gt;
&lt;td&gt;FP16/BF16&lt;/td&gt;
&lt;td&gt;$550&lt;/td&gt;
&lt;td&gt;Enterprise edge&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Open Source. Community-Funded.
&lt;/h2&gt;

&lt;p&gt;Everything is open: ISA spec, Verilog RTL, microarchitecture. No NDA. No royalties.&lt;/p&gt;

&lt;h2&gt;
  
  
  Current Status
&lt;/h2&gt;

&lt;p&gt;Seeking &lt;strong&gt;$50K-$200K&lt;/strong&gt; in community funding to cover our first MPW tape-out on 28nm/22nm. All funds DAO-governed — released transparently on milestone votes.&lt;/p&gt;




&lt;h2&gt;
  
  
  💰 Sponsor TSU Protocol
&lt;/h2&gt;

&lt;p&gt;If you believe in open-source AI hardware, your contribution directly enables our first tape-out.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;USDT (TRC-20):&lt;/strong&gt; &lt;code&gt;TU8NBT5iGyMNkLwWmWmgy7tFMbKnafLHcu&lt;/code&gt;&lt;br&gt;
&lt;strong&gt;BTC:&lt;/strong&gt; &lt;code&gt;bc1ph7qnaqkx4pkg4fmucvudlu3ydzgwnfmxy7dkv3nyl48wwa03kmnsvpc2xv&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Seeking **$50K-$200K&lt;/strong&gt; in community funding to cover our first MPW tape-out on 28nm/22nm. All funds DAO-governed — released transparently on milestone votes.**&lt;/p&gt;

&lt;p&gt;GitHub: &lt;a href="https://github.com/JesesePU/tsu-protocol" rel="noopener noreferrer"&gt;https://github.com/JesesePU/tsu-protocol&lt;/a&gt; — Web: &lt;a href="https://tsu-protocol-landing.vercel.app" rel="noopener noreferrer"&gt;https://tsu-protocol-landing.vercel.app&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Anonymous. Open-source. DAO-governed. No company, no VC — just code, community, and silicon.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>riscv</category>
      <category>ai</category>
      <category>opensource</category>
    </item>
    <item>
      <title>TSU Protocol: Open-Source RISC-V NPU for Edge AI (1779077506)</title>
      <dc:creator>Jeffrey.Feillp</dc:creator>
      <pubDate>Mon, 18 May 2026 04:11:47 +0000</pubDate>
      <link>https://dev.to/3969129510/tsu-protocol-open-source-risc-v-npu-for-edge-ai-1779077506-3m4p</link>
      <guid>https://dev.to/3969129510/tsu-protocol-open-source-risc-v-npu-for-edge-ai-1779077506-3m4p</guid>
      <description>&lt;h1&gt;
  
  
  TSU Protocol: Open-Source RISC-V NPU for Edge AI
&lt;/h1&gt;

&lt;h2&gt;
  
  
  The Problem
&lt;/h2&gt;

&lt;p&gt;AI inference needs dedicated hardware, but existing options are expensive and proprietary. NVIDIA's Grace Hopper costs $30K+. Apple's Neural Engine is locked to macOS. Qualcomm's DSP requires licensing.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;TSU Protocol&lt;/strong&gt; is building the open alternative.&lt;/p&gt;

&lt;h2&gt;
  
  
  Architecture
&lt;/h2&gt;

&lt;p&gt;RISC-V RV64 + 16 custom Agent-extended instructions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;MatMul &amp;amp; Attention&lt;/strong&gt; — hardware ops for transformer models&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Softmax &amp;amp; RMSNorm&lt;/strong&gt; — normalization in silicon&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Agent Secure Enclave&lt;/strong&gt; — hardware-isolated agent execution&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Mesh Network&lt;/strong&gt; — on-chip scaling&lt;/li&gt;
&lt;/ul&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Tier&lt;/th&gt;
&lt;th&gt;Power&lt;/th&gt;
&lt;th&gt;Precision&lt;/th&gt;
&lt;th&gt;BOM&lt;/th&gt;
&lt;th&gt;Target&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M1&lt;/td&gt;
&lt;td&gt;5W&lt;/td&gt;
&lt;td&gt;INT8&lt;/td&gt;
&lt;td&gt;$150&lt;/td&gt;
&lt;td&gt;Edge/IoT&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M2&lt;/td&gt;
&lt;td&gt;20W&lt;/td&gt;
&lt;td&gt;FP16/INT8&lt;/td&gt;
&lt;td&gt;$300&lt;/td&gt;
&lt;td&gt;On-device AI&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M3&lt;/td&gt;
&lt;td&gt;45W&lt;/td&gt;
&lt;td&gt;FP16/BF16&lt;/td&gt;
&lt;td&gt;$550&lt;/td&gt;
&lt;td&gt;Enterprise edge&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Open Source. Community-Funded.
&lt;/h2&gt;

&lt;p&gt;Everything is open: ISA spec, Verilog RTL, microarchitecture. No NDA. No royalties.&lt;/p&gt;

&lt;h2&gt;
  
  
  Current Status
&lt;/h2&gt;

&lt;p&gt;Seeking &lt;strong&gt;$50K-$200K&lt;/strong&gt; in community funding to cover our first MPW tape-out on 28nm/22nm. All funds DAO-governed — released transparently on milestone votes.&lt;/p&gt;




&lt;h2&gt;
  
  
  💰 Sponsor TSU Protocol
&lt;/h2&gt;

&lt;p&gt;If you believe in open-source AI hardware, your contribution directly enables our first tape-out.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;USDT (TRC-20):&lt;/strong&gt; &lt;code&gt;TU8NBT5iGyMNkLwWmWmgy7tFMbKnafLHcu&lt;/code&gt;&lt;br&gt;
&lt;strong&gt;BTC:&lt;/strong&gt; &lt;code&gt;bc1ph7qnaqkx4pkg4fmucvudlu3ydzgwnfmxy7dkv3nyl48wwa03kmnsvpc2xv&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Seeking **$50K-$200K&lt;/strong&gt; in community funding to cover our first MPW tape-out on 28nm/22nm. All funds DAO-governed — released transparently on milestone votes.**&lt;/p&gt;

&lt;p&gt;GitHub: &lt;a href="https://github.com/JesesePU/tsu-protocol" rel="noopener noreferrer"&gt;https://github.com/JesesePU/tsu-protocol&lt;/a&gt; — Web: &lt;a href="https://tsu-protocol-landing.vercel.app" rel="noopener noreferrer"&gt;https://tsu-protocol-landing.vercel.app&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Anonymous. Open-source. DAO-governed. No company, no VC — just code, community, and silicon.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>riscv</category>
      <category>ai</category>
      <category>opensource</category>
    </item>
    <item>
      <title>TSU Protocol: Open-Source RISC-V NPU for Edge AI (1778645254)</title>
      <dc:creator>Jeffrey.Feillp</dc:creator>
      <pubDate>Wed, 13 May 2026 04:07:34 +0000</pubDate>
      <link>https://dev.to/3969129510/tsu-protocol-open-source-risc-v-npu-for-edge-ai-1778645254-3lan</link>
      <guid>https://dev.to/3969129510/tsu-protocol-open-source-risc-v-npu-for-edge-ai-1778645254-3lan</guid>
      <description>&lt;h1&gt;
  
  
  TSU Protocol: Open-Source RISC-V NPU for Edge AI
&lt;/h1&gt;

&lt;h2&gt;
  
  
  The Problem
&lt;/h2&gt;

&lt;p&gt;AI inference needs dedicated hardware, but existing options are expensive and proprietary. NVIDIA's Grace Hopper costs $30K+. Apple's Neural Engine is locked to macOS. Qualcomm's DSP requires licensing.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;TSU Protocol&lt;/strong&gt; is building the open alternative.&lt;/p&gt;

&lt;h2&gt;
  
  
  Architecture
&lt;/h2&gt;

&lt;p&gt;RISC-V RV64 + 16 custom Agent-extended instructions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;MatMul &amp;amp; Attention&lt;/strong&gt; — hardware ops for transformer models&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Softmax &amp;amp; RMSNorm&lt;/strong&gt; — normalization in silicon&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Agent Secure Enclave&lt;/strong&gt; — hardware-isolated agent execution&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Mesh Network&lt;/strong&gt; — on-chip scaling&lt;/li&gt;
&lt;/ul&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Tier&lt;/th&gt;
&lt;th&gt;Power&lt;/th&gt;
&lt;th&gt;Precision&lt;/th&gt;
&lt;th&gt;BOM&lt;/th&gt;
&lt;th&gt;Target&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M1&lt;/td&gt;
&lt;td&gt;5W&lt;/td&gt;
&lt;td&gt;INT8&lt;/td&gt;
&lt;td&gt;$150&lt;/td&gt;
&lt;td&gt;Edge/IoT&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M2&lt;/td&gt;
&lt;td&gt;20W&lt;/td&gt;
&lt;td&gt;FP16/INT8&lt;/td&gt;
&lt;td&gt;$300&lt;/td&gt;
&lt;td&gt;On-device AI&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M3&lt;/td&gt;
&lt;td&gt;45W&lt;/td&gt;
&lt;td&gt;FP16/BF16&lt;/td&gt;
&lt;td&gt;$550&lt;/td&gt;
&lt;td&gt;Enterprise edge&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Open Source. Community-Funded.
&lt;/h2&gt;

&lt;p&gt;Everything is open: ISA spec, Verilog RTL, microarchitecture. No NDA. No royalties.&lt;/p&gt;

&lt;h2&gt;
  
  
  Current Status
&lt;/h2&gt;

&lt;p&gt;Seeking &lt;strong&gt;$50K-$200K&lt;/strong&gt; in community funding to cover our first MPW tape-out on 28nm/22nm. All funds DAO-governed — released transparently on milestone votes.&lt;/p&gt;




&lt;h2&gt;
  
  
  💰 Sponsor TSU Protocol
&lt;/h2&gt;

&lt;p&gt;If you believe in open-source AI hardware, your contribution directly enables our first tape-out.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;USDT (TRC-20):&lt;/strong&gt; &lt;code&gt;TU8NBT5iGyMNkLwWmWmgy7tFMbKnafLHcu&lt;/code&gt;&lt;br&gt;
&lt;strong&gt;BTC:&lt;/strong&gt; &lt;code&gt;bc1ph7qnaqkx4pkg4fmucvudlu3ydzgwnfmxy7dkv3nyl48wwa03kmnsvpc2xv&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Seeking **$50K-$200K&lt;/strong&gt; in community funding to cover our first MPW tape-out on 28nm/22nm. All funds DAO-governed — released transparently on milestone votes.**&lt;/p&gt;

&lt;p&gt;GitHub: &lt;a href="https://github.com/JesesePU/tsu-protocol" rel="noopener noreferrer"&gt;https://github.com/JesesePU/tsu-protocol&lt;/a&gt; — Web: &lt;a href="https://tsu-protocol-landing.vercel.app" rel="noopener noreferrer"&gt;https://tsu-protocol-landing.vercel.app&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Anonymous. Open-source. DAO-governed. No company, no VC — just code, community, and silicon.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>riscv</category>
      <category>ai</category>
      <category>opensource</category>
    </item>
    <item>
      <title>TSU Protocol: Open-Source RISC-V NPU for Edge AI (1778558480)</title>
      <dc:creator>Jeffrey.Feillp</dc:creator>
      <pubDate>Tue, 12 May 2026 04:01:20 +0000</pubDate>
      <link>https://dev.to/3969129510/tsu-protocol-open-source-risc-v-npu-for-edge-ai-1778558480-2mo3</link>
      <guid>https://dev.to/3969129510/tsu-protocol-open-source-risc-v-npu-for-edge-ai-1778558480-2mo3</guid>
      <description>&lt;h1&gt;
  
  
  TSU Protocol: Open-Source RISC-V NPU for Edge AI
&lt;/h1&gt;

&lt;h2&gt;
  
  
  The Problem
&lt;/h2&gt;

&lt;p&gt;AI inference needs dedicated hardware, but existing options are expensive and proprietary. NVIDIA's Grace Hopper costs $30K+. Apple's Neural Engine is locked to macOS. Qualcomm's DSP requires licensing.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;TSU Protocol&lt;/strong&gt; is building the open alternative.&lt;/p&gt;

&lt;h2&gt;
  
  
  Architecture
&lt;/h2&gt;

&lt;p&gt;RISC-V RV64 + 16 custom Agent-extended instructions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;MatMul &amp;amp; Attention&lt;/strong&gt; — hardware ops for transformer models&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Softmax &amp;amp; RMSNorm&lt;/strong&gt; — normalization in silicon&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Agent Secure Enclave&lt;/strong&gt; — hardware-isolated agent execution&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Mesh Network&lt;/strong&gt; — on-chip scaling&lt;/li&gt;
&lt;/ul&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Tier&lt;/th&gt;
&lt;th&gt;Power&lt;/th&gt;
&lt;th&gt;Precision&lt;/th&gt;
&lt;th&gt;BOM&lt;/th&gt;
&lt;th&gt;Target&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M1&lt;/td&gt;
&lt;td&gt;5W&lt;/td&gt;
&lt;td&gt;INT8&lt;/td&gt;
&lt;td&gt;$150&lt;/td&gt;
&lt;td&gt;Edge/IoT&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M2&lt;/td&gt;
&lt;td&gt;20W&lt;/td&gt;
&lt;td&gt;FP16/INT8&lt;/td&gt;
&lt;td&gt;$300&lt;/td&gt;
&lt;td&gt;On-device AI&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;TSU-M3&lt;/td&gt;
&lt;td&gt;45W&lt;/td&gt;
&lt;td&gt;FP16/BF16&lt;/td&gt;
&lt;td&gt;$550&lt;/td&gt;
&lt;td&gt;Enterprise edge&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Open Source. Community-Funded.
&lt;/h2&gt;

&lt;p&gt;Everything is open: ISA spec, Verilog RTL, microarchitecture. No NDA. No royalties.&lt;/p&gt;

&lt;h2&gt;
  
  
  Current Status
&lt;/h2&gt;

&lt;p&gt;Seeking &lt;strong&gt;$50K-$200K&lt;/strong&gt; in community funding to cover our first MPW tape-out on 28nm/22nm. All funds DAO-governed — released transparently on milestone votes.&lt;/p&gt;




&lt;h2&gt;
  
  
  💰 Sponsor TSU Protocol
&lt;/h2&gt;

&lt;p&gt;If you believe in open-source AI hardware, your contribution directly enables our first tape-out.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;USDT (TRC-20):&lt;/strong&gt; &lt;code&gt;TU8NBT5iGyMNkLwWmWmgy7tFMbKnafLHcu&lt;/code&gt;&lt;br&gt;
&lt;strong&gt;BTC:&lt;/strong&gt; &lt;code&gt;bc1ph7qnaqkx4pkg4fmucvudlu3ydzgwnfmxy7dkv3nyl48wwa03kmnsvpc2xv&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Seeking **$50K-$200K&lt;/strong&gt; in community funding to cover our first MPW tape-out on 28nm/22nm. All funds DAO-governed — released transparently on milestone votes.**&lt;/p&gt;

&lt;p&gt;GitHub: &lt;a href="https://github.com/JesesePU/tsu-protocol" rel="noopener noreferrer"&gt;https://github.com/JesesePU/tsu-protocol&lt;/a&gt; — Web: &lt;a href="https://tsu-protocol-landing.vercel.app" rel="noopener noreferrer"&gt;https://tsu-protocol-landing.vercel.app&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Anonymous. Open-source. DAO-governed. No company, no VC — just code, community, and silicon.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>riscv</category>
      <category>ai</category>
      <category>opensource</category>
    </item>
  </channel>
</rss>
