<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Soros</title>
    <description>The latest articles on DEV Community by Soros (@soros_02159c95a2582796088).</description>
    <link>https://dev.to/soros_02159c95a2582796088</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3943332%2F5dea517f-59f9-4bc7-ba65-9c704923d82d.png</url>
      <title>DEV Community: Soros</title>
      <link>https://dev.to/soros_02159c95a2582796088</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/soros_02159c95a2582796088"/>
    <language>en</language>
    <item>
      <title>ICECODE — Self-hosted AI Agent Platform: Multi-Agent Swarm, Local RAG, 26-Page Web UI, 34 Platform Gateways</title>
      <dc:creator>Soros</dc:creator>
      <pubDate>Thu, 21 May 2026 05:36:50 +0000</pubDate>
      <link>https://dev.to/soros_02159c95a2582796088/icecode-self-hosted-ai-agent-platform-multi-agent-swarm-local-rag-26-page-web-ui-34-platform-ig3</link>
      <guid>https://dev.to/soros_02159c95a2582796088/icecode-self-hosted-ai-agent-platform-multi-agent-swarm-local-rag-26-page-web-ui-34-platform-ig3</guid>
      <description>&lt;p&gt;ICECODE is a self-hosted AI agent platform I built by unifying three open-source projects (Hermes, OpenCode, ClawX) into one cohesive system. It runs entirely on your machine — no data leaves your network.&lt;/p&gt;

&lt;h2&gt;
  
  
  What it does
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Multi-Agent Swarm&lt;/strong&gt; — pipeline and parallel orchestration: agents collaborate, pass context, and vote on answers&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Local RAG&lt;/strong&gt; — FAISS + sentence-transformers, 100% offline — index any file, search semantically
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;26-Page Web UI&lt;/strong&gt; — dashboard, chat, kanban, goals, swarm, knowledge, MCP, benchmark, and more&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cost Optimizer&lt;/strong&gt; — semantic cache (skip duplicate API calls), context compressor, smart model router&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;174 REST API routes&lt;/strong&gt; with auto-docs (FastAPI)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;34 platform gateways&lt;/strong&gt; — WhatsApp, Telegram, Discord, Slack, Email, WeChat, Feishu, and 27 more&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;14 LLM providers&lt;/strong&gt; — Anthropic, OpenAI, Ollama, Gemini, Mistral, Bedrock, Azure, OpenRouter...&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;90+ agent tools&lt;/strong&gt; — file, web, browser, terminal, vision, kanban, MCP, code execution&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Self-learning skills&lt;/strong&gt; — agents learn new skills at runtime, stored across sessions&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Reinforcement learning&lt;/strong&gt; environment for agent improvement&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;WebSocket chat&lt;/strong&gt; — bidirectional with cancel support&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Token tracking + cost&lt;/strong&gt; — per-session usage across all providers&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Quick start
&lt;/h2&gt;

&lt;p&gt;[1m&lt;br&gt;
╔══════════════════════════════════════════════════════╗&lt;br&gt;
║       ICECODE Super-Agent Network — Installer        ║&lt;br&gt;
║                    v2.0.0                            ║&lt;br&gt;
╚══════════════════════════════════════════════════════╝&lt;br&gt;
[0m&lt;br&gt;
[0;33m→[0m Checking Python version...&lt;br&gt;
[0;32m✓[0m Python 3.12 found&lt;br&gt;
[0;32m✓[0m Virtual environment already exists&lt;br&gt;
[0;33m→[0m Installing Python dependencies...&lt;br&gt;
Starting ICECODE server on &lt;a href="http://localhost:13210" rel="noopener noreferrer"&gt;http://localhost:13210&lt;/a&gt;...&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m |   React UI served at /desktop/&lt;br&gt;
INFO:     Started server process [423207]&lt;br&gt;
INFO:     Waiting for application startup.&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m | ============================================================&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m | ICECODE Super-Agent Network v2.0.0 starting...&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m |   Port: 13210&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m |   DB:   ~/.icecode/data/icecode.db&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m |   Home: /home/claudiu/.icecode&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m | ============================================================&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m |   [✓] Database initialized at ~/.icecode/data/icecode.db&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m |   [✓] Self-learning system ready&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m |   [✓] Cron scheduler ready&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m |   [✓] Goals system (Ralph Loop) ready&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m |   [✓] Knowledge auto-index task started&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m | All ICECODE systems online.&lt;br&gt;
INFO:     Application startup complete.&lt;br&gt;
ERROR:    [Errno 98] error while attempting to bind on address ('0.0.0.0', 13210): address already in use&lt;br&gt;
INFO:     Waiting for application shutdown.&lt;br&gt;
[32m23:36:33[0m | [1mINFO   [0m | ICECODE shutting down...&lt;br&gt;
INFO:     Application shutdown complete.&lt;/p&gt;

&lt;h2&gt;
  
  
  Architecture
&lt;/h2&gt;

&lt;h2&gt;
  
  
  Cost Optimizer (new in v2)
&lt;/h2&gt;

&lt;p&gt;The cost optimizer has three components that work together automatically:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Semantic Cache&lt;/strong&gt; — uses sentence-transformer embeddings + cosine similarity (≥0.92 threshold). If you ask a semantically similar question to a previous one, it returns the cached answer instantly — zero API tokens consumed.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Context Compressor&lt;/strong&gt; — when conversation history exceeds 3000 tokens, older messages are summarized instead of sent verbatim. Keeps the last 6 messages intact.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Smart Model Router&lt;/strong&gt; — analyzes prompt complexity (score 1-10) and routes to the cheapest capable model. Simple questions go to cheap models, complex reasoning goes to powerful ones.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  Multi-Agent Swarm
&lt;/h2&gt;

&lt;p&gt;Two orchestration modes:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Pipeline&lt;/strong&gt;: agents run sequentially, each building on the previous output.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Parallel&lt;/strong&gt;: all agents receive the same input simultaneously, results are merged.&lt;/p&gt;

&lt;p&gt;Built-in templates: Research &amp;amp; Write, Code Review, Brainstorm, Security Audit.&lt;/p&gt;

&lt;h2&gt;
  
  
  Local RAG
&lt;/h2&gt;

&lt;p&gt;No API keys needed. No data leaves your machine.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Supports , , , , , , , &lt;/li&gt;
&lt;li&gt;Chunk strategy: sliding window (512 tokens, 50 overlap)&lt;/li&gt;
&lt;li&gt;Embeddings:  (90MB, runs locally)&lt;/li&gt;
&lt;li&gt;Vector store: FAISS IndexFlatL2&lt;/li&gt;
&lt;li&gt;Persistence: &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Index a directory:&lt;br&gt;
{"detail":[{"type":"json_invalid","loc":["body",0],"msg":"JSON decode error","input":{},"ctx":{"error":"Expecting value"}}]}&lt;/p&gt;

&lt;h2&gt;
  
  
  Tech stack
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Backend&lt;/strong&gt; — Python 3.12 · FastAPI · Uvicorn · SQLite · Pydantic v2 · FAISS · sentence-transformers · Loguru&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Frontend&lt;/strong&gt; — Single HTML file, no build step · Pure JS ES2022 · CSS variables · Server-Sent Events + WebSocket&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;TypeScript&lt;/strong&gt; — pnpm workspaces · turbo · CLI with Ink TUI · 14 LLM provider protocols · MCP client&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Desktop&lt;/strong&gt; — Electron 33&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Infrastructure&lt;/strong&gt; — Docker · GitHub Actions CI · pytest (108 tests) · ruff&lt;/p&gt;

&lt;h2&gt;
  
  
  Links
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;GitHub: &lt;a href="https://github.com/iceslim409/icecode" rel="noopener noreferrer"&gt;https://github.com/iceslim409/icecode&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Demo GIF in README shows all major pages&lt;/li&gt;
&lt;li&gt;MIT-style non-commercial license (ICECODE-NC-1.0)&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;If you try it, I'd love to hear what you think. Issues and PRs welcome.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>python</category>
      <category>opensource</category>
      <category>selfhosted</category>
    </item>
  </channel>
</rss>
