DEV Community

Cover image for I built a free offline AI assistant where three models debate each other before giving you an answer — no subscriptions, no data leaving your PC
Malx
Malx

Posted on

I built a free offline AI assistant where three models debate each other before giving you an answer — no subscriptions, no data leaving your PC

I got tired of every AI tool requiring a subscription and storing conversations on someone else's server, so I built Axiom — a Windows desktop app that runs entirely on your machine.

The main feature is the council pipeline. Instead of one model answering your question, three independent models work sequentially. The Architect breaks down the problem into a numbered plan. The Builder implements it. The Critic reviews the output and flags issues before you ever see it. If the Critic finds problems, the Builder revises. You get a confidence label on every response High Confidence, Moderate Confidence, or Flagged for Review.

It also has a normal chat mode with a Python sandbox that actually executes code and verifies math results rather than guessing, web search that extracts real page content not just snippets, LaTeX rendering for equations, and optional cloud mode via OpenRouter if you want access to larger models like GPT-OSS-120B using your own free API key.

The default model is Volta, based on Qwen3-4B Q4_K_M — about 2.5GB, runs on any modern PC without a GPU.

Everything is free. MIT licensed. No telemetry. Nothing leaves your machine in local mode.

Download and source on GitHub: [https://github.com/YoMosa2009/Axiom]

Happy to answer questions about how the pipeline works.

Top comments (0)