DEV Community

Cover image for I built a content tool where the AI runs 100% in the browser — no servers, no API keys
Lukasz
Lukasz

Posted on

I built a content tool where the AI runs 100% in the browser — no servers, no API keys

Most "AI content" tools send everything you write to a server. I didn't want that — so I built 24U to run the model entirely in the browser.

The idea

Paste one article, note, or voice memo → get 11 platform-native posts (LinkedIn, Instagram, TikTok, X, Reddit, YouTube, Threads, Pinterest, Telegram, WhatsApp, Facebook). Each written for that platform's style, not copy-paste.

The twist: nothing you write ever leaves your device.

How it runs locally

  • WebLLM + WebGPU load a quantized LLM into the browser. After a one-time ~2GB download it runs offline.
  • No backend inference, no API keys, no per-token cost — the "server" is the user's own GPU.
  • COOP/COEP headers enable SharedArrayBuffer (required for the WASM/WebGPU runtime).
  • ONNX/native deps are kept out of the server bundle so the Next.js function stays tiny.

Tradeoffs (honest)

  • Needs a WebGPU browser — Chrome/Edge today, Safari/Firefox are WIP.
  • That one-time 2GB download is the price of full privacy + offline use.
  • A small in-browser model isn't GPT-4 — but for "rewrite this for platform X" it's more than enough, and it never sees a server.

Why one-time pricing

No server compute to pay for → no subscription. You own it once.

Building 24U solo and in public — privacy-first AI that runs 100% in your browser (competitive analysis, SWOT & content from your data, nothing uploaded). I'm all-in and running lean. If it resonates, support means the world — backers get a free lifetime license 🙏 ko-fi.com/lukaszdev

Curious what the dev crowd thinks of in-browser inference as a privacy pattern — would you trust "runs locally" over "we promise we don't read your data"?

Try it: https://24-u.vercel.app/landing

Top comments (0)