Most "AI content" tools send everything you write to a server. I didn't want that — so I built 24U to run the model entirely in the browser.
The idea
Paste one article, note, or voice memo → get 11 platform-native posts (LinkedIn, Instagram, TikTok, X, Reddit, YouTube, Threads, Pinterest, Telegram, WhatsApp, Facebook). Each written for that platform's style, not copy-paste.
The twist: nothing you write ever leaves your device.
How it runs locally
- WebLLM + WebGPU load a quantized LLM into the browser. After a one-time ~2GB download it runs offline.
- No backend inference, no API keys, no per-token cost — the "server" is the user's own GPU.
-
COOP/COEPheaders enableSharedArrayBuffer(required for the WASM/WebGPU runtime). - ONNX/native deps are kept out of the server bundle so the Next.js function stays tiny.
Tradeoffs (honest)
- Needs a WebGPU browser — Chrome/Edge today, Safari/Firefox are WIP.
- That one-time 2GB download is the price of full privacy + offline use.
- A small in-browser model isn't GPT-4 — but for "rewrite this for platform X" it's more than enough, and it never sees a server.
Why one-time pricing
No server compute to pay for → no subscription. You own it once.
Building 24U solo and in public — privacy-first AI that runs 100% in your browser (competitive analysis, SWOT & content from your data, nothing uploaded). I'm all-in and running lean. If it resonates, support means the world — backers get a free lifetime license 🙏 ko-fi.com/lukaszdev
Curious what the dev crowd thinks of in-browser inference as a privacy pattern — would you trust "runs locally" over "we promise we don't read your data"?
Try it: https://24-u.vercel.app/landing
Top comments (0)