Your AI, Your Device, Your Data - Introducing Aide

Swapnil — Mon, 25 May 2026 05:57:48 +0000

This is a submission for the Gemma 4 Challenge: Build with Gemma 4

The way we use AI on our phones is changing. People want more from their devices, but the main way we reach all these breakthroughs in AI is still a chatbot window. And we don't just want assistance, we want assistance that actually knows us. The AI should understand who you are, recognize the help you really need, and step in right where you need it. The catch is that for a long time you had to choose: you could have real personalization, or you could have real privacy, but not both. Anything that knows you that well usually ends up living in someone else's cloud.

Gemma 4 changes that. It's small enough to run on the phone in your pocket, but still capable enough to reason, see, and hold a real conversation, which means an assistant can finally know you without sending your life off to a server. That's the whole idea behind Aide: a personal, on-device assistant powered by Gemma 4, where the intelligence stays with you and you stay in control.

What I Built

Aide is a privacy-first Android app that puts a frontier-class AI model on the two surfaces you already touch every day: the system keyboard and the assistant button. Instead of asking you to open yet another chatbot, it brings Gemma 4 to where you already type, talk, and tap. Everything runs on-device by default through LiteRT-LM, and the cloud is strictly opt-in. The same loaded model also backs the in-app chat, so chat, keyboard, and assistant stay in sync.

The keyboard is a real Android input method with a transform bar sitting above the keys. You can rephrase, simplify, fix grammar, summarize, change tone, pull out key points, or turn a mess of text into bullets or a table, and a magic button runs your own one-shot custom instruction. Every transform is just a row in a local task table, so you can edit the prompt, reorder them, or add your own with a {{text}} template. Gemma 4 also handles translation on device, across 37 languages. Paired with 20+ on-device speech models, it can listen in one language and write back in another without the data ever leaving the device.
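To make the task-table idea concrete, here is a minimal sketch in Python of how a transform row with a {{text}} template could be turned into a prompt. It is an illustration under my own assumptions, not Aide's actual code: the `Task` fields, `render_prompt`, and `ordered` are hypothetical names, and the fallback for a template without a placeholder is a guess at sensible behaviour.

```python
from dataclasses import dataclass
from typing import List

PLACEHOLDER = "{{text}}"  # the template marker mentioned above


@dataclass
class Task:
    """One row of a hypothetical local task table."""
    name: str
    prompt_template: str
    position: int  # user-defined order in the transform bar


def render_prompt(task: Task, selected_text: str) -> str:
    """Substitute the user's text into the task's prompt template."""
    if PLACEHOLDER not in task.prompt_template:
        # Assumption: a template without a placeholder gets the text appended.
        return f"{task.prompt_template}\n\n{selected_text}"
    return task.prompt_template.replace(PLACEHOLDER, selected_text)


def ordered(tasks: List[Task]) -> List[Task]:
    """Return tasks in the order the user arranged them."""
    return sorted(tasks, key=lambda t: t.position)
```

Because a transform is only data, adding a new one is a matter of inserting another row, for example `Task("Fix grammar", "Fix the grammar in this text:\n{{text}}", 1)`, and the rendering path stays the same for every button on the bar.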

The assistant button opens a turn-based voice loop (speech to text, then VAD, then Gemma 4, then text to speech) that stays local by default. The built-in voice recognition and TTS on phones are fast but flat and robotic, so Aide ships a real choice of voices instead: roughly 23 Piper, Kokoro, MeloTTS, and Matcha voices for output and a stack of Zipformer, Whisper, Moonshine, Parakeet, SenseVoice, and GigaAM models for input. The whole voice toolkit runs on Sherpa-ONNX, which gives Aide one fast, on-device runtime for STT, TTS, and VAD across all of those models, so none of your audio has to leave the phone to be heard or spoken back.
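A turn-based voice loop like this can be sketched as a small pipeline of pluggable stages. The Python below is only a shape sketch under stated assumptions: every stage is a plain callable standing in for a real Sherpa-ONNX or Gemma 4 call, the function names are hypothetical, and running VAD first to find the end of the utterance is one common arrangement rather than a confirmed detail of Aide's ordering.

```python
from typing import Any, Callable, Iterable, List, Optional


def collect_utterance(
    frames: Iterable[Any],
    is_speech: Callable[[Any], bool],
    max_silence: int = 3,
) -> List[Any]:
    """Collect speech frames until max_silence consecutive silent frames follow."""
    utterance: List[Any] = []
    silence = 0
    for frame in frames:
        if is_speech(frame):
            utterance.append(frame)
            silence = 0
        elif utterance:
            # Silence only counts once the user has started speaking.
            silence += 1
            if silence >= max_silence:
                break
    return utterance


def run_turn(
    frames: Iterable[Any],
    is_speech: Callable[[Any], bool],
    transcribe: Callable[[List[Any]], str],
    generate: Callable[[str], str],
    synthesize: Callable[[str], Any],
) -> Optional[Any]:
    """Run one turn: detect the utterance, transcribe, generate, speak."""
    utterance = collect_utterance(frames, is_speech)
    if not utterance:
        return None  # nobody spoke, so there is nothing to answer
    text = transcribe(utterance)
    reply = generate(text)
    return synthesize(reply)
```

Keeping each stage behind a callable is what would make it cheap to swap one STT or TTS model for another without touching the loop itself.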

Under all of that, Gemma 4's native function calling drives a single tool dispatcher for alarms, calendar, contacts, phone, clipboard, files, web search, calculator, and time, with per-category permission toggles and a confirmation gate in front of anything destructive. Chat is fully multimodal, reading images as a Gemma 4 turn and exposing the model's reasoning through a tappable thinking chip. Pick a local weight (E2B or E4B) for daily use, or point at an Ollama endpoint when you want the heavier 26B or 31B models, swappable per chat and mid-session. The whole point is trust: an assistant that actually knows you, without shipping your life, your memories, and your conversations off to someone else's server.
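The dispatcher pattern described here, one entry point with per-category toggles and a confirmation gate, can be sketched roughly as follows. This is a simplified Python illustration, not the app's implementation: `Tool`, `ToolDispatcher`, the tool names, and the error shapes are all hypothetical, and categories are assumed to be off until the user enables them.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict


@dataclass
class Tool:
    category: str               # e.g. "calendar", "files" (illustrative)
    handler: Callable[..., Any]
    destructive: bool = False   # destructive tools need user confirmation


class ToolDispatcher:
    def __init__(self, confirm: Callable[[str, Dict[str, Any]], bool]):
        self._tools: Dict[str, Tool] = {}
        self._enabled: Dict[str, bool] = {}  # category -> permission toggle
        self._confirm = confirm

    def register(self, name: str, tool: Tool) -> None:
        self._tools[name] = tool

    def set_category_enabled(self, category: str, enabled: bool) -> None:
        self._enabled[category] = enabled

    def dispatch(self, name: str, args: Dict[str, Any]) -> Dict[str, Any]:
        """Route one model-issued function call through the permission checks."""
        tool = self._tools.get(name)
        if tool is None:
            return {"error": f"unknown tool: {name}"}
        # Assumption: a category is disabled unless explicitly turned on.
        if not self._enabled.get(tool.category, False):
            return {"error": f"category disabled: {tool.category}"}
        if tool.destructive and not self._confirm(name, args):
            return {"error": "cancelled by user"}
        return {"result": tool.handler(**args)}
```

Returning an error payload instead of raising lets the refusal go back to the model as an ordinary tool result, so it can explain to the user why nothing happened.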

And here is where the real power is: all of this works offline, apart from the pieces that need a network by nature, such as web search and the optional Ollama endpoint. One-time setup downloads the Gemma 4 weight and the voice models you want, and after that everything keeps running with the network off. No account, no sign-in, nothing to phone home to. Few apps this feature-rich stay fully on-device, and that was the bar Aide set for itself. Turn on airplane mode and the assistant is still right there.

Demo

Video walkthrough (Please excuse the poor video quality, I am not much of a video editor)

Code

Code Repository: https://github.com/swaptr/aide
The APK file is available under the Releases section.

How I Used Gemma 4

Gemma 4 is the right fit because of what an on-device personal assistant actually demands: it has to see, reason, and call tools, all while running on a phone instead of a datacenter. Gemma 4 is natively multimodal, reasons well, and carries a 128K context window, and it does all of that at a size that still fits in your pocket. That combination is rare.

For day-to-day use, Aide runs the E2B and E4B weights fully on-device through LiteRT-LM. These small variants are built for mobile and edge, and they are quick enough to back the keyboard transforms, the voice loop, and chat without a network round-trip. The same multimodal model reads an image in chat, drives native function calling for the tool dispatcher, and exposes its reasoning trace through the thinking chip. For most daily interactions, E2B and E4B carry the whole experience locally.

When you want more reasoning power, Aide hands off to an optional Ollama endpoint, swappable per chat and even mid-session. You can point it at a self-hosted Ollama server for the heavier Gemma 4 26B and 31B Dense weights and keep everything inside your own infrastructure, or use Ollama Cloud when you want frontier-grade output. Either way the choice is yours and the default stays local. That is the whole point of Aide: AI used the way it should be, an assistant that knows you without shipping your life, your memories, and your conversations to someone else's server.
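As a rough picture of the per-chat handoff, the Python sketch below keeps a local default and lets a single chat override it with an Ollama backend. The request shape follows Ollama's `/api/chat` endpoint (a JSON body with `model`, `messages`, and `stream`); everything else is an assumption for illustration, including the `ChatBackend` type, the helper names, and the model tags `gemma-4-e2b` and `gemma4:31b`, which are placeholders rather than verified identifiers.

```python
import json
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class ChatBackend:
    kind: str                       # "local" or "ollama"
    model: str                      # placeholder model tag
    base_url: Optional[str] = None  # only used for Ollama


def backend_for(
    overrides: Dict[str, ChatBackend], chat_id: str, default: ChatBackend
) -> ChatBackend:
    """Use the chat's own backend if one was chosen, else stay local."""
    return overrides.get(chat_id, default)


def build_ollama_request(
    backend: ChatBackend, messages: List[Dict[str, str]]
) -> Tuple[str, str]:
    """Build the URL and JSON body for a non-streaming Ollama chat call."""
    if backend.kind != "ollama" or not backend.base_url:
        raise ValueError("backend is not a configured Ollama endpoint")
    url = backend.base_url.rstrip("/") + "/api/chat"
    body = json.dumps(
        {"model": backend.model, "messages": messages, "stream": False}
    )
    return url, body
```

Since the override is looked up on every turn, changing it in the middle of a session only affects the next request, and removing it drops the chat back to the local default.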

Next, I want to bring some of the Gemini Live experience to Aide: sharing the screen with Gemma 4, drafting artifacts from what it sees, and then referencing those artifacts back in chat and pulling them into your writing straight from the keyboard.

Acknowledgements

Special thanks to the following projects and services that made this application possible: