DEV Community

Sarath Chandra Pandurangi
GemmaNotes

Gemma 4 Challenge: Build With Gemma 4 Submission

This is a submission for the Gemma 4 Challenge: Build with Gemma 4

What I Built

GemmaNotes is a private, completely offline, AI voice-recorder plugin for Obsidian that turns your spoken thoughts into tidy text notes with a single click.

Unlike most dictation tools, GemmaNotes performs all of its processing on your own computer. It does not send your voice to the cloud or even require an internet connection, keeping your recordings private and secure.

Simply hold down the microphone key to record and speak your mind. Once you are done, your words appear directly at your cursor, free of filler words.

And if your spoken draft is, like mine, a rambling blob of text, you can use the built-in magic rewrite feature to fix grammar and polish your speech into clear, structured prose.

Demo

https://www.youtube.com/watch?v=_8CRAbo_a0Y

Code

https://github.com/sarath/gemmanotes

How I Used Gemma 4

GemmaNotes uses Google's Gemma 4 E2B (Effective 2B) model, with the option to choose the E4B model instead, to enable fully offline, push-to-talk voice dictation directly inside Obsidian notes. Rather than relying on a multi-stage pipeline that transcribes audio with one model and then passes the transcript to a separate large language model, Gemma 4's native multimodal capabilities handle the entire flow in a single stage, converting raw audio input directly into structured text in-process via transformers.js and WebGPU. We chose the E2B variant over E4B or the much larger, workstation-class 31B Dense model because of its performance-to-size ratio.
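Before a recording reaches a speech-capable model, it usually has to be reduced to a single channel at the sample rate the model expects. The following is a minimal sketch of that preparation step, not the plugin's actual code: the 16 kHz target and the helper names `mixToMono` and `resampleLinear` are assumptions made for illustration, and linear interpolation stands in for whatever resampler the real pipeline uses.

```typescript
// Assumption: the model's audio front end expects 16 kHz mono float samples.
const TARGET_SAMPLE_RATE = 16000;

/** Average equal-length channel buffers into one mono buffer. */
function mixToMono(channels: Float32Array[]): Float32Array {
  if (channels.length === 0) return new Float32Array(0);
  const length = channels[0].length;
  const mono = new Float32Array(length);
  for (const channel of channels) {
    for (let i = 0; i < length; i++) {
      mono[i] += channel[i] / channels.length;
    }
  }
  return mono;
}

/** Resample by linear interpolation (simple, with no anti-aliasing filter). */
function resampleLinear(
  input: Float32Array,
  fromRate: number,
  toRate: number = TARGET_SAMPLE_RATE,
): Float32Array {
  if (fromRate === toRate || input.length === 0) return input.slice();
  const outLength = Math.round((input.length * toRate) / fromRate);
  const out = new Float32Array(outLength);
  const step = fromRate / toRate; // input samples consumed per output sample
  for (let i = 0; i < outLength; i++) {
    const pos = i * step;
    const left = Math.min(Math.floor(pos), input.length - 1);
    const right = Math.min(left + 1, input.length - 1);
    const frac = pos - left;
    out[i] = input[left] * (1 - frac) + input[right] * frac;
  }
  return out;
}
```

In a browser or Electron context, the channel buffers would typically come from a decoded `AudioBuffer` via `getChannelData`, and the resulting `Float32Array` is what would be handed to the model as its audio input.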

With a download footprint of approximately 3.2 GB, E2B strikes a good balance for local desktop applications: it is lightweight enough to avoid exhausting host system RAM or overloading WebGPU runtimes while remaining capable of capturing speech accurately and generating natural punctuation.

Furthermore, this compact footprint allows the same E2B instance to power our on-demand rewriting features, tidying up raw conversational transcripts into polished prose without loading a secondary LLM. Selecting Gemma 4 E2B means GemmaNotes delivers a private note-taking assistant with no network latency, one that runs entirely locally on the user's desktop hardware without any external API keys or network reliance.
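One way to make dictation and rewriting share a single model instance is to cache the load promise so that the first caller triggers the load and every later caller reuses it. This is a hedged sketch of that pattern only: `SharedModel` and `loadFn` are hypothetical names, and the actual loading call (for example through transformers.js) is deliberately left to the caller rather than guessed at here.

```typescript
// Hypothetical helper: SharedModel and loadFn are illustrative names and are
// not taken from the GemmaNotes source.
class SharedModel<T> {
  private pending: Promise<T> | null = null;
  private readonly loadFn: () => Promise<T>;

  constructor(loadFn: () => Promise<T>) {
    this.loadFn = loadFn;
  }

  /** Returns the cached load promise, starting the load on first use. */
  get(): Promise<T> {
    if (this.pending === null) {
      this.pending = this.loadFn().catch((err: unknown) => {
        this.pending = null; // drop the cache so a later call can retry
        throw err;
      });
    }
    return this.pending;
  }
}
```

Both the dictation path and the rewrite path would then call `get()` on the same `SharedModel` object, so the multi-gigabyte weights are loaded at most once per session.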
