DEV Community

StackFoss
StackFoss

Posted on • Originally published at stackfoss.com on

Buzz - Offline Audio Transcription and Translation Tool Powered by OpenAI's Whisper

Buzz is an open-source application that can transcribe and translate audio in real-time. It can import audio and video files and export transcripts to TXT, SRT, and VTT formats. Buzz supports the following models: Whisper, Whisper.cpp, Whisper-compatible Hugging Face models, and the OpenAI Whisper API. It is available on Mac, Windows, and Linux.

To install Buzz, you need to download the latest version for your operating system from the Buzz GitHub releases page. For macOS 11.7 and later, you can install Buzz via Brew or download and run the Buzz-x.y.z.dmg file. For Windows 10 and later, download and run the Buzz-x.y.z.exe file. For Ubuntu 20.04 and later, install libportaudio2 and download and extract the Buzz-x.y.z-unix.tar.gz file.

To start a live recording with Buzz, select a recording task, language, quality, and microphone, and click Record. The Task can be set to "Transcribe" or "Translate," Language can be set to "Detect Language" or one of the supported languages, Quality can be set to "Very Low," "Low," "Medium," or "High," and Microphone can be set to the available system microphones or the default system microphone.

Note that transcribing audio using the default Whisper model is resource-intensive. If your computer is unable to keep up with real-time transcription, consider turning on GGML inference.

Top comments (0)