🎙️ Voice Notes Transcriber & Organizer

Giacomo Verdi · 2025-05-26T23:16:24Z

This is a submission for the Postmark Challenge: Inbox Innovators.

What I Built

I built Voice Notes Transcriber, an AI-powered system that turns voice memos sent by email into searchable, organized text notes. Email an audio file to your Postmark inbound address, and the system automatically:

🎙️ Transcribes the audio using the Google Speech-to-Text API
🤖 Generates summaries and extracts action items with AI
🏷️ Auto-categorizes notes using NLP
🔍 Makes everything searchable
📊 Provides a beautiful dashboard to manage your notes
🔄 Optionally syncs to Notion

It solves the problem that voice notes are quick to create but hard to organize and search through later.

Demo

Live App: https://voice-notes.jugaad.digital/

Test Credentials:
Email: demo@voicenotes.app
Password: demo123

How to Test:
1. Log in with the test credentials
2. Send an audio file (MP3, WAV, or M4A) to your-address@inbound.postmarkapp.com
3. Wait about 30 seconds for processing (the exact time depends on the audio duration)
4. See your transcribed note appear in the dashboard!

Screenshots

Dashboard View
Audio Player with Transcription
Email Processing Flow

Code Repository

giacomoverdi / voice-notes-transcriber

🎙️ Voice Notes Transcriber & Organizer

An intelligent system that automatically transcribes voice notes sent by email, using Postmark's inbound email parsing and the Google Speech-to-Text API, and organizes them with AI-based categorization.
✨ Features

Core Features

📧 Email-to-Transcription: Send voice notes as email attachments to get instant transcriptions
🎯 AI Processing: Automatic transcription using Google Speech-to-Text
📝 Smart Summaries: Summary generation and action item extraction
🏷️ Auto-Categorization: Intelligent, content-based categorization
🔍 Full-Text Search: Search across transcriptions, summaries, and metadata
📱 Responsive Dashboard: Elegant web interface for managing notes

Advanced Features

🔄 Notion Integration: Sync transcribed notes with your Notion workspace
🎵 Audio Playback: Built-in audio player with waveform visualization
🌐 Multilingual Support: Transcribe audio in multiple languages
📊 Analytics Dashboard: Monitor usage patterns and insights
🔐 …

How I Built It

Tech Stack

Backend: Node.js, Express, PostgreSQL, Redis
Frontend: React, Vite, TailwindCSS
AI/ML: Google Speech-to-Text API, Google Cloud AI
Email: Postmark Inbound Email Parsing
Storage: Google Cloud Storage (optional) or local
Infrastructure: Docker, Docker Compose, Nginx

Postmark Implementation

The core feature uses Postmark's inbound email parsing webhook:

```javascript
// Webhook endpoint that receives emails from Postmark
async handleInboundEmail(req, res) {
  const inboundEmail = req.body;

  // Validate webhook signature for security
  if (!postmarkService.validateWebhookSignature(req)) {
    return res.status(401).json({ error: 'Invalid signature' });
  }

  // Extract audio attachments (fall back to an empty list when the email has none)
  const audioAttachments = (inboundEmail.Attachments ?? []).filter(att =>
    ['audio/mpeg', 'audio/wav', 'audio/mp4'].includes(att.ContentType)
  );

  // Process each audio file
  for (const attachment of audioAttachments) {
    // Decode base64 audio
    const audioBuffer = Buffer.from(attachment.Content, 'base64');

    // Save to Google Cloud Storage or local
    const audioUrl = await storageService.uploadAudio(audioBuffer);

    // Queue transcription job
    // Note: `user` is not defined in this excerpt; it is assumed to be
    // looked up from the sender earlier in the real handler.
    await transcriptionQueue.add({
      audioUrl,
      userId: user.id,
      emailSubject: inboundEmail.Subject
    });
  }

  // Send confirmation email
  await postmarkService.sendProcessingConfirmation(inboundEmail.From);

  // Acknowledge the webhook so Postmark does not retry the delivery
  return res.sendStatus(200);
}
```
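The attachment-filtering and decoding step from the handler above can also be sketched on its own, which makes it easy to try without Postmark or a database. This is a minimal sketch, not the repository's code: `extractAudioAttachments` and the sample payload are hypothetical, and the only things taken from the post are the `Attachments`, `ContentType`, and `Content` fields and the three accepted MIME types. Stripping parameters from the content type is a defensive assumption, in case a mail client sends something like `audio/mpeg; name=memo.mp3`.

```javascript
// Hypothetical helper (not from the repository): select audio attachments
// from an inbound email payload and decode their base64 content.
const AUDIO_TYPES = new Set(['audio/mpeg', 'audio/wav', 'audio/mp4']);

function extractAudioAttachments(inboundEmail) {
  // Treat a missing or malformed Attachments field as "no attachments"
  const attachments = Array.isArray(inboundEmail?.Attachments)
    ? inboundEmail.Attachments
    : [];

  return attachments
    .filter((att) => {
      // Compare only the bare MIME type, lowercased, without parameters
      const type = String(att.ContentType ?? '').split(';')[0].trim().toLowerCase();
      return AUDIO_TYPES.has(type);
    })
    .map((att) => ({
      name: att.Name,
      contentType: att.ContentType,
      // Attachment content arrives base64-encoded
      buffer: Buffer.from(att.Content ?? '', 'base64'),
    }));
}

// Usage with a hand-written sample payload (made-up data)
const sample = {
  Subject: 'Standup notes',
  Attachments: [
    {
      Name: 'memo.mp3',
      ContentType: 'audio/mpeg',
      Content: Buffer.from('fake-audio').toString('base64'),
    },
    { Name: 'logo.png', ContentType: 'image/png', Content: '' },
  ],
};

const audio = extractAudioAttachments(sample);
console.log(audio.map((a) => a.name)); // only memo.mp3 is kept
```

Keeping this step as a pure function means the webhook handler only has to deal with storage and queueing, and emails without attachments fall through without throwing.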

Top comments (2)

Dotallio • May 27

This is actually super useful for organizing all those scattered voice notes. Which use case do you see getting the most traction so far?

Giacomo Verdi • Jun 26

For example, I have a lot of meetings, and it could help me summarize a meeting, its focal points, and so on. We can use the AI not only for transcribing but also for summarizing.
