Real-Time Voice Assistant with AssemblyAI
This is a submission for the AssemblyAI Voice Agents Challenge
What I Built
This project is a real-time voice assistant designed for the Real-Time Voice Performance category of the AssemblyAI Challenge.
It listens continuously and reacts instantly to spoken commands like:
- "Dime la hora" (Tell me the time)
- "Enciende la luz" (Turn on the light)
- "Apaga la luz" (Turn off the light)
- "Activa la alarma" (Activate the alarm)
With latency under 100 ms, it demonstrates fast, natural voice interaction, which makes it a good fit for use cases such as smart homes or accessibility tools.
Demo Video
Here's a short demo showing how the voice assistant works in real time:
Watch the video on Google Drive
GitHub Repository
GitHub: Calcedo87/AI-Voice-Agent
Technical Overview
The assistant uses a modular architecture based on the following components:
- Audio Input: `MicrophoneStream` captures real-time audio.
- Command Matching: `handle_command()` detects commands using fuzzy matching.
- Text-to-Speech: Uses `pyttsx3` for voice responses.
- AssemblyAI Integration: Real-time transcription via the WebSocket streaming API.
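To make the command-matching step concrete, here is a minimal sketch of how fuzzy matching could work using only the standard library's `difflib`. It is not the project's actual `handle_command()`: the `COMMANDS` table, the `match_command()` helper, the reply strings, and the `0.75` cutoff are all assumptions made for illustration.

```python
import difflib
from datetime import datetime
from typing import Callable, Dict, Optional

# Hypothetical command table: spoken phrase -> action that returns a reply.
COMMANDS: Dict[str, Callable[[], str]] = {
    "dime la hora": lambda: datetime.now().strftime("It is %H:%M"),
    "enciende la luz": lambda: "Light on",
    "apaga la luz": lambda: "Light off",
    "activa la alarma": lambda: "Alarm activated",
}


def match_command(transcript: str, cutoff: float = 0.75) -> Optional[str]:
    """Return the known phrase closest to the transcript, or None."""
    # Lowercase and trim surrounding punctuation added by the transcriber.
    normalized = transcript.lower().strip(" .,!?¡¿")
    matches = difflib.get_close_matches(
        normalized, list(COMMANDS), n=1, cutoff=cutoff
    )
    return matches[0] if matches else None


def handle_command(transcript: str) -> Optional[str]:
    """Run the matched command and return its reply text, if any."""
    phrase = match_command(transcript)
    return COMMANDS[phrase]() if phrase else None
```

With this sketch, a slightly misheard transcript such as "enciende la lus" still resolves to "enciende la luz", while unrelated speech falls below the cutoff and is ignored. The reply string could then be passed to the text-to-speech component.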
AssemblyAI Integration Snippet
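```python
from urllib.parse import urlencode

import websocket  # provided by the websocket-client package

# CONNECTION_PARAMS, MY_API_KEY, and the on_* callbacks are defined elsewhere in the project.
API_ENDPOINT = f"wss://streaming.assemblyai.com/v3/ws?{urlencode(CONNECTION_PARAMS)}"

ws_app = websocket.WebSocketApp(
    API_ENDPOINT,
    header={"Authorization": MY_API_KEY},
    on_open=on_open,
    on_message=on_message,
    on_error=on_error,
    on_close=on_close,
)
```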
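The snippet above passes an `on_message` callback but does not show its body. Below is a rough sketch of what such a callback might look like, not the project's actual handler. It assumes the streaming API delivers JSON messages with a `type` field, and that messages of type `Turn` carry `transcript` and `end_of_turn` fields; verify those names against the current AssemblyAI documentation. The `final_transcript()` helper is hypothetical.

```python
import json
from typing import Optional


def final_transcript(raw_message: str) -> Optional[str]:
    """Return the transcript of a completed turn, or None for anything else."""
    try:
        data = json.loads(raw_message)
    except json.JSONDecodeError:
        return None
    if not isinstance(data, dict):
        return None
    # Assumed message shape: {"type": "Turn", "end_of_turn": bool, "transcript": str}
    if data.get("type") == "Turn" and data.get("end_of_turn"):
        text = (data.get("transcript") or "").strip()
        return text or None
    return None


def on_message(ws, raw_message):
    """WebSocketApp callback: act only on completed turns."""
    text = final_transcript(raw_message)
    if text:
        print(f"Heard: {text}")
```

Keeping the parsing in a separate pure function makes it easy to test without opening a WebSocket connection. Skipping partial turns means each command is handled once, at the cost of waiting for the end-of-turn signal.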
---
Built by [@Calcedo87](https://github.com/Calcedo87)
Thanks for checking out my project!