I've been building a speech-to-text API and wanted to see if there's interest before pushing it further.
It's powered by Whisper Large V3 Turbo and supports both real-time WebSocket streaming and pre-recorded transcription.
Planned features & pricing:
- Pre-recorded audio: $0.03/hour
- Real-time WebSocket streaming: $0.10/hour
- Speaker diarization
- Punctuation detection
- VAD support
Mostly looking for feedback and a few early users to test it out.
Waitlist link: https://makeform.ai/f/mtwDANdO
Top comments (0)