The Problem
I needed to separate vocals from songs for a karaoke project. Building stem separation from scratch? Not happening.
Enter AudioPod
AudioPod's API handles:
- Stem separation - Extract vocals, drums, bass, and other instruments
- Text-to-music - Generate songs from descriptions
- Speech-to-text - Accurate transcription
- Noise reduction - Clean up recordings
Quick Example
import requests
# Separate stems from a song
response = requests.post(
'https://api.audiopod.ai/v1/stems/separate',
headers={'X-API-Key': 'your_key'},
json={'audio_url': 'https://example.com/song.mp3', 'mode': 4}
)
job_id = response.json()['job_id']
# Poll for completion, download stems
What I Built
A karaoke pipeline:
- Song goes in
- Vocals get isolated
- Lyrics get transcribed with timestamps
- Video gets rendered with bouncing ball
The vocal isolation quality surprised me — clean separation even on complex mixes.
Pricing
Free tier: 10 hours of processing. Enough to experiment.
Links
Building something with audio? I'd love to hear about it.
Top comments (0)