If you use n8n for workflow automation, you can now add voice AI to any workflow. Our community node is live on npm.
Install it from Settings → Community Nodes in your n8n instance:
n8n-nodes-leanvox
That's it. No Docker config, no environment variables, no SDK installation.
What Can You Build?
The node covers the full LeanVox API — text-to-speech, speech-to-text, and multi-speaker dialogue. Here are some workflows that take minutes to set up:
Blog to Podcast
RSS feed triggers → extract article text → LeanVox Generate Speech → upload MP3 to S3 or your podcast host. Every new blog post automatically becomes an audio version.
Meeting Transcriber
Webhook receives recording → LeanVox Transcribe (with diarization + summary) → post summary to Slack. Know who said what without listening to the whole meeting.
Multilingual Voicemail
Form submission → LeanVox Generate Speech in 10 languages → email each version. One form, global reach.
Content Moderation Pipeline
Audio upload webhook → LeanVox Transcribe → scan transcript for flagged keywords → alert on Slack or email. Automate audio review at scale.
Available Operations
The node gives you 8 operations:
- Generate Speech — text to audio using Standard (fast), Pro (238 curated voices), or Max (instruction-based voice design)
- Generate Speech (Async) — for long text — kicks off a background job so your workflow doesn't time out
- Check Job — poll an async job until complete
- Dialogue — multi-speaker conversations with different voices per line
- Transcribe — audio → text with optional speaker diarization and AI summary
- List Voices — get all available voice IDs
- List Curated Voices — browse 238 curated voices with preview audio
- Check Balance — see your remaining credits
Setup in 60 Seconds
- In n8n, go to Settings → Community Nodes → Install
- Enter
n8n-nodes-leanvox - Add a LeanVox API credential with your API key (get one here)
- Drag the LeanVox node into any workflow
Example: Text to Speech in a Workflow
- Add a Manual Trigger or Webhook node
- Add the LeanVox node
- Set Resource to Speech, Operation to Generate
- Pick a model:
standardfor speed,profor voice quality,maxfor custom voice instructions - Set a voice ID (e.g.
podcast_conversational_female) - Pass your text
The node returns JSON with an audio_url you can pass to any downstream node — upload to S3, send via email, post to Slack, whatever your workflow needs.
Async for Long Content
For longer text (articles, chapters, scripts), use Generate Speech (Async). It queues a background job and returns a job_id. Chain it with the Check Job operation to poll until complete. No timeout issues, even for book-length content.
Pricing
Same credits as the API. Standard $0.005/1K chars · Pro $0.01/1K chars · Max $0.03/1K chars · Transcription $0.002/min. Full pricing.
Links
Originally published at leanvox.com/blog
Top comments (0)