Audio Intelligence: Transcription + Speaker Labels for $0.002/min (with Free Diarization)

#python

LeanVox started as a text-to-speech API. Today it handles both sides of audio.

Meet Audio Intelligence — transcription, speaker diarization, and AI summarization in a single API call. Same API key. Same dashboard. No new account.

One endpoint. Three outputs.

from leanvox import Leanvox

client = Leanvox(api_key="lv_live_...")

result = client.audio.transcribe(
    file="meeting.mp3",
    features=["transcribe", "diarize", "summarize"]
)

print(result.formatted_transcript)
# SPEAKER_0: Welcome to the show.
# SPEAKER_1: Thanks for having me.

print(result.summary)
# "Team discussed Q1 roadmap priorities..."

Or with Node.js:

const result = await client.audio.transcribe({
  file: "meeting.mp3",
  features: ["transcribe", "diarize", "summarize"]
})

console.log(result.formatted_transcript)
console.log(result.summary)

Pricing that actually makes sense

We benchmarked Whisper Large V3 + Pyannote 3.1 on dedicated GPU hardware:

Feature	LeanVox	AssemblyAI	Deepgram
Transcription	$0.002/min	$0.0025/min	$0.0043/min
Speaker diarization	Free	+$0.007/min	+$0.014/min
Total (transcript + speakers)	$0.002/min	$0.0095/min	$0.018/min

4.75× cheaper than AssemblyAI. 9× cheaper than Deepgram. Speaker labels included free.

Your free signup credit covers 200+ minutes of TTS audio and 500 minutes of transcription. A 1-hour meeting costs $0.12.

Why diarization is free

Most providers charge extra for speaker detection. We don't — our infrastructure makes it nearly zero marginal cost (adds <0.5s to processing). We'd rather bundle it and give you a better product.

Works with the MCP server too

No code required with Claude:

{
  "mcpServers": {
    "leanvox": {
      "command": "npx",
      "args": ["leanvox-mcp"],
      "env": { "LEANVOX_API_KEY": "lv_live_..." }
    }
  }
}

Tell Claude: "Transcribe this audio file and give me a summary with speaker labels." Zero code.

What's supported

Formats: mp3, wav, ogg, flac, m4a, webm (up to 500MB)
Languages: 99 (auto-detected or specify)
Processing: Sync for files ≤5 min, async with webhook callbacks for longer
SDKs: Python and Node.js (v0.3.0)

Getting started

# Install
pip install leanvox  # or npm install leanvox

# Transcribe
curl -X POST https://api.leanvox.com/v1/audio/transcribe \
  -H "Authorization: Bearer lv_your_key_here" \
  -F "file=@audio.mp3"

No-code option: n8n

Prefer a visual workflow? The LeanVox n8n community node includes a Transcribe operation — build no-code audio pipelines (webhook → transcribe → Slack summary) in minutes. Install via Settings → Community Nodes in n8n.

→ Quickstart guide · API reference · Get your API key