DEV Community

Leanvox
Leanvox

Posted on

Audio Intelligence: Transcription + Speaker Labels for $0.002/min (with Free Diarization)

LeanVox started as a text-to-speech API. Today it handles both sides of audio.

Meet Audio Intelligence — transcription, speaker diarization, and AI summarization in a single API call. Same API key. Same dashboard. No new account.

One endpoint. Three outputs.

from leanvox import Leanvox

client = Leanvox(api_key="lv_live_...")

result = client.audio.transcribe(
    file="meeting.mp3",
    features=["transcribe", "diarize", "summarize"]
)

print(result.formatted_transcript)
# SPEAKER_0: Welcome to the show.
# SPEAKER_1: Thanks for having me.

print(result.summary)
# "Team discussed Q1 roadmap priorities..."
Enter fullscreen mode Exit fullscreen mode

Or with Node.js:

const result = await client.audio.transcribe({
  file: "meeting.mp3",
  features: ["transcribe", "diarize", "summarize"]
})

console.log(result.formatted_transcript)
console.log(result.summary)
Enter fullscreen mode Exit fullscreen mode

Pricing that actually makes sense

We benchmarked Whisper Large V3 + Pyannote 3.1 on dedicated GPU hardware:

Feature LeanVox AssemblyAI Deepgram
Transcription $0.002/min $0.0025/min $0.0043/min
Speaker diarization Free +$0.007/min +$0.014/min
Total (transcript + speakers) $0.002/min $0.0095/min $0.018/min

4.75× cheaper than AssemblyAI. 9× cheaper than Deepgram. Speaker labels included free.

Your $1.00 signup credit covers 500 minutes of transcription. A 1-hour meeting costs $0.12.

Why diarization is free

Most providers charge extra for speaker detection. We don't — our infrastructure makes it nearly zero marginal cost (adds <0.5s to processing). We'd rather bundle it and give you a better product.

Works with the MCP server too

No code required with Claude:

{
  "mcpServers": {
    "leanvox": {
      "command": "npx",
      "args": ["leanvox-mcp"],
      "env": { "LEANVOX_API_KEY": "lv_live_..." }
    }
  }
}
Enter fullscreen mode Exit fullscreen mode

Tell Claude: "Transcribe this audio file and give me a summary with speaker labels." Zero code.

What's supported

  • Formats: mp3, wav, ogg, flac, m4a, webm (up to 500MB)
  • Languages: 99 (auto-detected or specify)
  • Processing: Sync for files ≤5 min, async with webhook callbacks for longer
  • SDKs: Python and Node.js (v0.3.0)

Getting started

# Install
pip install leanvox  # or npm install leanvox

# Transcribe
curl -X POST https://api.leanvox.com/v1/audio/transcribe \
  -H "Authorization: Bearer lv_your_key_here" \
  -F "file=@audio.mp3"
Enter fullscreen mode Exit fullscreen mode

Quickstart guide · API reference · Get your API key

Top comments (0)