DEV Community

Cover image for Day 10: Transcription with 3 lines of Python
Dilek Karasoy for Picovoice

Posted on • Edited on

Day 10: Transcription with 3 lines of Python

Amazon Transcribe, Google Speech-to-Text, Azure Cognitive Services, IBM Watson, AssemblyAI, DeepGram, Speechmatics, and Rev, …all provide APIs to transcribe audio files. So why should you care about Picovoice Leopard? Just for the Free Tier? Nope! We have more reasons:

Let's get started!

1- Install
Install Leopard from a terminal:

pip3 install pvleoparddemo
Enter fullscreen mode Exit fullscreen mode

2- Try it
Grab your free AccessKey from Picovoice Console and run the microphone demo from the terminal:

leopard_demo_mic --access_key ${YOUR_ACCESS_KEY}
Enter fullscreen mode Exit fullscreen mode

3- Build
Create an instance of Leopard:

from pvleopard import *
o = create(access_key=${YOUR_ACCESS_KEY})
Enter fullscreen mode Exit fullscreen mode

Transcribe an audio file:

transcript, words = o.process_file(${YOUR_AUDIO_FILE_PATH})
print(transcript)
Enter fullscreen mode Exit fullscreen mode

Enjoy!

AWS Security LIVE!

Join us for AWS Security LIVE!

Discover the future of cloud security. Tune in live for trends, tips, and solutions from AWS and AWS Partners.

Learn More

Top comments (0)

Sentry image

See why 4M developers consider Sentry, “not bad.”

Fixing code doesn’t have to be the worst part of your day. Learn how Sentry can help.

Learn more

👋 Kindness is contagious

Please leave a ❤️ or a friendly comment on this post if you found it helpful!

Okay