DEV Community

Sunder Kumar
Sunder Kumar

Posted on

Speech to Text using Assembly AI

This is a submission for the AssemblyAI Challenge : Sophisticated Speech-to-Text.

What I Built

I built a Speech-to-Text Application that showcases the power of Universal-2, AssemblyAI’s latest speech-to-text model. The application:

  1. Supports Multilingual Transcription Users can choose from multiple languages, ensuring global accessibility.
  2. Outputs with Formatting and Timestamps application delivers well-structured transcripts, complete with proper nouns, punctuation, and timestamps.
  3. User-Friendly Interface built using Streamlit, the app features an intuitive frontend for easy navigation and interaction.

Demo

Link to Github Repository

Journey

Incorporating Universal-2:
The application utilizes Universal-2 through AssemblyAI’s robust API. The backend:

  1. Uploads audio files using AssemblyAI's upload endpoint.
  2. Submits transcription requests, including optional parameters like language_code and punctuate.
  3. Polls transcription progress until completion and fetches the final transcript with timestamps, and word-by-word breakdown.

Screenshots

Home Page
Audio Processing
Final Results

Team Submission:
I worked on this project independently-Sunder Kumar

Image of Timescale

🚀 pgai Vectorizer: SQLAlchemy and LiteLLM Make Vector Search Simple

We built pgai Vectorizer to simplify embedding management for AI applications—without needing a separate database or complex infrastructure. Since launch, developers have created over 3,000 vectorizers on Timescale Cloud, with many more self-hosted.

Read more →

Top comments (0)

Image of Docusign

🛠️ Bring your solution into Docusign. Reach over 1.6M customers.

Docusign is now extensible. Overcome challenges with disconnected products and inaccessible data by bringing your solutions into Docusign and publishing to 1.6M customers in the App Center.

Learn more

👋 Kindness is contagious

Please leave a ❤️ or a friendly comment on this post if you found it helpful!

Okay