DEV Community

Bill

Speech-to-text AssemblyAi

This is a submission for the AssemblyAI Challenge: Sophisticated Speech-to-Text.

What I Built

Using AssemblyAI's test audio file, I transcribed a conversation between a weather podcast host and his expert guest discussing wildfires.

Using AssemblyAI's ability to detect individual speakers, I transcribed the audio with the utterances feature and labeled each speaker as either 'host' or 'guest'.
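The labeling step can be sketched in a few lines. This is only an illustration, not the project's actual code: it assumes each utterance arrives as a dictionary with a `speaker` label ("A", "B", ...) and a `text` field, which mirrors how AssemblyAI reports utterances when speaker labels are enabled. The `ROLE_BY_SPEAKER` mapping, the `label_utterances` helper, and the sample lines are hypothetical.

```python
from typing import Dict, List

# Assumed mapping: the first detected speaker ("A") is the host and the
# second ("B") is the guest. This depends on who speaks first in the audio.
ROLE_BY_SPEAKER: Dict[str, str] = {"A": "host", "B": "guest"}


def label_utterances(utterances: List[Dict[str, str]]) -> List[Dict[str, str]]:
    """Return one {"role", "text"} dict per utterance, in original order."""
    labeled = []
    for utterance in utterances:
        # Fall back to the raw speaker label if an unexpected speaker appears.
        role = ROLE_BY_SPEAKER.get(utterance["speaker"], utterance["speaker"])
        labeled.append({"role": role, "text": utterance["text"]})
    return labeled


# Hypothetical sample data, not taken from the real transcript.
sample = [
    {"speaker": "A", "text": "What is driving this year's wildfires?"},
    {"speaker": "B", "text": "Mostly dry conditions and wind."},
]
labeled_sample = label_utterances(sample)
for turn in labeled_sample:
    print(f"{turn['role']}: {turn['text']}")
```

Keeping the mapping in one place makes it easy to swap the roles if the guest happens to speak first.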

Rather than display all the text, I allow readers to paginate through sections of the conversation so they can read without scrolling on a standard-size computer screen.

Click the label associated with a section of host-guest text to view it on a webpage. 
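One way to build those sections is to start a new page each time the host begins speaking again after the guest. The sketch below is an assumption about how this could work, not the project's real implementation; `paginate_by_question`, `get_section`, and the sample turns are hypothetical names and data.

```python
from typing import Dict, List

Turn = Dict[str, str]


def paginate_by_question(turns: List[Turn]) -> List[List[Turn]]:
    """Group turns into sections, each opening with the host's question."""
    sections: List[List[Turn]] = []
    for turn in turns:
        # Open a new section for the very first turn, or when the host
        # speaks right after a non-host turn (a new question begins).
        starts_new = not sections or (
            turn["role"] == "host" and sections[-1][-1]["role"] != "host"
        )
        if starts_new:
            sections.append([])
        sections[-1].append(turn)
    return sections


def get_section(sections: List[List[Turn]], page: int) -> List[Turn]:
    """Return the section for a 1-based page number, clamped to valid pages."""
    if not sections:
        return []
    index = min(max(page, 1), len(sections)) - 1
    return sections[index]


# Hypothetical sample turns, not taken from the real transcript.
turns = [
    {"role": "host", "text": "What is driving this year's wildfires?"},
    {"role": "guest", "text": "Mostly dry conditions and wind."},
    {"role": "guest", "text": "Smoke can also travel a long way."},
    {"role": "host", "text": "How long will it last?"},
    {"role": "guest", "text": "That depends on the weather."},
]
pages = paginate_by_question(turns)
```

Each page label on the site would then map to one entry in `pages`, so a reader sees a single question and its answer at a time.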

Demo

Link to project
GitHub

Journey

I used AssemblyAI's speech-to-text model to transcribe an audio file into a user-friendly, readable format.

  • Used AssemblyAI to transcribe an audio file and differentiate between speakers.
  • Rendered the transcript in a user-friendly, readable format.
  • Tested successfully for accessibility (the page tabs through nicely).
  • Used pagination creatively to break up the conversation into sections.
  • Let users focus on a single question-and-answer section.

Prompts

I completed the speech-to-text prompt using the provided audio file. I did not attempt the other prompts, such as streaming audio.

Team Member Submission

Just me - William Pope
