DEV Community

Cover image for Speech-to-text AssemblyAi
Bill
Bill

Posted on • Edited on

Speech-to-text AssemblyAi

This is a submission for the AssemblyAI Challenge : Sophisticated Speech-to-Text.

What I Built

Using the assemblyAI's test audio file, I transcribed a conversation between a weather podcast host and his expert guest discussing wildfires.

Using the AI's ability to detect individual speakers, I transcribed the audio using the utterance feature and labeled each speaker either 'host' or 'guest', respectively.

Rather than display all the text, I allow readers to paginate through sections of the conversation so they can read without scrolling on a standard-size computer screen.

Click the label associated with a section of host-guest text to view it on a webpage. 

Demo

Link to project
Github

Journey

I used AssemblyAI's speech-to-text Model to transcribe an audio file into a user-friendly, readable format.

  • used AssemblyAI to transcribe from an audio file and differentiate between speakers.
  • rendered in a user-friendly, readable format.
  • tested successfully for accessibility (tabs through nicely).
  • creatively used pagination to break up the conversation into sections.
  • let users focus on a single question-and-answer section.

Prompts

I accomplished the speech-to-text prompt with the provided audio file but did not use other prompts like streaming audio.

Team Member Submission

Just me - William Pope

Heroku

Build apps, not infrastructure.

Dealing with servers, hardware, and infrastructure can take up your valuable time. Discover the benefits of Heroku, the PaaS of choice for developers since 2007.

Visit Site

Top comments (0)

Postmark Image

Speedy emails, satisfied customers

Are delayed transactional emails costing you user satisfaction? Postmark delivers your emails almost instantly, keeping your customers happy and connected.

Sign up

👋 Kindness is contagious

Please leave a ❤️ or a friendly comment on this post if you found it helpful!

Okay