DEV Community

Cover image for Speech-to-text AssemblyAi
Bill
Bill

Posted on • Edited on

Speech-to-text AssemblyAi

This is a submission for the AssemblyAI Challenge : Sophisticated Speech-to-Text.

What I Built

Using the assemblyAI's test audio file, I transcribed a conversation between a weather podcast host and his expert guest discussing wildfires.

Using the AI's ability to detect individual speakers, I transcribed the audio using the utterance feature and labeled each speaker either 'host' or 'guest', respectively.

Rather than display all the text, I allow readers to paginate through sections of the conversation so they can read without scrolling on a standard-size computer screen.

Click the label associated with a section of host-guest text to view it on a webpage. 

Demo

Link to project
Github

Journey

I used AssemblyAI's speech-to-text Model to transcribe an audio file into a user-friendly, readable format.

  • used AssemblyAI to transcribe from an audio file and differentiate between speakers.
  • rendered in a user-friendly, readable format.
  • tested successfully for accessibility (tabs through nicely).
  • creatively used pagination to break up the conversation into sections.
  • let users focus on a single question-and-answer section.

Prompts

I accomplished the speech-to-text prompt with the provided audio file but did not use other prompts like streaming audio.

Team Member Submission

Just me - William Pope

Top comments (0)