DEV Community

GargeeBhatnagar for AWS Community Builders

Posted on

3

Input Voice into Text Conversion using Amazon Transcribe

“ I have checked the documents of AWS for the input voice into text conversion using amazon transcribe. Amazon Transcribe service has batch transcription jobs and streaming transcription jobs option. In terms of cost, the solution is cheaper and secure.”

Amazon Transcribe is a fully managed, automatic speech recognition (ASR) service that makes it easy for developers to add speech to text capabilities to their applications. It is powered by a next-generation, multi-billion parameter speech foundation model that delivers high accuracy transcriptions for streaming and recorded speech. Thousands of customers across industries use it to automate manual tasks, unlock rich insights, increase accessibility, and boost discoverability of audio and video content.

In this post, you will experience the input voice into text conversion using amazon transcribe. Here I have created a s3 bucket and batch transcription job.

Architecture Overview

Image description
The architecture diagram shows the overall deployment architecture with data flow, s3 bucket and amazon transcribe.

Solution overview

The blog post consists of the following phases:

  1. Create a Amazon Transcribe as Batch Transcription with S3 Bucket
  2. Transcribe Job Creation Output as Audio File or Video File MP4 Conversion Voice into Text

Phase 1: Create a Amazon Transcribe as Batch Transcription with S3 Bucket

  1. Open the amazon transcribe console, create a transcription job with specify job details as name, language, model type, input file location in s3 bucket and output data location type. In addition, we can set audio settings, content removal and customization settings as required.

Image description

Image description

Image description

Image description

Image description

Image description

Image description

Image description

Image description

Phase 2: Transcribe Job Creation Output as Audio File or Video File MP4 Conversion Voice into Text

Image description

Image description

Image description

Image description

Image description

Clean-up

Amazon Transcribe, s3 bucket.

Pricing

I review the pricing and estimated cost of this example.

Cost of Simple Storage Service = $0.023

Cost of Amazon Transcribe (Standard Batch Transcription per minute: $0.07500) = $0.0

Total Cost = $0.023

Summary

In this post, I showed “input voice into text conversion using amazon transcribe”.

For more details on Amazon Transcribe, Checkout Get started Amazon Transcribe, open the Amazon Transcribe console. To learn more, read the Amazon Transcribe documentation.

Thanks for reading!

Connect with me: Linkedin
Image description

Image of AssemblyAI tool

Transforming Interviews into Publishable Stories with AssemblyAI

Insightview is a modern web application that streamlines the interview workflow for journalists. By leveraging AssemblyAI's LeMUR and Universal-2 technology, it transforms raw interview recordings into structured, actionable content, dramatically reducing the time from recording to publication.

Key Features:
🎥 Audio/video file upload with real-time preview
🗣️ Advanced transcription with speaker identification
⭐ Automatic highlight extraction of key moments
✍️ AI-powered article draft generation
📤 Export interview's subtitles in VTT format

Read full post

Top comments (0)

👋 Kindness is contagious

Discover a treasure trove of wisdom within this insightful piece, highly respected in the nurturing DEV Community enviroment. Developers, whether novice or expert, are encouraged to participate and add to our shared knowledge basin.

A simple "thank you" can illuminate someone's day. Express your appreciation in the comments section!

On DEV, sharing ideas smoothens our journey and strengthens our community ties. Learn something useful? Offering a quick thanks to the author is deeply appreciated.

Okay