DEV Community

Suji Matts
Suji Matts

Posted on

Configure Google Cloud Speech-to-Text API

Overview

The Speech-to-Text API enables easy integration of Google speech recognition technologies into developer applications. It allows you to send audio and receive a text transcription from the service.

What we'll cover
In this lab, you will learn how to:

  • Create an API key
  • Create a Speech-to-Text API request
  • Call the Speech-to-Text API

Step 1: Create an API Key

In the Google Cloud Console, navigate to Navigation menu > APIs & services > Credentials.
Click on Create credentials and select API key.
Copy the generated key and click Close.

Save API Key as Environment Variable

Connect to your VM instance via SSH.
In the command line, set the environment variable

export API_KEY=<YOUR_API_KEY>

Enter fullscreen mode Exit fullscreen mode

Step 2: Create Your Speech-to-Text API Request

Create a new file named request.json:

touch request.json

Enter fullscreen mode Exit fullscreen mode

Open the file in a text editor and add the following JSON configuration, specifying the audio file’s URI:

{
  "config": {
    "encoding": "FLAC",
    "languageCode": "en-US"
  },
  "audio": {
    "uri": "gs://cloud-samples-tests/speech/brooklyn.flac"
  }
}
Enter fullscreen mode Exit fullscreen mode

Step 3: Call the Speech-to-Text API

curl -s -X POST -H "Content-Type: application/json" --data-binary @request.json "https://speech.googleapis.com/v1/speech:recognize?key=${API_KEY}"
Enter fullscreen mode Exit fullscreen mode

The response will include the transcript and a confidence score.

Save Response to a File

curl -s -X POST -H "Content-Type: application/json" --data-binary @request.json "https://speech.googleapis.com/v1/speech:recognize?key=${API_KEY}" > result.json

Conclusion

Congratulations! You have successfully used the Speech-to-Text API to transcribe an audio file. This hands-on lab demonstrated how to create an API key, construct a request, and call the Speech-to-Text service.

Read More: https://codelabs.developers.google.com/codelabs/cloud-speech-text-python3#0

Top comments (0)