This guide will teach you the steps to import audio data using Langchain and develop an application capable of answering queries about an audio file, thanks to LangChain's latest integration with AssemblyAI.
What is LangChain?
Developed by Harrison Chase, and debuted in October 2022, LangChain serves as an open-source platform designed for constructing sturdy applications powered by Large Language Models, such as chatbots like ChatGPT and various tailor-made applications.
Langchain seeks to equip data engineers with an all-encompassing toolkit for utilizing LLMs in diverse use-cases, such as chatbots, automated question-answering, text summarization, and beyond.
Know more about LangChain and Large Language Models (LLMs) in my other tutorial.
A Beginner’s Guide to Building LLM-Powered Applications with LangChain!
Pavan Belagatti ・ Aug 30
What is AssemblyAI?
AssemblyAI offers the quickest route to AI-powered audio solutions. Utilize a straightforward API to tap into ready-to-use AI models designed for speech transcription and comprehension. As a company specializing in applied AI, AssemblyAI is committed to the development, training, and deployment of cutting-edge AI models that developers and product teams can seamlessly incorporate into their applications or products.
Tutorial
LangChain offers an integration with AssemblyAI that enables you to import audio data using only a handful of code lines.
Create and activate the new virtual environment
# Mac/Linux:
python3 -m venv venv
. venv/bin/activate
# Windows:
python -m venv venv
.\venv\Scripts\activate.bat
Install both LangChain and the AssemblyAI Python package
pip install langchain
pip install assemblyai
Set your AssemblyAI API key. You can get your free API Key here
[Note: Here, for our tutorial example, we will be using an mp3 audio file link. The audio is an interview with Peter DiCarlo, an associate professor in the Department of Environmental Health and Engineering at Johns Hopkins University, discussing the impact of Canadian wildfires on air quality in the United States. The interview covers the factors contributing to the spread of smoke, the health risks associated with high levels of particulate matter in the air, vulnerable populations, and the potential for worsening conditions due to climate change.]
Create a python file demo.py
and add the following code.
import assemblyai as aai
# replace with your API token
aai.settings.api_key = f"Your API Key"
# URL of the file to transcribe
FILE_URL = "https://github.com/AssemblyAI-Examples/audio-examples/raw/main/20230607_me_canadian_wildfires.mp3"
transcriber = aai.Transcriber()
transcript = transcriber.transcribe(FILE_URL)
print(transcript.text)
Now, run the application with the following command.
Python3 demo.py
You should see the transcript of the audio link we provided in the application.
Let's Add Question & Answer Capabilities Using OpenAI
Get the OpenAI API key and set it.
On your terminal inside the application folder, set the OPENAI API key.
export OPENAI_API_KEY=<Your API Key>
Go back to your demo.py
file and modify the code to work with question and answer format.
from langchain.document_loaders import AssemblyAIAudioTranscriptLoader
from langchain.llms import OpenAI
from langchain.chains.question_answering import load_qa_chain
FILE_URL = "https://github.com/AssemblyAI-Examples/audio-examples/raw/main/20230607_me_canadian_wildfires.mp3"
loader = AssemblyAIAudioTranscriptLoader(FILE_URL)
docs = loader.load()
llm = OpenAI()
qa_chain = load_qa_chain(llm, chain_type="stuff")
answer = qa_chain.run(input_documents=docs,
question="Where did the wildfire start?")
print(answer)
Run the application with the command Python3 demo.py
and you should see the following output. It should be an answer to your question.
The wildfires started in Canada.
Let's change the question again. Let's ask what was the professor's name who was called to talk about wildfire in Canada.
The answer should be Peter DiCarlo
Let's ask how did it impact the health of people?
Below is the answer you should receive.
Exposure to high levels of particulate matter in the air can lead to a host of health problems, including impacts to the respiratory system, cardiovascular system, and neurological system. People most vulnerable are those whose bodies are still developing (children), the elderly, and people with preexisting health conditions.
Keep asking questions related to the audio and the chatbot keeps answering your questions.
I hope this small and simple tutorial helped you learn how to set up a virtual environment, install necessary packages, and write Python code to transcribe audio files. Using LangChain and AssemblyAI makes more unique. More importantly, you've integrated OpenAI's API to add a question-answering feature to your application, making it not just a transcription tool but an interactive platform for audio data analysis.
Top comments (14)
Hello Sir I am unable to run the export command
It show an error as "'export' is not recognized as an internal or external command,
operable program or batch file."
and when I run the Python3 demo.py It shows this ValueError "Please provide an API key via the ASSEMBLYAI_API_KEY environment variable or the global settings."
Will you please help me in resolving the errors??
Please reply
Did you add your ASSEMBLYAI_API_KEY?
the command "export OPENAI_API_KEY=................."
is not executing and giving this error "'export' is not recognized as an internal or external command, operable program or batch file."
Where can I find the ASSEMBLYAI_API_KEY environment variable?
Check in the tutorial, I have mentioned 'Set your AssemblyAI API key. You can get your free API Key here'
I got the API Key and Executed the first code and got the transcript of the Audio link.
But I am struck at the Question & Answer Section because of the above mentioned errors.
Please help me in resolving the errors
Oh okay, for that you need OpenAI API Key to be set.
how to set it????
export OPENAI_API_KEY= is throwing error like "'export' is not recognized as an internal or external command, operable program or batch file."
What to do?
It is because you might not have paid account of OpenAI. You might have utilized all your free quota so it is not working for you.
Will try that soon after l am done with python and fastapi.
Thanks for sharing!
You are welcome!
Interesting side project for the weekend, cheers!
Absolutely, try it and let me know.
It's so rejuvenating.