Mohsin Rashid

Posted on Jun 13, 2024 • Edited on Jul 25, 2024

RAG with OLLAMA

#python #llama #ollama

In the world of natural language processing (NLP), combining retrieval and generation capabilities has led to significant advancements. Retrieval-Augmented Generation (RAG) enhances the quality of generated text by integrating external information sources. This article demonstrates how to create a RAG system using a free Large Language Model (LLM). We will be using OLLAMA and the LLaMA 3 model, providing a practical approach to leveraging cutting-edge NLP techniques without incurring costs. Whether you're a developer, researcher, or enthusiast, this guide will help you implement a RAG system efficiently and effectively.

Note: Before proceeding further you need to download and run Ollama, you can do so by clicking here.

The following is an example on how to setup a very basic yet intuitive RAG

Import Libraries

import os
from langchain_community.llms import Ollama
from dotenv import load_dotenv
from langchain_community.embeddings import OllamaEmbeddings
from langchain.document_loaders import TextLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.vectorstores import Chroma
from langchain.chains import create_retrieval_chain
from langchain import hub
from langchain.chains.combine_documents import create_stuff_documents_chain

Loading The LLM (Language Model)

llm = Ollama(model="llama3.1", base_url="http://127.0.0.1:11434")

Setting Ollama Embeddings

embed_model = OllamaEmbeddings(
    model="llama3.1",
    base_url='http://127.0.0.1:11434'
)

Loading Text

text = """
    In the lush canopy of a tropical rainforest, two mischievous monkeys, Coco and Mango, swung from branch to branch, their playful antics echoing through the trees. They were inseparable companions, sharing everything from juicy fruits to secret hideouts high above the forest floor. One day, while exploring a new part of the forest, Coco stumbled upon a beautiful orchid hidden among the foliage. Entranced by its delicate petals, Coco plucked it and presented it to Mango with a wide grin. Overwhelmed by Coco's gesture of friendship, Mango hugged Coco tightly, cherishing the bond they shared. From that day on, Coco and Mango ventured through the forest together, their friendship growing stronger with each passing adventure. As they watched the sun dip below the horizon, casting a golden glow over the treetops, they knew that no matter what challenges lay ahead, they would always have each other, and their hearts brimmed with joy.
    """

Splitting Text into Chunks

text_splitter = RecursiveCharacterTextSplitter(chunk_size=512, chunk_overlap=128)
chunks = text_splitter.split_text(text)

Creating a Vector Store (Chroma) from Text

vector_store = Chroma.from_texts(chunks, embed_model)

Creating a Retriever

retriever = vector_store.as_retriever()

Creating a Retrieval Chain

chain = create_retrieval_chain(combine_docs_chain=llm,retriever=retriever)

Retrieval-QA Chat Prompt

retrieval_qa_chat_prompt = hub.pull("langchain-ai/retrieval-qa-chat")

Combining Documents

combine_docs_chain = create_stuff_documents_chain(
    llm, retrieval_qa_chat_prompt
)

Final Retrieval Chain

retrieval_chain = create_retrieval_chain(retriever, combine_docs_chain)

Invoking the Retrieval Chain

response = retrieval_chain.invoke({"input": "Tell me name of monkeys and where do they live"})
print(response['answer'])

Top comments (3)

Muhammad Atif • Jul 25 '24

Thanks, Mohsin Rashid.
I recently read the "Rag with Ollam," and I must say, it is both good and quite informative. The content is well-researched, and the insights provided are valuable. The way it presents information is engaging and easy to understand, making it a great resource for anyone looking to deepen their knowledge on the topic. Highly recommended!

Umair Yousaf • Jul 25 '24

Excellent! Worked without any issue, Thanks

Leandro D'Archivio • Jul 22 '24

Perfect, thanks!!!