DEV Community

Cover image for Enhance Your RAG Application With Web Searching Capability!
Veli EROGLU
Veli EROGLU

Posted on • Edited on

5

Enhance Your RAG Application With Web Searching Capability!

Introduction

When building fun projects with Retrieval-Augmented Generation (RAG) applications, we often face limitations like browsing restrictions, making it hard to get the latest information or current data, like weather updates (i hope something more funny). To solve this, we can equip our RAG application with tools to search the internet. Let’s dive in!

 Our Tool-bench

  • LangChain (Framework for building applications with large language models)
  • SearXNG (free metasearch engine)
  • CPython (a C language wrapper :> )
  • Docker

Setup

First we start with the SearXNG installation.

1 -) Get SearXNG-docker

git clone https://github.com/searxng/searxng-docker.git

2 -) Edit the .env file to set the hostname and an email

3 -) Generate the secret key

<Linux>

sed -i "s|ultrasecretkey|$(openssl rand -hex 32)|g" searxng/settings.yml

<MacOS>
sed -i"" -e "s|ultrasecretkey|$(openssl rand -hex 32)|g" searxng/settings.yml 

<Windows>
$randomBytes = New-Object byte[] 32
(New-Object Security.Cryptography.RNGCryptoServiceProvider).GetBytes($randomBytes)
$secretKey = -join ($randomBytes | ForEach-Object { "{0:x2}" -f $_ })
(Get-Content searxng/settings.yml) -replace 'ultrasecretkey', $secretKey | Set-Content searxng/settings.yml
Enter fullscreen mode Exit fullscreen mode

4 -) Update the searxng/settings.yml to enable available search formats and disable the limiter for our LangChain instance:

use_default_settings: true
server:
  # base_url is defined in the SEARXNG_BASE_URL environment variable, see .env and docker-compose.yml
  secret_key: "<secret-key>"  # change this!
  limiter: false
  image_proxy: true
ui:
  static_use_hash: true
redis:
  url: redis://redis:6379/0

search:
    formats:
        - html
        - json
Enter fullscreen mode Exit fullscreen mode

5-) Run SearXNG Instance

docker compose up

Check the SearXNG deployment in Docker. If everything looks good, you’re ready to continue.

 Demo Application

1 -) Create a virtual environment & activate

python3 -m venv .venv
source .venv/bin/activate
Enter fullscreen mode Exit fullscreen mode

2 -) Install Langchain

pip install langchain langchain-community
Enter fullscreen mode Exit fullscreen mode

3 -) Create main.py

## Simple Get Results
from langchain_community.utilities import SearxSearchWrapper
import pprint

s = SearxSearchWrapper(searx_host="http://localhost:8080",)
result = s.results("What is RAG?", num_results=10, engines=["google"])
pprint.pprint(result)

## Github Tool

from langchain_community.tools.searx_search.tool import SearxSearchResults

wrapper = SearxSearchWrapper(searx_host="**")
github_tool = SearxSearchResults(name="Github", wrapper=wrapper,
                            kwargs = {
                                "engines": ["github"],
                                })

Enter fullscreen mode Exit fullscreen mode

And there you have it! Your RAG application now has search capabilities. This guide doesn’t introduce anything new but aims to bring together the steps for adding web searching functionality to your RAG application. I hope it helps!

Hostinger image

Get n8n VPS hosting 3x cheaper than a cloud solution

Get fast, easy, secure n8n VPS hosting from $4.99/mo at Hostinger. Automate any workflow using a pre-installed n8n application and no-code customization.

Start now

Top comments (0)

The Most Contextual AI Development Assistant

Pieces.app image

Our centralized storage agent works on-device, unifying various developer tools to proactively capture and enrich useful materials, streamline collaboration, and solve complex problems through a contextual understanding of your unique workflow.

👥 Ideal for solo developers, teams, and cross-company projects

Learn more

👋 Kindness is contagious

Please leave a ❤️ or a friendly comment on this post if you found it helpful!

Okay