Imagine this: you're in the middle of your fourth-round interview, having a great chat about your recent projects (not the most optimized ones, admittedly), when the interviewer throws out the ultimate stress-test question:
"If this project suddenly blew up and you got hit with 2 million search queries, how would you handle the traffic?"
TL;DR: The VS Code team recently open-sourced docfind, a search engine built with Rust and WebAssembly. Instead of sending user searches to a database (which gets crazy expensive at 2 million users), docfind packs the entire search engine and your website's index into a tiny ~2.7MB file. The user downloads it once, and all the searching happens instantly right inside their browser. Zero servers, zero API costs, and lightning-fast results. To skip straight to the implementation, jump to Start Building.
Introduction: The Interview
Normally, this is where you recite the standard developer script. You talk about scaling up to bigger servers, setting up complex Elasticsearch clusters, caching things with Redis, or just throwing your wallet at an expensive managed service like Algolia.
Instead, I took a risk and pitched them my current favorite tech obsession.
Drumroll, please...
"What if," I said, "we handled those 2 million searches without a single database hit? What if our servers handled exactly zero search traffic?"
The interviewer leaned in. "Okay, I'm all ears." That's when I introduced them to my utter obsession with WebAssembly, and specifically, an incredible new tool I discovered called docfind.
Recently, the engineering team behind VS Code wanted to make their documentation search faster. But instead of buying a massive server to handle all the thousands of developers searching their site every day, they did something brilliantly out-of-the-box. They wrote a tiny, super-fast search engine in a language called Rust, and compiled it into WebAssembly (Wasm).
If you're new to WebAssembly, think of it as a way to run heavy-duty code directly inside a user's web browser at near-native speed, side-by-side with JavaScript.
With docfind, when a user visits your site, they silently download a single, tiny file (about 2.7MB). That file contains everything: the search algorithm and all the searchable text on your website. When they type in the search bar, the search happens entirely on their own laptop or phone.
🤔 Why Not Just Use Algolia or Elasticsearch?
If you've ever built search for a website, your mind probably jumps to one of two standard solutions: setting up a backend search server (like Elasticsearch or Typesense) or paying for a slick, managed API service (like Algolia).
These tools are incredibly powerful, but when you hit serious scale—like our hypothetical 2 million searches—they introduce massive trade-offs:
- The Server-Side Tax: With traditional backend search, every single time a user types a keystroke ("W"... "A"... "S"...), their browser fires off a network request to a server. The server crunches the data, formats a response, and sends it back across the internet. At 2 million queries, you are looking at serious server stress, complex caching layers, and (if you are using a paid service) a very painful monthly bill. Plus, you are at the mercy of network latency. It will never feel truly, perfectly "instant."
- The JavaScript Bloat: To avoid those server costs, developers sometimes use pure client-side JavaScript libraries (like Lunr.js). This moves the search into the user's browser, which is great! But it introduces a new problem: massive file sizes. The VS Code team actually tested this exact approach. When they fed their documentation into a JS search library, the generated index file swelled to over 10MB. Forcing users to download a 10MB dictionary file on a mobile connection just to use your search bar is a terrible user experience.
The WebAssembly "Goldilocks" Solution
This is exactly why docfind and WebAssembly are such a breakthrough for static sites.
By writing the search engine in Rust and compiling it directly to WebAssembly, the VS Code team hit the perfect sweet spot:
Zero Server Traffic: The search happens 100% locally on the user's device. No network latency, no API limits, and absolutely zero server costs.
Incredibly Small Footprint: Unlike bloated JavaScript indexes, WebAssembly uses a highly compact binary format. By combining Rust's memory efficiency with an advanced string compression algorithm (FSST), the entire VS Code docs index was squished down to just ~2.7MB.
Near-Native Speed: WebAssembly runs alongside JavaScript but executes at near-native hardware speeds. It can parse that binary data and return search results in roughly 0.4 milliseconds.
You get the raw power of a backend server, the cost-efficiency of a static HTML file, and performance speeds that pure JavaScript simply can't touch.
🛠️ Let’s Build It: Adding WebAssembly Search to Your Site
The coolest part about docfind isn't just the underlying tech; it’s how easy the VS Code team made it for the rest of us to use. You don't need to know how to write Rust, and you don't need to understand memory management.
You just need a JSON file and some basic JavaScript. Here is how you can implement it for your own documentation or product pages.
Step 1: Install the CLI
First, we need the docfind tool. This is a simple Command Line Interface (CLI) that takes your website's content and magically compiles it into that tiny WebAssembly file.
Open your terminal and run:
(Mac/Linux)
curl -fsSL https://microsoft.github.io/docfind/install.sh | sh
(Windows PowerShell)
irm https://microsoft.github.io/docfind/install.ps1 | iex
Step 2: Prepare Your Data
docfind needs to know what it is searching through. You need to create a file called documents.json. This file should contain an array of all the pages, products, or docs you want people to find.
Each entry in the array describes one searchable page. In the frontend snippet later on we read title, href, and category from each result, so include at least those fields along with whatever text you want indexed.
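If you just want something to test with, here is a minimal sample you could drop into documents.json. The title, href, and category fields are the ones the frontend snippet below reads; the text field is my assumption for the indexed content, so check the docfind README for the exact schema:

```json
[
  {
    "title": "Getting Started",
    "href": "/docs/getting-started",
    "category": "Guides",
    "text": "Install the CLI, prepare your data, and build the index."
  },
  {
    "title": "Configuration Reference",
    "href": "/docs/config",
    "category": "Reference",
    "text": "Every option you can pass to the build command."
  }
]
```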
Pro-tip: If you use a static site generator like Astro, Hugo, or Next.js, you can easily write a quick script to generate this JSON file during your build process!
Step 3: Build the WebAssembly Index
Once you have your documents.json ready, run this single command:
docfind documents.json output
Boom. Magic happens here. The CLI analyzes your text, extracts the most important keywords, compresses everything, and injects it into a pre-built WebAssembly template.
Check your output folder. You will see two important files:
- docfind_bg.wasm (the actual search engine and your compressed data)
- docfind.js (a tiny JavaScript wrapper to help you talk to the WASM file)
Step 4: Add it to Your Frontend
Now, just move those two files into your website's public folder.
test-directory/
├── index.html
├── documents.json
└── public/
    └── search/
        ├── docfind.js
        └── docfind_bg.wasm
Here is how you actually trigger a search using plain JavaScript: you import the function and pass it the user's search query. Below is a complete HTML implementation you can copy and paste:
<!DOCTYPE html>
<html lang="en">
<body>
  <input type="text" id="search-input" placeholder="Type to search..." />
  <ul id="results"></ul>

  <script type="module">
    // Import the search function.
    // Ensure docfind_bg.wasm is in the same folder as docfind.js
    import search from './public/search/docfind.js';

    const input = document.getElementById('search-input');
    const list = document.getElementById('results');

    input.addEventListener('input', async (e) => {
      const query = e.target.value;

      // This is where the magic happens.
      // The first time this runs, it downloads the WASM file.
      // Subsequent searches are instant (~0.4ms).
      const results = await search(query);

      // Render results
      list.innerHTML = results
        .map(doc => `
          <li>
            <a href="${doc.href}">
              <strong>${doc.title}</strong>
              <small>${doc.category}</small>
            </a>
          </li>`)
        .join('');
    });
  </script>
</body>
</html>
And that’s it! search returns an array of the objects you defined in your documents.json, ranked by relevance.
You’ve just built a serverless, incredibly fast search engine. If 2 million people visit your site tomorrow and start searching, your server won't even notice.
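Two small hardening tweaks are worth considering on top of the handler above (these are my additions, not part of docfind): debounce the input listener so you are not running a search on literally every keystroke, and escape the document fields before interpolating them into innerHTML. A sketch:

```javascript
// Helpers you can layer on top of the search handler above.

// Escape text before dropping it into innerHTML, so a stray
// angle bracket in a title can't inject markup.
function escapeHtml(text) {
  return String(text)
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;')
    .replace(/"/g, '&quot;');
}

// Run `fn` only after the user has stopped typing for `delay` ms.
function debounce(fn, delay) {
  let timer;
  return (...args) => {
    clearTimeout(timer);
    timer = setTimeout(() => fn(...args), delay);
  };
}

// Usage with the listener from the snippet above:
// input.addEventListener('input', debounce(async (e) => { /* search */ }, 100));
// and render with `${escapeHtml(doc.title)}` instead of `${doc.title}`.
```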
🪄 How docfind Works (The 30-Second Version)
Think of docfind as a super-smart, lightning-fast librarian that lives directly in your users' browsers. To make the search so fast and the file so small, it relies on a "magic trio" of tools:
1. RAKE (The Highlighter): Before the search even begins, this tool scans all your website's text and pulls out only the most important keywords, ignoring filler words like "the" or "and."
2. FSST (The Shrink-Ray): Text takes up a lot of space. This tool squishes all your website's words down into a tiny, compressed format so the search file downloads instantly.
3. FST (The Speed Index): When a user starts typing in the search bar, this acts like the index at the back of a textbook. Instead of reading every page to find a match, it instantly points exactly to where the right answer lives.
All three of these are bundled into one tiny WebAssembly (WASM) file. That means the user downloads the entire search engine in a split second, and all the actual searching happens directly on their device—no servers required!
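docfind's real FST is far more sophisticated than this, but the "index at the back of a textbook" idea can be shown with a toy prefix map in a few lines of JavaScript. This is purely illustrative, not docfind's implementation:

```javascript
// Toy version of the "speed index" idea: precompute a map from
// keyword prefixes to document ids, so a lookup is a single map hit
// instead of a scan over every document.
function buildPrefixIndex(docs) {
  const index = new Map();
  docs.forEach((doc, id) => {
    for (const word of doc.text.toLowerCase().split(/\W+/)) {
      for (let len = 1; len <= word.length; len++) {
        const prefix = word.slice(0, len);
        if (!index.has(prefix)) index.set(prefix, new Set());
        index.get(prefix).add(id);
      }
    }
  });
  return index;
}

function lookup(index, query) {
  return [...(index.get(query.toLowerCase()) || [])];
}

const index = buildPrefixIndex([
  { text: 'WebAssembly runs fast' },
  { text: 'Rust compiles to WebAssembly' },
]);
```

Typing "web" resolves instantly because the answer set was computed ahead of time, at build time, which is exactly the trade docfind makes: do the expensive work once, ship the result.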
⚖️ The Trade-Offs: Is docfind Right for You?
Before you rip out your entire backend search architecture, it is important to remember that every engineering decision comes with trade-offs. While WebAssembly is practically magic, docfind is specifically designed for a certain type of content.
The Pros:
- Zero Backend Costs: You completely eliminate database queries and API fees.
- Infinite Scalability: Whether you get 10 visitors or 2 million, the "server load" remains exactly zero.
- Instantaneous Speed: Searches resolve in milliseconds (~0.4ms) because there is no network latency.
- Privacy: The user's search queries never leave their device.
The Cons (The Reality Check):
- The Caching Trap: Browsers heavily cache .wasm and .js files. If you update your website's documentation, returning users might still have the older index file cached in their browser. They will be searching an outdated version of your site unless you implement strict cache-busting strategies (like appending a hash to the filename on every build).
- Not for Dynamic Data: Because the index is compiled into a static file, docfind is strictly for content that changes at build time (like documentation, blogs, or static product catalogs). It is a poor fit for frequently changing data, like live inventory, social media feeds, or anything that updates by the minute.
🏆 Credit Where It’s Due & Resources
A massive shoutout to the VS Code Engineering Team for building this and open-sourcing it. It’s incredibly inspiring to see big teams pushing the boundaries of what the browser can handle.
If you want to dive deeper, test it out, or yell at my code, here are all the links you need:
- 📖 The Inspiration: Read the fantastic original engineering blog by the VS Code team: Building docfind: Fast Client-Side Search with Rust and WebAssembly.
- 🐙 The Official Source: Check out the inner workings in the microsoft/docfind GitHub Repo.
- 🎮 The Live Demo: Try their Interactive Live Demo. (Pro-tip: Open your browser's Network tab before you search. Watching a search return results without a single network request firing is incredibly satisfying).
- 💻 Try My Code: I’ve put together a clean, ready-to-use repository containing the JavaScript setup we walked through. Clone it, run a local server, and test it yourself: github.com/404rakshit/docfind-blog.
Have you tried playing around with WebAssembly yet? Let me know your thoughts in the comments below!
Happy coding! 🚀





