Michael Madumere

Posted on Jul 2

Understanding Multi-Hop Reasoning: How Graph Databases Traverse Data for AI

#ai #rag #agents

Vector databases are highly effective for semantic search, but they often struggle when an application requires multi-step reasoning across people, systems, and events. Standard vector Retrieval-Augmented Generation (RAG) retrieves isolated chunks of text based on mathematical distance, meaning it frequently fails to capture the structured, relational context between data points.

To solve this, developers are turning to GraphRAG, which connects Large Language Models (LLMs) to a live knowledge graph that can be queried deterministically.

Here is a technical breakdown of how graph databases physically traverse data to supply LLMs with explicit, highly relevant connections, and how you can configure AI agents to control this retrieval.

1. Multi-Hop Reasoning (The Architecture of a "Hop")

In a Text-Attributed Graph, data is stored as nodes (entities like users, products, or tickets) connected by edges (relationships like "IMPACTS" or "CLONED_FROM"). A "hop" is the traversal from one node to another via an edge.

Multi-hop reasoning is the process of capturing a sequence of these relationships across multiple interconnected nodes. While standard RAG looks for a single text chunk containing an answer, multi-hop traversal allows the system to extract indirect connections that are never explicitly written together in a single document.

The Customer Support Chatbot Example:
Imagine a developer is building an internal chatbot to help customer support agents troubleshoot issues. An agent asks the chatbot: "How do I resolve the CSV upload error for user emails?"

If the developer used standard vector RAG, the system would simply perform a semantic search for the keywords "CSV upload error" and "user email". It would likely return the very first result that matches that text, which could be an outdated FAQ page, a completely unrelated bug report, or a generic troubleshooting guide. Standard RAG matches the words but misses the business context.

A multi-hop GraphRAG system, however, explicitly verifies the exact sequence of events. It traces the relationships across user profiles, historical bug tickets, and process logs to give a concrete, source-backed explanation. The graph database might traverse the following path:
(Node: User Bug Report) -> [Edge: CLONED_FROM] -> (Node: Master Engineering Ticket) -> [Edge: HAS_RESOLUTION] -> (Node: Patch Deployed).

This ensures the LLM receives the exact, verified resolution steps rather than a loose semantic match.

2. The Depth Lever (Vertical Traversal)

Depth is a query mechanism used when an AI needs to drill down vertically through layers of relationships to reach a highly specific target. This is the exact programmatic lever used for automated root-cause analysis.

In graph query languages like Cypher (used by Neo4j and Memgraph), developers control depth by specifying a range for the number of hops the database should traverse away from the starting node.

// Example: Deep traversal to find reproduction steps for a specific bug ticket
MATCH (j:Ticket {ticket_ID: 'ENT-22970'})-[:HAS_DESCRIPTION*1..5]->(desc:Description)-[:HAS_STEPS_TO_REPRODUCE*1..5]->(steps:StepsToReproduce)
RETURN steps.value

How it works: The *1..5 parameter is the depth lever. It instructs the database engine to traverse a minimum of 1 and a maximum of 5 relationship layers away from the starting ticket node to find the specific target nodes. In our customer support example, this allows the system to navigate deep, multi-tiered dependencies from the initial ticket description down to the exact reproduction steps, a path that a vector search would completely miss.

3. The Breadth Lever (Horizontal Traversal)

Breadth traversal is used when a query requires a broad overview or an understanding of an entire ecosystem. Instead of drilling down a single chain of events, breadth expands the search horizontally to explore the immediate connections directly surrounding a starting node.

By looking at how many different nodes, or specific types of nodes, a central entity is connected to, developers can retrieve a complete horizontal view of a scenario.

// Example: Breadth traversal to pull all active support tickets impacting a product line
MATCH (p:Product {name: 'Enterprise Dashboard'})-[:IMPACTED_BY]->(t:SupportTicket)
RETURN t.id, t.status

How it works: Instead of traversing deep chains, this query pulls in all parallel nodes of the type SupportTicket that are directly connected to the Enterprise Dashboard node. For a customer support chatbot, this allows the LLM to process a broad set of adjacent concepts simultaneously, such as server outages, user login bugs, and billing errors that are all impacting the same product line at that exact moment.

4. Configuring the Agentic Workflow

For developers building advanced, autonomous RAG pipelines, breadth and depth should not be hard-coded static queries. Instead, they are designed as dynamic levers controlled by a multi-agent system.

To build this architecture, you need to configure two specific agent roles:

The Coordinator Agent: This agent acts as the query planner. It evaluates the user's natural language prompt to determine the exact balance of breadth and depth required to answer the question. The agent maps its evaluation to a discrete mathematical range (e.g., a float between 0.0 and 1.0) and injects that parameter into the graph query generation. For instance, a broad research question regarding overall platform stability scores a high breadth parameter, triggering a wide horizontal search. Meanwhile, a specific troubleshooting question about a single delayed order scores a high depth parameter.
The Recursive Retrieval Agent: As the database executes the query and traverses horizontally or vertically, this agent continuously evaluates the newly discovered nodes. It dynamically decides what contextual data to keep and what to prune before the final graph extraction is formatted as a node sequence or adjacency table and passed to the generator LLM.

By wiring these agents to control graph database traversal, you build a system that dynamically scales its investigation scope, resulting in highly deterministic, complete answers that standard vector similarity simply cannot achieve.

Top comments (2)

Tae Kim • Jul 3

The chunk-mismatch problem you describe — where the answer exists in a chain of relationships that no single document captures — was the exact failure mode that pushed me from flat vector RAG toward a KG-backed approach on a financial news corpus. Entity resolution turned out to be the prerequisite nobody mentions: until the same company appeared under one canonical node (not split across "Samsung SDI," "Samsung SDI Co.," and a ticker symbol), multi-hop traversal was returning paths that crossed phantom duplicates and silently mixing two different entities. Once entity identity was enforced at write time, 1-2 hop Cypher queries over Neo4j outperformed embedding search on relationship queries with a substantial margin.

Michael Madumere • Jul 4

This is a brilliant insight. The chunk mismatch problem was a similar failure mode I experienced a while ago while building a customer service agent for a client that had multiple entities within their business. They had a hotel, a beach resort, and other booking based businesses, and we realized that with standard vector RAG, segmenting the data into fixed-length text chunks completely disconnected the related content. As a result, we consistently mismatched booking queries across the different entities.
Going back to the structure of each entity gave us clarity on how they were connected and ensured we had full resolution of the relationships at write time. We had to unify those relationships before the retrieval stage to make the architecture work.

This is a massive point. I might actually need to do a follow-up piece specifically on the engineering mechanics of entity resolution for GraphRAG just based on your comment. Thanks for sharing this!