DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
VaultGuard-AI: Building a Local-First Hybrid Search RAG for Private Equity Intelligence

VaultGuard-AI: Building a Local-First Hybrid Search RAG for Private Equity Intelligence

Comments
5 min read
The Knowledge Base That Lied to 10,000 Customers (And How We Caught It)

The Knowledge Base That Lied to 10,000 Customers (And How We Caught It)

Comments
6 min read
The “Too Smart” Knowledge Base Problem: When Your AI Knows Too Much for Its Own Good

The “Too Smart” Knowledge Base Problem: When Your AI Knows Too Much for Its Own Good

Comments
5 min read
Beyond RAG: Building an AI Companion with "Deep Memory" using Knowledge Graphs

Beyond RAG: Building an AI Companion with "Deep Memory" using Knowledge Graphs

20
Comments 28
7 min read
Choosing the Right Vector Embedding Model and Dimension: A School Analogy That Makes Everything Clear

Choosing the Right Vector Embedding Model and Dimension: A School Analogy That Makes Everything Clear

4
Comments 4
5 min read
I Built a pip-installable RAG Chatbot — Chat With Any Document in 3 Lines of Python

I Built a pip-installable RAG Chatbot — Chat With Any Document in 3 Lines of Python

1
Comments
2 min read
Beyond the Context Window: Choosing Between RAG and MCP

Beyond the Context Window: Choosing Between RAG and MCP

Comments 1
3 min read
I Built Vector-Only Search First. Here's Why I Had to Rewrite It.

I Built Vector-Only Search First. Here's Why I Had to Rewrite It.

1
Comments
4 min read
Dev Log: Building a Secure RAG Agent for 150k Records

Dev Log: Building a Secure RAG Agent for 150k Records

1
Comments
3 min read
AI Agents Don’t Scale Like Chatbots

AI Agents Don’t Scale Like Chatbots

5
Comments 9
2 min read
how we built the most advanced ai product planner

how we built the most advanced ai product planner

10
Comments
3 min read
Distilling Knowledge into Tiny LLMs

Distilling Knowledge into Tiny LLMs

1
Comments
3 min read
A New Era of Determinism

A New Era of Determinism

6
Comments
8 min read
Brave Search MCP Server Token Optimization

Brave Search MCP Server Token Optimization

1
Comments
4 min read
I built a RAG system where hallucinations aren't acceptable. Here's what actually worked.

I built a RAG system where hallucinations aren't acceptable. Here's what actually worked.

Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.