DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Cosine Similarity Failed Our RAG on Exact Terms — BM25 Fixed It

Cosine Similarity Failed Our RAG on Exact Terms — BM25 Fixed It

1
Comments
6 min read
RetailRAG-AI: AI-Powered Retail Intelligence

RetailRAG-AI: AI-Powered Retail Intelligence

Comments
2 min read
Index-RAG: Citation-first approach to RAG

Index-RAG: Citation-first approach to RAG

1
Comments
5 min read
When CLAUDE.md Stops Working: Adding Vector Memory to Claude Code

When CLAUDE.md Stops Working: Adding Vector Memory to Claude Code

1
Comments
10 min read
AIGoat - AI Security Playground to Attack and Defend LLMs. All Running Locally

AIGoat - AI Security Playground to Attack and Defend LLMs. All Running Locally

2
Comments 1
3 min read
How I Built a Hallucination Detector for RAG Pipelines in Python

How I Built a Hallucination Detector for RAG Pipelines in Python

Comments 1
3 min read
The architecture of persistent AI memory: Beyond simple vector search

The architecture of persistent AI memory: Beyond simple vector search

Comments
2 min read
~1ms hybrid graph + vector queries (network is now the bottleneck)

~1ms hybrid graph + vector queries (network is now the bottleneck)

Comments
3 min read
Compound AI Systems: How I Connect Multiple Models in a Single Production Product

Compound AI Systems: How I Connect Multiple Models in a Single Production Product

Comments
2 min read
RAG Architecture: Building AI Apps That Know Your Data" platform

RAG Architecture: Building AI Apps That Know Your Data" platform

1
Comments
10 min read
Why Your LLM Ignores Detailed Instructions (It's Not a Bug)

Why Your LLM Ignores Detailed Instructions (It's Not a Bug)

Comments
2 min read
Most GenAI chatbot tutorials stop at “call an LLM get an answer.”

Most GenAI chatbot tutorials stop at “call an LLM get an answer.”

Comments
1 min read
The Next Frontier of AI Agent Runtimes: Observability, MCP, and High-Precision RAG

The Next Frontier of AI Agent Runtimes: Observability, MCP, and High-Precision RAG

5
Comments
3 min read
RAG finds chunks. TrailGraph finds answers. Here's the difference.

RAG finds chunks. TrailGraph finds answers. Here's the difference.

1
Comments
7 min read
Beyond Vector Search: Building a Clause Forest (FoC) Architecture for Financial RAG

Beyond Vector Search: Building a Clause Forest (FoC) Architecture for Financial RAG

Comments
7 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.