DEV Community

# rag

Retrieval-augmented generation (RAG) is an architectural approach that can improve large language model (LLM) applications by retrieving relevant custom data and supplying it to the model as context.

Posts

Why Naive Similarity Search Will Destroy Your RAG Agent (And What To Do Instead)
4 min read
Why RAG Is Failing at Complex Questions (And How Knowledge Graphs Fix It)
6 min read
RAG Is Not Dead: Advanced Retrieval Patterns That Actually Work in 2026
6 min read
Our Group project was chaos until this agent
6 min read
Why Most RAG Systems Fail in Production (And How to Design One That Actually Works)
4 min read
Bringing The Receipts - 95% AI LLM Token Savings
10 min read
Building a Perplexity Clone for Local LLMs in 50 Lines of Python
6 min read
Scaling LLMs at the Edge: A journey through distillation, routers, and embeddings
20 min read
Self-Improving RAG: Teaching Claude Code to Learn From Errors
6 min read
RAG + FastAPI in Action: Creating a Smart Business Analytics Dashboard in Python
9 min read
I Built a Fully Local Paper RAG on an RTX 4060 8GB — BGE-M3 + Qwen2.5-32B + ChromaDB
10 min read
What MCP Actually Is (And Why It Exists)
4 min read
How CodiLay Reads a Codebase the Way a Detective Reads a Crime Scene
9 min read
Anatomy of a RAG System Architecture
5 min read
What Is OpenViking?
3 min read