DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Why “Just Prompting” Fails on Private Data: A RAG Post‑Mortem

Why “Just Prompting” Fails on Private Data: A RAG Post‑Mortem

1
Comments 1
5 min read
The End of the Memory Tax: How Google’s TurboQuant is Rewriting the Rules of Local RAG Systems

The End of the Memory Tax: How Google’s TurboQuant is Rewriting the Rules of Local RAG Systems

2
Comments
3 min read
Why Search Breaks in Production

Why Search Breaks in Production

Comments
6 min read
Building a Multi-Agent Research System with LangGraph: How I Taught Three AI Agents to Collaborate

Building a Multi-Agent Research System with LangGraph: How I Taught Three AI Agents to Collaborate

Comments
6 min read
AI Agents: Cost-Optimized Orchestration & Robust Text-to-SQL with Python

AI Agents: Cost-Optimized Orchestration & Robust Text-to-SQL with Python

Comments
4 min read
I built a production RAG pipeline. Here's what most tutorials skip.

I built a production RAG pipeline. Here's what most tutorials skip.

3
Comments 2
7 min read
Embeddings Just Went Multimodal: What Sentence Transformers 5.4 Means for RAG

Embeddings Just Went Multimodal: What Sentence Transformers 5.4 Means for RAG

Comments
2 min read
Building a production-ready RAG pipeline

Building a production-ready RAG pipeline

2
Comments 1
10 min read
Marker, hosted: a scientific PDF parser API with LaTeX equations preserved

Marker, hosted: a scientific PDF parser API with LaTeX equations preserved

Comments
4 min read
RAG Series (15): CRAG — Self-Correcting When Retrieval Falls Short

RAG Series (15): CRAG — Self-Correcting When Retrieval Falls Short

Comments
6 min read
Anthropic Launches Managed Agents, Optimize LLM Context, Python Memory Needed

Anthropic Launches Managed Agents, Optimize LLM Context, Python Memory Needed

Comments
3 min read
A Vector Database Is Not a RAG Pipeline -And Confusing the Two Will Cost You

A Vector Database Is Not a RAG Pipeline -And Confusing the Two Will Cost You

Comments
7 min read
When Your AI Wiki Outgrows the Context Window — A Practical Guide to RAG

When Your AI Wiki Outgrows the Context Window — A Practical Guide to RAG

Comments
6 min read
Everyone Building AI Research Tools Is Solving the Wrong Problem

Everyone Building AI Research Tools Is Solving the Wrong Problem

4
Comments
7 min read
Building a Local Code Search System with Ollama and AST-Aware RAG

Building a Local Code Search System with Ollama and AST-Aware RAG

1
Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.