DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Prompt -> RAG -> Eval: System Overview for LLM Engineers

Prompt -> RAG -> Eval: System Overview for LLM Engineers

Comments
3 min read
Implementing Retrieval-Augmented Generation (RAG) with Real-World Constraints

Implementing Retrieval-Augmented Generation (RAG) with Real-World Constraints

Comments
3 min read
Learn How to Build Reliable RAG Applications in 2026!

Learn How to Build Reliable RAG Applications in 2026!

6
Comments 1
8 min read
Mastering Google Gemini: How to Choose Between Speed and Power (and Save Your Budget)

Mastering Google Gemini: How to Choose Between Speed and Power (and Save Your Budget)

Comments
7 min read
Functional MCP AI System Diagram

Functional MCP AI System Diagram

Comments
1 min read
Running a RAG Pipeline in a Production Full-Stack Application (Without a Vector Database)

Running a RAG Pipeline in a Production Full-Stack Application (Without a Vector Database)

Comments
6 min read
Why GenAI Observability Breaks in Production

Why GenAI Observability Breaks in Production

Comments
2 min read
Launching your personal assistant

Launching your personal assistant

5
Comments
14 min read
Before You Build a Client RAG/Agent: My Pre-Build Checklist (With Examples + What to Automate)

Before You Build a Client RAG/Agent: My Pre-Build Checklist (With Examples + What to Automate)

Comments
5 min read
Multi-Step Reasoning and Agentic Workflows: Building AI That Plans and Executes

Multi-Step Reasoning and Agentic Workflows: Building AI That Plans and Executes

Comments
16 min read
I made a fast, structured PDF extractor for RAG; 300 pages a second

I made a fast, structured PDF extractor for RAG; 300 pages a second

Comments
3 min read
RAG for Developers — Built for Code, Not Just Text (Review Requested)

RAG for Developers — Built for Code, Not Just Text (Review Requested)

Comments
1 min read
Why Static Load Balancing Fails for LLM Infrastructure (And What Works Instead)

Why Static Load Balancing Fails for LLM Infrastructure (And What Works Instead)

5
Comments
7 min read
Building a RAG-Powered Documentation Assistant: Why I Used Bifrost LLM Gateway Instead of Direct API Calls

Building a RAG-Powered Documentation Assistant: Why I Used Bifrost LLM Gateway Instead of Direct API Calls

6
Comments
5 min read
Beyond RAG: Building an Autonomous "Epistemic Engine" to Fight AI Hallucination

Beyond RAG: Building an Autonomous "Epistemic Engine" to Fight AI Hallucination

Comments
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.