DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
RAG(Retrieval-Augmented Generation) Demystified: A Question-First Guide for Software Developers

RAG(Retrieval-Augmented Generation) Demystified: A Question-First Guide for Software Developers

Comments
7 min read
Data Governance in RAG Systems: Security, Privacy, and Compliance by Design

Data Governance in RAG Systems: Security, Privacy, and Compliance by Design

1
Comments
4 min read
The Orphan Axiom Problem in Ontology-Based RAG

The Orphan Axiom Problem in Ontology-Based RAG

Comments
6 min read
Building a Scalable RAG System for Repository Intelligence

Building a Scalable RAG System for Repository Intelligence

Comments 1
3 min read
Agentic RAG: Letting LLMs Choose What to Retrieve

Agentic RAG: Letting LLMs Choose What to Retrieve

Comments
11 min read
Optimal Chunking for Ontology RAG: Empirical Analysis & Orphan Axiom Problem

Optimal Chunking for Ontology RAG: Empirical Analysis & Orphan Axiom Problem

Comments
12 min read
Semantic Caching with Bifrost: Reduce LLM Costs and Latency by Up to 70%

Semantic Caching with Bifrost: Reduce LLM Costs and Latency by Up to 70%

Comments
7 min read
How to Evaluate Your RAG System: A Complete Guide to Metrics, Methods, and Best Practices

How to Evaluate Your RAG System: A Complete Guide to Metrics, Methods, and Best Practices

Comments
18 min read
Enterprise RAG Architecture: A Complete Technical Guide by AgenixHub

Enterprise RAG Architecture: A Complete Technical Guide by AgenixHub

Comments
2 min read
Por Qué el 83% de Herramientas de Detección de Alucinaciones RAG Fallan en Producción

Por Qué el 83% de Herramientas de Detección de Alucinaciones RAG Fallan en Producción

Comments
3 min read
OWL-Aware Chunking Strategies: A Comprehensive Performance Analysis

OWL-Aware Chunking Strategies: A Comprehensive Performance Analysis

Comments
12 min read
Why AI Video Feels Unreliable — and What Reference-to-Video Fixes

Why AI Video Feels Unreliable — and What Reference-to-Video Fixes

Comments
2 min read
Routing, Load Balancing, and Failover in LLM Systems

Routing, Load Balancing, and Failover in LLM Systems

5
Comments
3 min read
Human-in-the-Loop Systems: Building AI That Knows When to Ask for Help

Human-in-the-Loop Systems: Building AI That Knows When to Ask for Help

Comments
17 min read
I Built a PDF Chat App in Under an Hour Using RAG- Here's How You Can Too

I Built a PDF Chat App in Under an Hour Using RAG- Here's How You Can Too

Comments
3 min read
Prompt -> RAG -> Eval: System Overview for LLM Engineers

Prompt -> RAG -> Eval: System Overview for LLM Engineers

Comments
3 min read
Implementing Retrieval-Augmented Generation (RAG) with Real-World Constraints

Implementing Retrieval-Augmented Generation (RAG) with Real-World Constraints

Comments
3 min read
Functional MCP AI System Diagram

Functional MCP AI System Diagram

Comments
1 min read
Engineers who explore build better AI products

Engineers who explore build better AI products

2
Comments
2 min read
Why GenAI Observability Breaks in Production

Why GenAI Observability Breaks in Production

Comments
2 min read
Launching your personal assistant

Launching your personal assistant

5
Comments
14 min read
Why RAG is the Future of Search (And How Elastic Search Makes it Possible )

Why RAG is the Future of Search (And How Elastic Search Makes it Possible )

1
Comments
4 min read
Before You Build a Client RAG/Agent: My Pre-Build Checklist (With Examples + What to Automate)

Before You Build a Client RAG/Agent: My Pre-Build Checklist (With Examples + What to Automate)

Comments
5 min read
Multi-Step Reasoning and Agentic Workflows: Building AI That Plans and Executes

Multi-Step Reasoning and Agentic Workflows: Building AI That Plans and Executes

Comments
16 min read
I made a fast, structured PDF extractor for RAG; 300 pages a second

I made a fast, structured PDF extractor for RAG; 300 pages a second

Comments
3 min read
loading...