DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Building a Scalable RAG System for Repository Intelligence

Building a Scalable RAG System for Repository Intelligence

Comments
3 min read
RAG Evaluation Metrics: Measuring What Actually Matters

RAG Evaluation Metrics: Measuring What Actually Matters

Comments
10 min read
**Chunklet-py: One Library to Split Them All - Sentence, Code, Docs**

**Chunklet-py: One Library to Split Them All - Sentence, Code, Docs**

Comments
2 min read
The Missing Step in RAG: Why Your Vector DB is Bloated (and how to fix it locally)

The Missing Step in RAG: Why Your Vector DB is Bloated (and how to fix it locally)

1
Comments
3 min read
**Chunklet-py: One Library to Split Them All - Sentence, Code, Docs**

**Chunklet-py: One Library to Split Them All - Sentence, Code, Docs**

Comments
2 min read
Building NovaMem: The Local-First, Open-Source Vector Database for AI Agents

Building NovaMem: The Local-First, Open-Source Vector Database for AI Agents

Comments
3 min read
Dual-Source AI: Integrating RAG & Live Search for Real-Time Answers

Dual-Source AI: Integrating RAG & Live Search for Real-Time Answers

Comments
5 min read
RAG(Retrieval-Augmented Generation) Demystified: A Question-First Guide for Software Developers

RAG(Retrieval-Augmented Generation) Demystified: A Question-First Guide for Software Developers

Comments
7 min read
The Orphan Axiom Problem in Ontology-Based RAG

The Orphan Axiom Problem in Ontology-Based RAG

Comments
6 min read
Agentic RAG: Letting LLMs Choose What to Retrieve

Agentic RAG: Letting LLMs Choose What to Retrieve

Comments
11 min read
Optimal Chunking for Ontology RAG: Empirical Analysis & Orphan Axiom Problem

Optimal Chunking for Ontology RAG: Empirical Analysis & Orphan Axiom Problem

Comments
12 min read
Semantic Caching with Bifrost: Reduce LLM Costs and Latency by Up to 70%

Semantic Caching with Bifrost: Reduce LLM Costs and Latency by Up to 70%

Comments
7 min read
How to Evaluate Your RAG System: A Complete Guide to Metrics, Methods, and Best Practices

How to Evaluate Your RAG System: A Complete Guide to Metrics, Methods, and Best Practices

Comments
18 min read
Enterprise RAG Architecture: A Complete Technical Guide by AgenixHub

Enterprise RAG Architecture: A Complete Technical Guide by AgenixHub

Comments
2 min read
OWL-Aware Chunking Strategies: A Comprehensive Performance Analysis

OWL-Aware Chunking Strategies: A Comprehensive Performance Analysis

Comments
12 min read
Por Qué el 83% de Herramientas de Detección de Alucinaciones RAG Fallan en Producción

Por Qué el 83% de Herramientas de Detección de Alucinaciones RAG Fallan en Producción

Comments
3 min read
Why AI Video Feels Unreliable — and What Reference-to-Video Fixes

Why AI Video Feels Unreliable — and What Reference-to-Video Fixes

Comments
2 min read
AWS Knowledge Bases: Building Intelligent, Context-Aware Applications at Scale

AWS Knowledge Bases: Building Intelligent, Context-Aware Applications at Scale

1
Comments
3 min read
TrueFoundry vs Bifrost: Why We Chose Specialization Over an All-in-One MLOps Platform

TrueFoundry vs Bifrost: Why We Chose Specialization Over an All-in-One MLOps Platform

6
Comments
5 min read
Building a Page-Level PDF Processing Pipeline for Smarter RAG Systems

Building a Page-Level PDF Processing Pipeline for Smarter RAG Systems

Comments
7 min read
Human-in-the-Loop Systems: Building AI That Knows When to Ask for Help

Human-in-the-Loop Systems: Building AI That Knows When to Ask for Help

Comments
17 min read
Prompt -> RAG -> Eval: System Overview for LLM Engineers

Prompt -> RAG -> Eval: System Overview for LLM Engineers

Comments
3 min read
Implementing Retrieval-Augmented Generation (RAG) with Real-World Constraints

Implementing Retrieval-Augmented Generation (RAG) with Real-World Constraints

Comments
3 min read
Our attempt to reduce the boring 40–60% of AI engineering

Our attempt to reduce the boring 40–60% of AI engineering

2
Comments 1
2 min read
Functional MCP AI System Diagram

Functional MCP AI System Diagram

Comments
1 min read
loading...