DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
You Don't Know RAG. You Know Simple RAG.

You Don't Know RAG. You Know Simple RAG.

Comments
4 min read
Building a Browser-Based RAG System with WebGPU

Building a Browser-Based RAG System with WebGPU

3
Comments
3 min read
# Medical RAG Architecture Overview #llmszoomcamp

# Medical RAG Architecture Overview #llmszoomcamp

Comments
5 min read
# Medical RAG Architecture Overview #llmszoomcamp

# Medical RAG Architecture Overview #llmszoomcamp

Comments
5 min read
Building Custom Evaluators for AI Applications: A Technical Guide to AI Quality Assessment

Building Custom Evaluators for AI Applications: A Technical Guide to AI Quality Assessment

Comments
19 min read
RAG LLM: Why Your AI Costs 10x More Than It Should (And How to Fix It)

RAG LLM: Why Your AI Costs 10x More Than It Should (And How to Fix It)

Comments
5 min read
# Data Ingestion & Vector Store #llmszoomcamp

# Data Ingestion & Vector Store #llmszoomcamp

Comments
2 min read
TOON Benchmarks: A Critical Analysis of Different Results

TOON Benchmarks: A Critical Analysis of Different Results

2
Comments 1
7 min read
Prompt Caching Slashed My AI Bills by 90%. Here's What Nobody Tells You.

Prompt Caching Slashed My AI Bills by 90%. Here's What Nobody Tells You.

Comments
5 min read
Train it or feed it? Teaching LLMs your data the smart way

Train it or feed it? Teaching LLMs your data the smart way

Comments
4 min read
Understanding RAG: How AI Models Learn to Search Before They Speak

Understanding RAG: How AI Models Learn to Search Before They Speak

1
Comments
3 min read
Utilizing RAG Techniques for Improved AI Agent Performance

Utilizing RAG Techniques for Improved AI Agent Performance

Comments
8 min read
About context and LLM

About context and LLM

2
Comments
7 min read
The RAG Debugging Playbook: A Step-by-Step Guide to Trace-Level Failures and Fixes

The RAG Debugging Playbook: A Step-by-Step Guide to Trace-Level Failures and Fixes

Comments
10 min read
Synthetic Data for RAG: Safe Generation, Deduplication, and Drift-Aware Curation in 2025

Synthetic Data for RAG: Safe Generation, Deduplication, and Drift-Aware Curation in 2025

2
Comments
10 min read
Why Your AI Agents Keep Dropping the Ball—and How LangChain Plus PyTorch Can Salvage Your Solo Gig

Why Your AI Agents Keep Dropping the Ball—and How LangChain Plus PyTorch Can Salvage Your Solo Gig

Comments
6 min read
Building a Simple Modern RAG Application with Asyncio and Chainlit

Building a Simple Modern RAG Application with Asyncio and Chainlit

2
Comments
4 min read
Tired of AI Hallucinations? I Built a RAG App to Keep My Research Grounded.

Tired of AI Hallucinations? I Built a RAG App to Keep My Research Grounded.

5
Comments 1
4 min read
Why Most RAG Pipelines Fail in Production (and How to Fix Them)

Why Most RAG Pipelines Fail in Production (and How to Fix Them)

3
Comments
2 min read
Set up RAG with Genkit and Firebase in 15 minutes

Set up RAG with Genkit and Firebase in 15 minutes

2
Comments
6 min read
# Comprehensive Monitoring & Observability #llmszoomcamp

# Comprehensive Monitoring & Observability #llmszoomcamp

Comments
8 min read
Official Native Java Support for Docling: Building Better Apps Just Got Easier

Official Native Java Support for Docling: Building Better Apps Just Got Easier

Comments
4 min read
Building Semantica: An AI-Powered Academic Search Platform with MindsDB

Building Semantica: An AI-Powered Academic Search Platform with MindsDB

10
Comments
9 min read
Amazon S3 Vectors: When Storage Learns to Think

Amazon S3 Vectors: When Storage Learns to Think

Comments
8 min read
From 70K to 2K Tokens: Optimizing SQL Generation with RAG Architecture

From 70K to 2K Tokens: Optimizing SQL Generation with RAG Architecture

15
Comments 1
4 min read
loading...