DEV Community

# rag

Retrieval-augmented generation (RAG) is an architectural approach that can improve the efficacy of large language model (LLM) applications by retrieving relevant custom data and supplying it to the model as context.

Posts

How to Build a Simple Persistent Memory Layer for LLM Apps (With Code)

3 min read
Building a Production-Ready RAG Chatbot with AWS Bedrock, LangChain, and Terraform

12 min read
LLM-as-a-Judge: Automated Scoring and Reliability vs. Human Evaluation

6 min read
Accuracy Is Expensive: How to Evaluate ‘Quality per $’ for Agents and RAG

6 min read
How LLM Memory Actually Works in Production Systems

4 min read
I Built a RAG Bot to Fix Flaky Cypress Tests

4 min read
What Is LLM Grounding? A Developer's Guide

6 min read
I Built a pip-installable RAG Chatbot — Chat With Any Document in 3 Lines of Python

2 min read
RAG Research: Bridging the Gap Between LLMs and Knowledge

3 min read
Zero-Downtime Embedding Migration: Switching from text-embedding-004 to text-embedding-3-large in Production

3 min read
Hybrid RAG System over SEC Filings

19 min read
Beyond the Context Window: Choosing Between RAG and MCP

3 min read
Dev Log: Building a Secure RAG Agent for 150k Records

3 min read
Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment

4 min read
I Built Vector-Only Search First. Here's Why I Had to Rewrite It.

4 min read