DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Build and deploy a RAG pipeline as a REST API in under 5 minutes with RAGLight

Build and deploy a RAG pipeline as a REST API in under 5 minutes with RAGLight

Comments
3 min read
Fine-tuning vs RAG: Cuándo Usar Cada Enfoque para LLMs en Producción

Fine-tuning vs RAG: Cuándo Usar Cada Enfoque para LLMs en Producción

Comments
8 min read
Ask vs Act: RAG, Tool Use and AI agents

Ask vs Act: RAG, Tool Use and AI agents

2
Comments
4 min read
L88 – An Agentic Local RAG Knowledge Engine (Looking for Feedback & Contributors)

L88 – An Agentic Local RAG Knowledge Engine (Looking for Feedback & Contributors)

1
Comments 1
2 min read
Fine-tuning vs RAG: Cuándo Usar Cada Enfoque para LLMs en Producción

Fine-tuning vs RAG: Cuándo Usar Cada Enfoque para LLMs en Producción

Comments
8 min read
Build Chatbot with RAG: Why Your Architecture Matters

Build Chatbot with RAG: Why Your Architecture Matters

1
Comments
7 min read
Vector Databases for AI Agents: Which One Actually Works in Production?

Vector Databases for AI Agents: Which One Actually Works in Production?

1
Comments 2
14 min read
How Large Language Models Work: Explained Simply

How Large Language Models Work: Explained Simply

Comments
3 min read
Building a Production-Grade RAG System (Not Just a Demo)

Building a Production-Grade RAG System (Not Just a Demo)

3
Comments
8 min read
Context Engineering: The Production Problem Nobody Writes About

Context Engineering: The Production Problem Nobody Writes About

2
Comments
6 min read
Tracing a RAG Chain End-to-End: Where OpenTelemetry Stops and Where You Need to Instrument Yourself

Tracing a RAG Chain End-to-End: Where OpenTelemetry Stops and Where You Need to Instrument Yourself

2
Comments
8 min read
Implementing a RAG system: Walk

Implementing a RAG system: Walk

8
Comments 2
4 min read
Building an Agentic Access-Aware RAG System with Amazon FSx for NetApp ONTAP, S3 Vectors, and S3 Access Points— Where AI Respects File Permissions

Building an Agentic Access-Aware RAG System with Amazon FSx for NetApp ONTAP, S3 Vectors, and S3 Access Points— Where AI Respects File Permissions

2
Comments 1
16 min read
Prompt Stuffing Is Killing Your Agent

Prompt Stuffing Is Killing Your Agent

28
Comments 5
6 min read
~1ms hybrid graph + vector queries (network is now the bottleneck)

~1ms hybrid graph + vector queries (network is now the bottleneck)

Comments
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.