DEV Community

# rag

Retrieval-augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by retrieving relevant custom data and supplying it to the model as context.

Posts

RAG for Developers — Built for Code, Not Just Text (Review Requested)

1 min read
Why Static Load Balancing Fails for LLM Infrastructure (And What Works Instead)

7 min read
Building a RAG-Powered Documentation Assistant: Why I Used Bifrost LLM Gateway Instead of Direct API Calls

5 min read
Beyond RAG: Building an Autonomous "Epistemic Engine" to Fight AI Hallucination

2 min read
Stop feeding garbage to your LLM: How to get clean Markdown from Documentation

1 min read
My hands-on experience with Qdrant and Docling (and Ollama)

11 min read
RAG-Augmented Agile Story Generation: An Architectural Framework for LLM-Powered Backlog Automation

8 min read
Building a Simple RAG System Using FAISS

3 min read
Reranking and Two-Stage Retrieval: Precision When It Matters Most

2 min read
LLMs Hallucinate. RAG Fixes That — Here’s How We Built a Reliable Healthcare AI

3 min read
Multi-Tenant Design for Bedrock Knowledge Base: Solving the Account Limit with Metadata Filtering

3 min read
I Built a TUI to Visualize RAG Chunking because chunk_size=1000 is a Lie 📉

3 min read
Create a Knowledge Base in Amazon Bedrock (Step-by-Step Console Guide)

6 min read
Why your AI assistant lies to you (and how to fix it)

4 min read
How to Build a Scalable RAG-Based Chatbot on AWS?

8 min read
CLaRa: Fixing RAG’s Broken Retrieval–Generation Pipeline With Shared-Space Learning

3 min read
A RAG-Free Technique That Makes LLM Outputs Stable, Predictable, and Auditable

2 min read
Course: Large Language Models and Generative AI for NLP — 2025

1 min read
Inside Memcortex: A Lightweight Semantic Memory Layer for LLMs

4 min read
The RAG Illusion: Why PostgreSQL Beats Vector Search for Most AI Applications

10 min read
Choosing the Right RAG: Comparing the Most Common Retrieval-Augmented Generation Frameworks

6 min read
Vector Dimensions, Cosine Similarity, Dot Product — and Why Your Distance Metric Silently Ruins Relevance

2 min read
Fine-tuning For Domain-Customized Retriever Noise Mitigation in RAG Pipelines

6 min read
Training, Decoding, and Hallucination in Large Language Models: A Deep Dive

9 min read
Beyond Vanilla RAG: The 7 Modern RAG Architectures Every AI Engineer Must Know

15 min read