DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
RAG for Developers — Built for Code, Not Just Text (Review Requested)

RAG for Developers — Built for Code, Not Just Text (Review Requested)

Comments
1 min read
RAG Chunking Strategies Deep Dive

RAG Chunking Strategies Deep Dive

Comments
7 min read
Stop feeding garbage to your LLM: How to get clean Markdown from Documentation

Stop feeding garbage to your LLM: How to get clean Markdown from Documentation

Comments
1 min read
My hands-on experience with Qdrant and Docling (and Ollama)

My hands-on experience with Qdrant and Docling (and Ollama)

Comments
11 min read
Inside Memcortex: A Lightweight Semantic Memory Layer for LLMs

Inside Memcortex: A Lightweight Semantic Memory Layer for LLMs

Comments
4 min read
RAG is more than Vector Search

RAG is more than Vector Search

1
Comments
4 min read
Reranking and Two-Stage Retrieval: Precision When It Matters Most

Reranking and Two-Stage Retrieval: Precision When It Matters Most

Comments
2 min read
LLMs Hallucinate. RAG Fixes That — Here’s How We Built a Reliable Healthcare AI

LLMs Hallucinate. RAG Fixes That — Here’s How We Built a Reliable Healthcare AI

Comments
3 min read
Build Better RAG Pipelines: Scraping Technical Docs to Clean Markdown

Build Better RAG Pipelines: Scraping Technical Docs to Clean Markdown

Comments
2 min read
I Built a TUI to Visualize RAG Chunking because chunk_size=1000 is a Lie 📉

I Built a TUI to Visualize RAG Chunking because chunk_size=1000 is a Lie 📉

Comments
3 min read
Why your AI assistant lies to you (and how to fix it)

Why your AI assistant lies to you (and how to fix it)

Comments
4 min read
CLaRa: Fixing RAG’s Broken Retrieval–Generation Pipeline With Shared-Space Learning

CLaRa: Fixing RAG’s Broken Retrieval–Generation Pipeline With Shared-Space Learning

Comments
3 min read
A RAG-Free Technique That Makes LLM Outputs Stable, Predictable, and Auditable

A RAG-Free Technique That Makes LLM Outputs Stable, Predictable, and Auditable

Comments
2 min read
How to Convert JSON to TOON: 4 Methods Compared

How to Convert JSON to TOON: 4 Methods Compared

Comments
4 min read
Choosing the Right RAG: Comparing the Most Common Retrieval-Augmented Generation Frameworks

Choosing the Right RAG: Comparing the Most Common Retrieval-Augmented Generation Frameworks

Comments
6 min read
RAG with MongoDB Vector Search PART 1

RAG with MongoDB Vector Search PART 1

2
Comments 1
5 min read
Fine-tuning For Domain-Customized Retriever Noise Mitigation in RAG Pipelines

Fine-tuning For Domain-Customized Retriever Noise Mitigation in RAG Pipelines

1
Comments
6 min read
Training, Decoding, and Hallucination in Large Language Models: A Deep Dive

Training, Decoding, and Hallucination in Large Language Models: A Deep Dive

1
Comments
9 min read
Beyond Vanilla RAG: The 7 Modern RAG Architectures Every AI Engineer Must Know

Beyond Vanilla RAG: The 7 Modern RAG Architectures Every AI Engineer Must Know

1
Comments
15 min read
How to Design Two Practical Orchestration Loops for LLM Agents

How to Design Two Practical Orchestration Loops for LLM Agents

7
Comments
9 min read
How (Retrieval-Augmented Generation (RAG) enhances the use of AI in Finance

How (Retrieval-Augmented Generation (RAG) enhances the use of AI in Finance

Comments
3 min read
Vector Stores for RAG Comparison

Vector Stores for RAG Comparison

Comments
7 min read
Retrieval-Augmented Generation: Connecting LLMs to Your Data

Retrieval-Augmented Generation: Connecting LLMs to Your Data

Comments
10 min read
🧠LLMs As Sensors

🧠LLMs As Sensors

5
Comments
9 min read
How Kiro’s Global Steering Turned Me Into a Solo Frankenstein Engineer

How Kiro’s Global Steering Turned Me Into a Solo Frankenstein Engineer

Comments
2 min read
loading...