DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
I fine-tuned an LLM for a client, then told them not to use it

I fine-tuned an LLM for a client, then told them not to use it

Comments
5 min read
Why AI Agents Need Memory (And Why This Might Be the Biggest Missing Piece in Today's AI)

Why AI Agents Need Memory (And Why This Might Be the Biggest Missing Piece in Today's AI)

1
Comments
4 min read
How do you know an LLM answer is actually grounded — not just plausible? I measured it across 7 models and 4 regulated domains

How do you know an LLM answer is actually grounded — not just plausible? I measured it across 7 models and 4 regulated domains

Comments
2 min read
# Enterprise RAG’s Biggest Risk: Answers That Look Correct but Aren’t

# Enterprise RAG’s Biggest Risk: Answers That Look Correct but Aren’t

Comments
7 min read
RAG reranking for production agents: four approaches, four failure modes

RAG reranking for production agents: four approaches, four failure modes

Comments
11 min read
I Work in Healthcare Tech. Here's Why I Built a RAG Tool for Clinical Documents.

I Work in Healthcare Tech. Here's Why I Built a RAG Tool for Clinical Documents.

Comments
5 min read
Your RAG Pipeline Is Bleeding Tokens. We Cut 86% Without Losing Accuracy.

Your RAG Pipeline Is Bleeding Tokens. We Cut 86% Without Losing Accuracy.

1
Comments
3 min read
How I built a RAG-grounded Discord brain in 5 weeks (solo, ESL, no funding)

How I built a RAG-grounded Discord brain in 5 weeks (solo, ESL, no funding)

Comments 1
8 min read
Agentic Support Agent for a Platform Teams

Agentic Support Agent for a Platform Teams

Comments
6 min read
Building a persistent AI business assistant with LangChain, FastAPI, and Redis

Building a persistent AI business assistant with LangChain, FastAPI, and Redis

Comments
1 min read
Gemma 4 12B Multimodal, AI Copilot Selection, & AI-Optimized Documentation Strategies

Gemma 4 12B Multimodal, AI Copilot Selection, & AI-Optimized Documentation Strategies

Comments
3 min read
AI.Insaf (@ai_tablet) — Полный архив постов канала

AI.Insaf (@ai_tablet) — Полный архив постов канала

Comments
4 min read
AI.Insaf — Архив постов канала (реальные посты из web_fetch)

AI.Insaf — Архив постов канала (реальные посты из web_fetch)

Comments
4 min read
RAG with OpenAI Embeddings, pgvector and LangChain

RAG with OpenAI Embeddings, pgvector and LangChain

Comments
3 min read
How to Cheat LLM Context: A Lightweight AI Doc Assistant Architecture

How to Cheat LLM Context: A Lightweight AI Doc Assistant Architecture

2
Comments
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.