DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Your RAG Agent Is Retrieving the Wrong Chunk: 5 Failure Modes We Fix in Production

Your RAG Agent Is Retrieving the Wrong Chunk: 5 Failure Modes We Fix in Production

Comments
4 min read
Building a Fully-Local Research RAG on 2 GTX 1080 Ti + an RTX 3090 — 3 Gotchas

Building a Fully-Local Research RAG on 2 GTX 1080 Ti + an RTX 3090 — 3 Gotchas

Comments 4
5 min read
WearEdge Pro: An OPEA Manufacturing Five-Agent Suite for Frontline Operators

WearEdge Pro: An OPEA Manufacturing Five-Agent Suite for Frontline Operators

Comments
2 min read
What Happens Inside an LLM During Inference: Tokens, KV Cache, and GPU Execution Explained

What Happens Inside an LLM During Inference: Tokens, KV Cache, and GPU Execution Explained

Comments
12 min read
Long-Context Models Killed RAG. Except for the 6 Cases Where They Made It Worse.

Long-Context Models Killed RAG. Except for the 6 Cases Where They Made It Worse.

Comments
8 min read
Local LLM for Claude Code, AI Workflow Orchestration, and MLOps Deployment Patterns

Local LLM for Claude Code, AI Workflow Orchestration, and MLOps Deployment Patterns

Comments
3 min read
IBM RAG & Agentic AI Certificate: What I Can Build Now

IBM RAG & Agentic AI Certificate: What I Can Build Now

1
Comments
6 min read
NoteMeld: A Self-hosted AI Knowledge Base and Wiki Workspace

NoteMeld: A Self-hosted AI Knowledge Base and Wiki Workspace

Comments
1 min read
Why Your RAG Pipeline Is Failing in Production (And How to Fix It)

Why Your RAG Pipeline Is Failing in Production (And How to Fix It)

1
Comments
11 min read
Vector‑native RAG on Oracle: embeddings, HNSW/IVF, and hybrid search under database governance

Vector‑native RAG on Oracle: embeddings, HNSW/IVF, and hybrid search under database governance

1
Comments
12 min read
From Problems to Patterns: Generative AI in .Net (C#)

From Problems to Patterns: Generative AI in .Net (C#)

Comments
3 min read
MCP Server LLM Orchestration, GSD-Redux Automation, & DE for AI Production

MCP Server LLM Orchestration, GSD-Redux Automation, & DE for AI Production

Comments
4 min read
When Should You Use Text2Cypher in a GraphRAG Pipeline

When Should You Use Text2Cypher in a GraphRAG Pipeline

Comments
4 min read
Build a Local RAG AI App with Ollama, Mistral, and Node.js

Build a Local RAG AI App with Ollama, Mistral, and Node.js

Comments
7 min read
Graph RAG vs Vector RAG: When to Use Each

Graph RAG vs Vector RAG: When to Use Each

Comments
4 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.