DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Vectors vs. Keywords: Why "Close Enough" is Dangerous in MedTech RAG

Vectors vs. Keywords: Why "Close Enough" is Dangerous in MedTech RAG

Comments
3 min read
You Don’t Need a Vector Database to Build RAG (Yet): A ~$1/Month DynamoDB Pipeline

You Don’t Need a Vector Database to Build RAG (Yet): A ~$1/Month DynamoDB Pipeline

1
Comments
10 min read
8 RAG Patterns You Should Stop Ignoring

8 RAG Patterns You Should Stop Ignoring

1
Comments
15 min read
10 Best Practices to Manage Unstructured Data for Enterprises

10 Best Practices to Manage Unstructured Data for Enterprises

Comments
8 min read
Self-Hosting Cognee: LLM Performance Tests

Self-Hosting Cognee: LLM Performance Tests

Comments
9 min read
Clone Your CTO: The Architecture of an 'AI Twin' (DSPy + Unsloth)

Clone Your CTO: The Architecture of an 'AI Twin' (DSPy + Unsloth)

Comments
3 min read
When Language-Agnostic Design Helps — and When It Complicates

When Language-Agnostic Design Helps — and When It Complicates

Comments
3 min read
Build a Production-Ready AI Document Brain: A No-Nonsense Guide to RAG SaaS

Build a Production-Ready AI Document Brain: A No-Nonsense Guide to RAG SaaS

1
Comments
4 min read
RAG & Semantic Search

RAG & Semantic Search

7
Comments 1
7 min read
One Year of Model Context Protocol: From Experiment to Industry Standard

One Year of Model Context Protocol: From Experiment to Industry Standard

Comments
3 min read
TOON vs JSON en RAG (Java): el Grinch de los formatos cuando cada token cuenta 🎁

TOON vs JSON en RAG (Java): el Grinch de los formatos cuando cada token cuenta 🎁

Comments
7 min read
The Research: MiniMax M2.1 (The "Linear" Revolution)

The Research: MiniMax M2.1 (The "Linear" Revolution)

Comments
3 min read
Deploying Scalable LLM Tools via Remote MCP on Kubernetes

Deploying Scalable LLM Tools via Remote MCP on Kubernetes

Comments
10 min read
Creacion de una base de conocimiento en Bedrock con Amazon OpenSearch Service.

Creacion de una base de conocimiento en Bedrock con Amazon OpenSearch Service.

11
Comments 1
3 min read
How to Build a Triple-Failover RAG with Gemini, Llama 3, and Groq for LegalTech

How to Build a Triple-Failover RAG with Gemini, Llama 3, and Groq for LegalTech

1
Comments
2 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.