DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Zero-Downtime Embedding Migration: Switching from text-embedding-004 to text-embedding-3-large in Production

Zero-Downtime Embedding Migration: Switching from text-embedding-004 to text-embedding-3-large in Production

Comments
3 min read
Hybrid RAG System over SEC Filings

Hybrid RAG System over SEC Filings

Comments
19 min read
I Added Langfuse to My RAG App and It Immediately Caught Two Bugs

I Added Langfuse to My RAG App and It Immediately Caught Two Bugs

Comments
7 min read
When CLAUDE.md Stops Working: Adding Vector Memory to Claude Code

When CLAUDE.md Stops Working: Adding Vector Memory to Claude Code

1
Comments
10 min read
Beyond SEO: Generative Engine Optimization (GEO). How to Implement `llms.txt` and RAG-Friendly Markup

Beyond SEO: Generative Engine Optimization (GEO). How to Implement `llms.txt` and RAG-Friendly Markup

1
Comments 2
5 min read
Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment

Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment

Comments
4 min read
RAG Architecture: Building AI Apps That Know Your Data" platform

RAG Architecture: Building AI Apps That Know Your Data" platform

2
Comments
10 min read
Build a Fully Autonomous RAG Agent That Pays for Its Own Compute (x402 + GPU-Bridge)

Build a Fully Autonomous RAG Agent That Pays for Its Own Compute (x402 + GPU-Bridge)

1
Comments
7 min read
RAG Components Explained: The Building Blocks of Modern AI

RAG Components Explained: The Building Blocks of Modern AI

1
Comments 1
12 min read
The Next Frontier of AI Agent Runtimes: Observability, MCP, and High-Precision RAG

The Next Frontier of AI Agent Runtimes: Observability, MCP, and High-Precision RAG

6
Comments
3 min read
Why Most RAG Systems Fail in Production (And How to Design One That Actually Works)

Why Most RAG Systems Fail in Production (And How to Design One That Actually Works)

Comments 1
4 min read
Apex B. OpenClaw, Local Embeddings.

Apex B. OpenClaw, Local Embeddings.

Comments
2 min read
When building AI chat is actually hard (how and why we built our agents)

When building AI chat is actually hard (how and why we built our agents)

Comments 1
6 min read
I Built Student Memory Into Groq Prompts Via Hindsight

I Built Student Memory Into Groq Prompts Via Hindsight

2
Comments
10 min read
Re-ranking Isn't Just Sorting Your Search Results (Anthropic Academy Part 3)

Re-ranking Isn't Just Sorting Your Search Results (Anthropic Academy Part 3)

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.