DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
How /search and /ask Work: Local Hybrid RAG with ChromaDB + SQLite FTS5

How /search and /ask Work: Local Hybrid RAG with ChromaDB + SQLite FTS5

Comments
10 min read
Building RAG Assistant: A I Built a Desktop RAG Chatbot From Scratch — Here's Everything I Learned

Building RAG Assistant: A I Built a Desktop RAG Chatbot From Scratch — Here's Everything I Learned

1
Comments
7 min read
🔬 Direction 1 closure on JAMES — when the hypothesis fails but the data turns "7-tier monotonic natural-stop gradient"

Gemma 4 Challenge: Write about Gemma 4 Submission

🔬 Direction 1 closure on JAMES — when the hypothesis fails but the data turns "7-tier monotonic natural-stop gradient"

1
Comments
2 min read
Google Just Made Gemma 4 Feel Like a Beta Test. Here's the Real Upgrade.

Gemma 4 Challenge: Write about Gemma 4 Submission

Google Just Made Gemma 4 Feel Like a Beta Test. Here's the Real Upgrade.

1
Comments
3 min read
RAG Series (13): Query Optimization — Asking Better Questions

RAG Series (13): Query Optimization — Asking Better Questions

Comments
6 min read
Building a RAG Chat System: From Zero to Production in Building This Blog: A Production AI Platform

Building a RAG Chat System: From Zero to Production in Building This Blog: A Production AI Platform

Comments
6 min read
Why “Local Document AI” Is Really an OCR + RAG + Local Inference Problem

Why “Local Document AI” Is Really an OCR + RAG + Local Inference Problem

5
Comments
4 min read
No More Hallucinated Citations: A Domain-Specific RAG System with Ollama, ChromaDB and AI Agents

No More Hallucinated Citations: A Domain-Specific RAG System with Ollama, ChromaDB and AI Agents

Comments
8 min read
AI Agents & Python Workflows: Anthropic Skills, Jupyter Challenges, and Edge Deployment

AI Agents & Python Workflows: Anthropic Skills, Jupyter Challenges, and Edge Deployment

Comments
3 min read
SQLite FTS5 won't tokenize Chinese — here's the 7-line bigram fix that did

SQLite FTS5 won't tokenize Chinese — here's the 7-line bigram fix that did

Comments 1
4 min read
Chunking Strategies for LLM Applications: A Practical Guide to Better RAG Systems

Chunking Strategies for LLM Applications: A Practical Guide to Better RAG Systems

1
Comments
4 min read
Local LLMs on Mobile, Enterprise Code Gen Workflows, & Production AI Cost Management

Local LLMs on Mobile, Enterprise Code Gen Workflows, & Production AI Cost Management

Comments
3 min read
Beyond Keywords: Mastering HyDE for Smarter Retrieval đź§ 

Beyond Keywords: Mastering HyDE for Smarter Retrieval đź§ 

Comments
4 min read
Part 3: Types of RAG

Part 3: Types of RAG

Comments
3 min read
Part 2: RAG Architecture: How Retrieval-Augmented Generation Actually Works

Part 2: RAG Architecture: How Retrieval-Augmented Generation Actually Works

Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.