DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
What It Actually Takes to Run a RAG System in Production

What It Actually Takes to Run a RAG System in Production

Comments
2 min read
I Built an AI Ops Assistant That Replaced $2,400/month in SaaS — In 3 Days, for Under $100/month

I Built an AI Ops Assistant That Replaced $2,400/month in SaaS — In 3 Days, for Under $100/month

12
Comments 2
9 min read
Building a Production RAG Server with Ollama, Open WebUI and Chroma DB

Building a Production RAG Server with Ollama, Open WebUI and Chroma DB

Comments
1 min read
How to Create a RAG-Powered Chatbot with LangChain and PostgreSQL

How to Create a RAG-Powered Chatbot with LangChain and PostgreSQL

2
Comments 1
12 min read
Medicine Encyclopedia 2.0: Stop Guessing and Start Scanning with Multimodal RAG

Medicine Encyclopedia 2.0: Stop Guessing and Start Scanning with Multimodal RAG

1
Comments
4 min read
From Paper Trails to Health Insights: Building a Personal EHR Semantic Search Engine with Hybrid Search

From Paper Trails to Health Insights: Building a Personal EHR Semantic Search Engine with Hybrid Search

1
Comments
4 min read
Stop Paying for APIs: Build a 100% Local AI Auditor with Python & Llama 3

Stop Paying for APIs: Build a 100% Local AI Auditor with Python & Llama 3

Comments
4 min read
Reducing OCR Cost in RAG Pipelines with Page-Level Detection

Reducing OCR Cost in RAG Pipelines with Page-Level Detection

Comments
1 min read
Building AI Agents That Don't Hallucinate: A Practical Architecture Guide

Building AI Agents That Don't Hallucinate: A Practical Architecture Guide

1
Comments
4 min read
Why your Production Retreival-Augmented-Generation (RAG) is failing and how to fix it?

Why your Production Retreival-Augmented-Generation (RAG) is failing and how to fix it?

3
Comments
4 min read
Playground to test Open-Source LLMs in action (GPT-OSS, Qwen3.5, DeepSeek) with Tools and RAG [Free and No signup]

Playground to test Open-Source LLMs in action (GPT-OSS, Qwen3.5, DeepSeek) with Tools and RAG [Free and No signup]

21
Comments 2
1 min read
I Built an MCP Server to Search Documentation from Claude (So You Don't Have to Web Search)

I Built an MCP Server to Search Documentation from Claude (So You Don't Have to Web Search)

1
Comments 1
3 min read
RAG vs Fine-Tuning: What I Actually Learned After 6 Months of Building LLM Apps

RAG vs Fine-Tuning: What I Actually Learned After 6 Months of Building LLM Apps

1
Comments 1
7 min read
How I Built a Local-First AI Stack for Document Q&A Without OpenAI

How I Built a Local-First AI Stack for Document Q&A Without OpenAI

2
Comments
22 min read
Why every RAG project I've built ends up fighting the pipeline — and what I'm doing about it

Why every RAG project I've built ends up fighting the pipeline — and what I'm doing about it

Comments 1
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.