DEV Community

# rag

Retrieval-augmented generation (RAG) is an architectural approach that can improve the efficacy of large language model (LLM) applications by retrieving relevant custom data and supplying it to the model as context.

Posts

Docling Speaks LaTeX: Unlocking Academic and Scientific Documents

23 min read
Show DEV: PardusDB – The "SQLite of Vector DBs" written in Rust

1 min read
Chatting with 3 Billion Base Pairs: Building a RAG Index for Your Personal Genome (WGS)

4 min read
What’s Actually Making Your LLM Costs Skyrocket?

2 min read
Scaling RAG : Demo to Production Ready

2 min read
Are We Over-Engineering LLM Stacks Too Early?

1 comment
2 min read
What It Actually Takes to Run a RAG System in Production

2 min read
Building a Production RAG Server with Ollama, Open WebUI and Chroma DB

1 min read
Medicine Encyclopedia 2.0: Stop Guessing and Start Scanning with Multimodal RAG

4 min read
From Paper Trails to Health Insights: Building a Personal EHR Semantic Search Engine with Hybrid Search

4 min read
Stop Paying for APIs: Build a 100% Local AI Auditor with Python & Llama 3

4 min read
Reducing OCR Cost in RAG Pipelines with Page-Level Detection

1 min read
Building AI Agents That Don't Hallucinate: A Practical Architecture Guide

4 min read
Building a Chat Assistant Using Elasticsearch as a Vector Database

1 min read
Building a RAG-Based AI Chatbot Backend with Node.js (Serverless)

2 min read