DEV Community

# rag

Retrieval-augmented generation (RAG) is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

Compound AI Systems: How I Connect Multiple Models in a Single Production Product

2 min read
Introducing Citai: the RAG engine I built across 6 articles, which you can now try for free (Spanish-language post)

6 min read
Introducing Citai: the RAG engine I built across 6 articles — now free to try

6 min read
Why Your LLM Ignores Detailed Instructions (It's Not a Bug)

2 min read
Most GenAI chatbot tutorials stop at “call an LLM, get an answer.”

1 min read
🚀 Beyond RAG: Simulating the Future with MiroFish

2 min read
Perfect Retrieval Recall on LongMemEval — Running Fully Local

Comments 1
4 min read
AI Products Break on the Data Layer — Not on the Next Model Release

Comments 2
5 min read
I Ran 500 More Agent Memory Experiments. The Real Problem Wasn’t Recall. It Was Binding.

Rigor beyond happy-path testing

Comments 29
14 min read
Beyond Vector Search: Building a Clause Forest (FoC) Architecture for Financial RAG

7 min read
🧠 Streaming LLM APIs Can Quietly Give Free Tokens

1 min read
Building RAG & Knowledge Bases with seekdb: Three Paths, One Stack

4 min read
How I caught a silent NaN bug in production RAG, by asking the system to debug itself

6 min read
Measuring RAG vs. Fine-tuning ROI for Agent Knowledge

9 min read
Stop Wasting Days on RAG Setup: How uv + pyseekdb Cut Your Development Time by 90%

5 min read