DEV Community

# rag

Retrieval-augmented generation (RAG) is an architectural approach that can improve large language model (LLM) applications by retrieving relevant custom data and supplying it to the model as context.

Posts

The Hidden Magic Behind Search: Dense, Sparse, and Metadata Filtering

Comments
3 min read
Building a Decentralized AI Chatbot with MimirLLM: A Step-by-Step Tutorial

2
Comments
4 min read
Are LLMs Really Doomed?

26
Comments
3 min read
Weather App With State Management for Long Running Conversations Using AI Agents

2
Comments
2 min read
Tutorial: LangChain, Milvus, Anthropic Claude 3 Haiku, and voyage-3-large RAG Chatbot

1
Comments
8 min read
Simplifying RAG Pipelines: The Story Behind iQ Suite

Comments
2 min read
Running locally DeepSeek-R1 for RAG

1
Comments
4 min read
Best Practices for Production-Scale RAG Systems — An Implementation Guide

1
Comments
12 min read
Let’s Build Enterprise Cybersecurity Risk Assessment Using AI Agents

1
Comments
2 min read
The Role of Augmented Reality in Manufacturing: Applications and Advantages

5
Comments
1 min read
RAG Chatbot Tutorial: LangChain, Milvus, GPT-4o mini, and text-embedding-3-large

7
Comments
4 min read
NoLiMA: GPT-4o achieve 99.3% accuracy in short contexts (<1K tokens), performance degrades to 69.7% at 32K tokens.

6
Comments 1
1 min read
Building an IBM AIX Expert Chatbot using RAG and FAISS

Comments
3 min read
Building Local AI Agents: A Practical Guide to Frameworks and Deployment

2
Comments
6 min read
Building a RAG-Powered Support Chatbot in 24 Hours of Hackathon

28
Comments 12
9 min read
Building CSV RAG with Rig and Rust 🔥🔥🔥

4
Comments
6 min read
How to Build a Vector Database with SQLite in Node.js For LLM's.

11
Comments 2
7 min read
How Generative AI is Revolutionizing Financial Institutions in 2025: Top 10 Use Cases

Comments
7 min read
GraphRAG: Augmenting Retrieval-Augmented Generation with Knowledge Graphs

Comments
4 min read
Create an agent and build a deployable notebook from it in watsonx.ai — Part 2

Comments
10 min read
Understanding and Implementing ReAct

1
Comments
4 min read
Ingesting documents using .NET to build a simple Retrieval Augmented Generation (RAG) system

3
Comments 1
6 min read
Evaluate your LLM! Ok, but what's next? 🤷‍♂️

9
Comments
1 min read
R.I.P. RAG? Gemini Flash 2.0 Might Just Have Revolutionized AI (Again) - Is Retrieval Augmented Generation Obsolete?

2
Comments
5 min read
Evaluation as a Business Imperative: The Survival Guide for Large Model Application Development

Comments
5 min read