DEV Community

# rag

Retrieval-augmented generation (RAG) is an architectural approach that can improve the efficacy of large language model (LLM) applications by retrieving relevant custom data and supplying it to the model as context.
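As a rough illustration of the description above, here is a minimal sketch of the retrieve-then-augment step. It is a toy, not any particular library's API: the `DOCUMENTS` list, the word-overlap `score` function, and the prompt wording are all assumptions made for the example. Real systems typically rank documents by embedding similarity and then send the prompt to an LLM, which this sketch leaves out.

```python
import re
from collections import Counter

# Hypothetical custom data; a real application would load its own documents.
DOCUMENTS = [
    "Our support desk is open Monday to Friday, 9:00 to 17:00.",
    "Refunds are processed within 14 days of a return request.",
    "The mobile app supports offline mode on Android and iOS.",
]

def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())

def score(query, document):
    """Count the words shared by the query and the document (toy relevance score)."""
    shared = Counter(tokenize(query)) & Counter(tokenize(document))
    return sum(shared.values())

def retrieve(query, documents, k=1):
    """Return the k documents with the highest overlap score."""
    ranked = sorted(documents, key=lambda doc: score(query, doc), reverse=True)
    return ranked[:k]

def build_prompt(query, documents, k=1):
    """Place the retrieved text ahead of the question, ready to send to an LLM."""
    context = "\n".join(retrieve(query, documents, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

if __name__ == "__main__":
    print(build_prompt("How long do refunds take?", DOCUMENTS))
```

The point of the sketch is the ordering: retrieval happens first, and the model only sees the question together with whatever was retrieved.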

Posts

The 2024 State of RAG Podcast (1:01:43)

16
Comments 3
1 min read
RAG Explained: Generation of Embeddings


2
Comments
2 min read
LLM Models and RAG Applications Step-by-Step - Part I - Introduction


Comments
16 min read
LLM Models and RAG Applications Step-by-Step - Part II - Creating the Context


1
Comments
19 min read
LLM Models and RAG Applications Step-by-Step - Part III - Searching and Injecting Context


Comments
11 min read
Creating a Simple RAG in Python with AzureOpenAI and LlamaIndex


2
Comments
2 min read
What is RAG in AI? How It Combines Retrieval with Generation for Accurate Results


3
Comments
4 min read
Generative Audio


1
Comments
8 min read
RAG vs. Fine-Tuning: Which Is Best for Enhancing LLMs?


14
Comments 1
6 min read
Create Your Own AI RAG Chatbot: A Python Guide with LangChain


26
Comments 9
7 min read
The Bug That Once Stopped the World


Comments
2 min read
OpenRAG: An Open-Source GenAI Application to Supercharge Data Queries with Large Language Models


Comments
3 min read
How OpenAI o1 works in a simple way and why it matters for RAG and Agentic 🤯


Comments
6 min read
Time Waits for No Document: 5 ways to speed up your work


56
Comments 7
5 min read
Chunking in AI - The Secret Sauce You're Missing


5
Comments
6 min read
RAG Explained: Ingestion of Data


1
Comments
3 min read
Building a Document QA with Streamlit & OpenAI


13
Comments 4
4 min read
Swiftide 0.12 - Hybrid Search, search filters, parquet loader, and a giant speed bump


Comments
1 min read
Doing Multihop on HotPotQA Using Qwen 2.5 72B


5
Comments
5 min read
🚀 Introduction to Building AI-Powered Apps with Streamlit and FastAPI


7
Comments 2
7 min read
GraphRAG Local Setup via Ollama: Pitfalls Prevention Guide


Comments
19 min read
ColBERT Live! Makes Your Vector Database Smarter


1
Comments
8 min read
The Best Database for Retrieval-Augmented Generation (RAG): Choosing the Right Solution


6
Comments
5 min read
How to Build Smarter AI Apps and Reduce Hallucinations with RAG


16
Comments
4 min read
5 Powerful Techniques to Slash Your LLM Costs


1
Comments
1 min read
Unveiling the Magic Behind Autonomous AI Agents


1
Comments
3 min read
A New Reliable AI Tool for Developers


Comments
1 min read
Will Retrieval Augmented Generation (RAG) Be Killed by Long-Context LLMs?


Comments
9 min read
Has anyone worked with embeddings generation, and open to helping me with it?


Comments
1 min read
Swiftide 0.9, a Rust native library for building LLM applications with RAG, brings Fluvio, Lancedb and Ragas support


Comments
3 min read
Introducing Hexabot: Your 100% Open-Source Chatbot Solution (06:09)

63
Comments
2 min read
Hexabot Setup & Visual Editor Tutorial: Build Your First AI Chatbot (06:28)

9
Comments
1 min read
Speech to Speech RAG


5
Comments 2
4 min read
Llama 3.2 Vision(11B vision-instruct model) in Kaggle: A Step-by-Step Guide


24
Comments
3 min read
RAG Explained: Introduction, improving LLMs


2
Comments
2 min read
Debunking 6 common pgvector myths


4
Comments
9 min read
Building a simple RAG agent with LlamaIndex


9
Comments
3 min read
Pre and Post Filtering in Vector Search with Metadata and RAG Pipelines


1
Comments
5 min read
AI Assistant for Company-Wide Software Best Practices with Gemini, LlamaIndex & RAG


5
Comments
5 min read
AI: What is RAG ?


Comments
2 min read
Hill climbing generative AI problems: When ground truth values are expensive to obtain & launching fast is important


Comments
5 min read
Easiest Way to Build a RAG AI Agent Application


15
Comments
6 min read
Learn How to Build AI Agents & Chatbots with LangGraph!


28
Comments
3 min read
Intro to Ollama: Running LLMs Locally


1
Comments
2 min read
Understanding the Knowledge Graph: A Deep Dive into Its Benefits and Applications


3
Comments
3 min read
Unleash the Power of RAG - Building Intelligent Apps with Chroma and Gemini Pro


Comments
12 min read
How I Built ‘University Course Finder’ Using RAG


1
Comments
2 min read
Milvus Adventures August 19, 2024


Comments
3 min read
RAGEval: Scenario-specific RAG evaluation dataset generation framework


1
Comments
8 min read
RAG Simplified!! 🐣


70
Comments 25
6 min read
Understanding RAG (Part 5): Recommendations and wrap-up


3
Comments 1
9 min read
Rag Architecture Easy Explained


5
Comments
3 min read
From Notebook to Serverless: Creating a Multimodal Search Engine with Amazon Bedrock and PostgreSQL


5
Comments
3 min read
Context Caching: Is It the End of Retrieval-Augmented Generation (RAG)? 🤔


2
Comments
3 min read
Desplegando una Aplicación de Embeddings Serverless con AWS CDK, Lambda y Amazon Aurora PostgreSQL (English: Deploying a Serverless Embeddings Application with AWS CDK, Lambda, and Amazon Aurora PostgreSQL)

5
Comments
6 min read
Optimizing RAG Context: Chunking and Summarization for Technical Docs


1
Comments
20 min read
Dockerize Local RAG with Models


5
Comments
3 min read
AI-Powered Bot using Vectorized knowledge Architecture


1
Comments
4 min read
Construyendo un Motor de Búsqueda Multimodal con Amazon Titan Embeddings, Aurora Serveless PostgreSQL y LangChain (English: Building a Multimodal Search Engine with Amazon Titan Embeddings, Aurora Serverless PostgreSQL, and LangChain)

2
Comments
4 min read
De Notebook a Serverless: Creando un Motor de Búsqueda Multimodal con Amazon Bedrock y PostgreSQL (English: From Notebook to Serverless: Creating a Multimodal Search Engine with Amazon Bedrock and PostgreSQL)

1
Comments
4 min read