DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Enhancing Hybrid Search in MongoDB: Combining RRF, Thresholds, and Weights

Enhancing Hybrid Search in MongoDB: Combining RRF, Thresholds, and Weights

3
Comments
2 min read
My Experience at Build Bengaluru 2024

My Experience at Build Bengaluru 2024

1
Comments
2 min read
Charting Your Unique Path in Generative AI: A Fresh Perspective for Beginners

Charting Your Unique Path in Generative AI: A Fresh Perspective for Beginners

Comments
3 min read
Build RAG 10X Faster

Build RAG 10X Faster

Comments
3 min read
FalkorDB has integrated with cognee to improve AI-driven knowledge retrieval

FalkorDB has integrated with cognee to improve AI-driven knowledge retrieval

Comments
1 min read
De Chatbot a Experto Industrial: Construyendo un Asistente Inteligente con Amazon Bedrock

De Chatbot a Experto Industrial: Construyendo un Asistente Inteligente con Amazon Bedrock

Comments
13 min read
Unlocking AI for Everyone: Build with RAG and Agentic RAG—No Code Needed

Unlocking AI for Everyone: Build with RAG and Agentic RAG—No Code Needed

Comments
2 min read
Understanding RAG and Long-Context LLMs: Insights from the SELF-ROUTE Hybrid Approach

Understanding RAG and Long-Context LLMs: Insights from the SELF-ROUTE Hybrid Approach

Comments
3 min read
What’s your favorite framework for building GenAI applications? (LangChain, Haystack, LlamaIndex, or others?) 🚀

What’s your favorite framework for building GenAI applications? (LangChain, Haystack, LlamaIndex, or others?) 🚀

Comments
1 min read
DeepMind at Google: Denny Zhou

DeepMind at Google: Denny Zhou

Comments
2 min read
Introducing Composio Tools| Agentic LLMs API Gateway

Introducing Composio Tools| Agentic LLMs API Gateway

Comments
3 min read
Want to start learning LLM and Generative AI? Start with Ollama and this article.

Want to start learning LLM and Generative AI? Start with Ollama and this article.

4
Comments 1
2 min read
GenAIScript - Comment Code with AI

GenAIScript - Comment Code with AI

1
Comments
5 min read
Faiss with sqlite for RAG

Faiss with sqlite for RAG

1
Comments
1 min read
Talk with your PDF documents in SharePoint

Talk with your PDF documents in SharePoint

Comments
2 min read
7 LLM Benchmarks for Performance, Capabilities, and Limitations

7 LLM Benchmarks for Performance, Capabilities, and Limitations

2
Comments
8 min read
🪤 The Chatbot Trap - Why Your LLM Project Is Stuck After the “Wow Moment"

🪤 The Chatbot Trap - Why Your LLM Project Is Stuck After the “Wow Moment"

3
Comments
4 min read
Stock Financial Analysis (Report Generation) using Generative AI - Gemini 1.5 Flash vs LLama 3.2 8b Model

Stock Financial Analysis (Report Generation) using Generative AI - Gemini 1.5 Flash vs LLama 3.2 8b Model

5
Comments
5 min read
Does Model Context Protocol (MCP) Spell the Death of RAG?

Does Model Context Protocol (MCP) Spell the Death of RAG?

Comments
4 min read
Git clone - that repo is too big : HELP!

Git clone - that repo is too big : HELP!

Comments
2 min read
Customize ChatGPT for Your Codebase : OpenAI

Customize ChatGPT for Your Codebase : OpenAI

1
Comments 2
10 min read
Why AI Agents Are Not Ready to Get Real Jobs Done — Yet

Why AI Agents Are Not Ready to Get Real Jobs Done — Yet

Comments
1 min read
Why Run LLM's /SLM's locally

Why Run LLM's /SLM's locally

1
Comments
1 min read
AI and All Data Weekly - 02 December 2024

AI and All Data Weekly - 02 December 2024

5
Comments
5 min read
Tired of AI Giving You 'Lazy Answers'? Here's Our Solution

Tired of AI Giving You 'Lazy Answers'? Here's Our Solution

1
Comments 5
3 min read
Knowledgeable Agents with FalkorDB Graph RAG

Knowledgeable Agents with FalkorDB Graph RAG

6
Comments 1
5 min read
struggling to effectively leverage graph structures in LLM-powered apps?

struggling to effectively leverage graph structures in LLM-powered apps?

1
Comments
2 min read
Multiple document conversion using Docling and a GUI

Multiple document conversion using Docling and a GUI

Comments
4 min read
The 10 Top-Rated Talks about Knowledge Graphs

The 10 Top-Rated Talks about Knowledge Graphs

Comments
2 min read
How Spring Boot and LangChain4J Enable Powerful Retrieval-Augmented Generation (RAG)

How Spring Boot and LangChain4J Enable Powerful Retrieval-Augmented Generation (RAG)

2
Comments
3 min read
User-Aligned Functions to Improve LLM-to-API Function-Calling Accuracy

User-Aligned Functions to Improve LLM-to-API Function-Calling Accuracy

Comments
11 min read
RAPTOR: A Novel Tree-Based Retrieval System for Enhancing Language Models – Research Summary

RAPTOR: A Novel Tree-Based Retrieval System for Enhancing Language Models – Research Summary

1
Comments
2 min read
My 2025 AI Engineer Roadmap List

My 2025 AI Engineer Roadmap List

82
Comments 2
4 min read
PDF RAG Demo: Building Simplified AI Workflows with Couchbase Shell

PDF RAG Demo: Building Simplified AI Workflows with Couchbase Shell

Comments
6 min read
Roadmap for Gen AI dev in 2025

Roadmap for Gen AI dev in 2025

9
Comments
3 min read
The Dark Side of VLMs: What's Really Going Wrong

The Dark Side of VLMs: What's Really Going Wrong

1
Comments 1
1 min read
Reflecting on the ORAssistant Project: A Journey of Collaboration and Technical Growth

Reflecting on the ORAssistant Project: A Journey of Collaboration and Technical Growth

1
Comments
2 min read
Semantic Search Feature

Semantic Search Feature

9
Comments
5 min read
The ultimate guide to Retrieval-Augmented Generation (RAG)

The ultimate guide to Retrieval-Augmented Generation (RAG)

24
Comments
16 min read
How to Create Your Own RAG with Free LLM Models and a Knowledge Base

How to Create Your Own RAG with Free LLM Models and a Knowledge Base

3
Comments
7 min read
First step and troubleshooting Docling — RAG with LlamaIndex on my CPU laptop

First step and troubleshooting Docling — RAG with LlamaIndex on my CPU laptop

Comments
5 min read
AI Engineer's Tool Review: Unstructured

AI Engineer's Tool Review: Unstructured

Comments
1 min read
Large Language Models (LLMs)

Large Language Models (LLMs)

Comments
1 min read
RAG Explained: Tackling the Big Problems in AI

RAG Explained: Tackling the Big Problems in AI

6
Comments
4 min read
Get Started with LangChain: A Step-by-Step Tutorial for Beginners

Get Started with LangChain: A Step-by-Step Tutorial for Beginners

6
Comments
4 min read
Function-based RAG: Extending LLMs Beyond Static Knowledge Bases

Function-based RAG: Extending LLMs Beyond Static Knowledge Bases

Comments
15 min read
Bolt.new with any LLM, you need to use it

Bolt.new with any LLM, you need to use it

2
Comments
2 min read
Building RAG-Powered Applications with LangChain, Pinecone, and OpenAI

Building RAG-Powered Applications with LangChain, Pinecone, and OpenAI

8
Comments 2
7 min read
What is Chunk Size and Chunk Overlap

What is Chunk Size and Chunk Overlap

Comments
3 min read
Extended RaBitQ: an Optimized Scalar Quantization Method

Extended RaBitQ: an Optimized Scalar Quantization Method

10
Comments
7 min read
Rethinking How We Train Customer-Facing AI Agents

Rethinking How We Train Customer-Facing AI Agents

32
Comments
1 min read
The Ghost of AI Past, Present, and Future

The Ghost of AI Past, Present, and Future

5
Comments
9 min read
PDF chat with source highlights

PDF chat with source highlights

Comments
1 min read
Why Chunk Text Before Embedding

Why Chunk Text Before Embedding

Comments
2 min read
Top 10 Real-World Applications of Artificial Intelligence to Watch in 2025

Top 10 Real-World Applications of Artificial Intelligence to Watch in 2025

Comments
5 min read
🤖 RAG vs. Agents: A Comparison and When to Use Each

🤖 RAG vs. Agents: A Comparison and When to Use Each

3
Comments
3 min read
AI Engineer's Tool Review: Guardrails AI

AI Engineer's Tool Review: Guardrails AI

Comments
1 min read
AI and All Data Weekly for 09 Dec 2024

AI and All Data Weekly for 09 Dec 2024

8
Comments 1
5 min read
Un Chatbot RAG pour explorer du contenu vidéo : une architecture event-driven et serverless sur Google Cloud

Un Chatbot RAG pour explorer du contenu vidéo : une architecture event-driven et serverless sur Google Cloud

9
Comments
7 min read
PDF Q&A Automation using LLaMA-3 Model via Groq API

PDF Q&A Automation using LLaMA-3 Model via Groq API

3
Comments
2 min read
loading...