DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
pgvector vs. pgvecto.rs in 2024: A Comprehensive Comparison for Vector Search in PostgreSQL

pgvector vs. pgvecto.rs in 2024: A Comprehensive Comparison for Vector Search in PostgreSQL

10
Comments
7 min read
The Potential Future of Large Action Models (LAMs) Looks Insane: A Quick Glimpse Through Tony Stark and Nelima

The Potential Future of Large Action Models (LAMs) Looks Insane: A Quick Glimpse Through Tony Stark and Nelima

Comments
6 min read
Top-Trending LLMs Over the Last Week

Top-Trending LLMs Over the Last Week

2
Comments
4 min read
Use cases for Langchain in your business

Use cases for Langchain in your business

5
Comments
6 min read
What is RAG? A quick 101

What is RAG? A quick 101

9
Comments
3 min read
Running Local LLMs, CPU vs. GPU - a Quick Speed Test

Running Local LLMs, CPU vs. GPU - a Quick Speed Test

32
Comments 21
3 min read
All About Google Gemma

All About Google Gemma

Comments
2 min read
Limitations of Running AI Agents Locally

Limitations of Running AI Agents Locally

3
Comments
3 min read
Make the OpenAI Function Calling Work Better and Cheaper with a Two-Step Function Call 🚀

Make the OpenAI Function Calling Work Better and Cheaper with a Two-Step Function Call 🚀

6
Comments 3
4 min read
Generate your docstrings automatically with zero-docs

Generate your docstrings automatically with zero-docs

1
Comments
2 min read
The Rise of the 1-Bit LLM

The Rise of the 1-Bit LLM

11
Comments 5
19 min read
Building an Advanced Streamlit Chatbot with OpenAI Integration: A Comprehensive Guide - Part 3

Building an Advanced Streamlit Chatbot with OpenAI Integration: A Comprehensive Guide - Part 3

1
Comments
14 min read
Open Source Day 2024

Open Source Day 2024

1
Comments
5 min read
Reduce your LLM costs by 10x using semantic caching

Reduce your LLM costs by 10x using semantic caching

2
Comments
2 min read
(Easier) Root Cause Analysis of the Failure

(Easier) Root Cause Analysis of the Failure

2
Comments
2 min read
Supercharging LLM Training with Groq and LPUs

Supercharging LLM Training with Groq and LPUs

1
Comments
21 min read
LLMs for Text-to-SQL problems: the benchmark vs real-world performance

LLMs for Text-to-SQL problems: the benchmark vs real-world performance

2
Comments
8 min read
AI Runner Dev Update: Text-to-speech, LLM and Stable Diffusion interruptions

AI Runner Dev Update: Text-to-speech, LLM and Stable Diffusion interruptions

1
Comments
1 min read
Unleashing the Power of Similarity Search: Top 5 Vector Databases for AI Applications

Unleashing the Power of Similarity Search: Top 5 Vector Databases for AI Applications

17
Comments 1
3 min read
Real-time text to speech conversation about friends and Ray Bradbury with my computer

Real-time text to speech conversation about friends and Ray Bradbury with my computer

1
Comments
2 min read
Building a WhatsApp generative AI assistant with Amazon Bedrock and Python

Building a WhatsApp generative AI assistant with Amazon Bedrock and Python

8
Comments
8 min read
How does the Groq's LPU work?

How does the Groq's LPU work?

Comments
7 min read
FLaNK 04 March 2024

FLaNK 04 March 2024

6
Comments
6 min read
How to setup your own ChatGPT with OpenLLaMA

How to setup your own ChatGPT with OpenLLaMA

Comments
1 min read
Generative AI in QuickSight

Generative AI in QuickSight

Comments
2 min read
Large Language Models: Library Overview for Training, Fine-Tuning, Intererence and More

Large Language Models: Library Overview for Training, Fine-Tuning, Intererence and More

5
Comments 1
6 min read
FLaNK Stack 29 Jan 2024

FLaNK Stack 29 Jan 2024

5
Comments
6 min read
Uncovering Generative Artificial Intelligence and LLMs: A Brief Introduction

Uncovering Generative Artificial Intelligence and LLMs: A Brief Introduction

Comments
2 min read
Prompt Engineering for OpenAI Chat Completions

Prompt Engineering for OpenAI Chat Completions

Comments
7 min read
The Ultimate Guide to ML Model Deployment

The Ultimate Guide to ML Model Deployment

Comments
8 min read
AI Runner multimodal preview

AI Runner multimodal preview

Comments
1 min read
OLLAMA with AMD GPU (ROCm)

OLLAMA with AMD GPU (ROCm)

Comments
2 min read
AI Runner multimodal preview

AI Runner multimodal preview

Comments
1 min read
Boost Your Productivity with Walles.AI: A Comprehensive Guide to Efficient Task Management

Boost Your Productivity with Walles.AI: A Comprehensive Guide to Efficient Task Management

3
Comments
3 min read
Exploring the World of LLMs for SRE Powered by PartyRock (Claude, Jurassic-2, Titan, Command, Liama 2 & Stable Diffusion XL)

Exploring the World of LLMs for SRE Powered by PartyRock (Claude, Jurassic-2, Titan, Command, Liama 2 & Stable Diffusion XL)

6
Comments
7 min read
Large Language Models: Modern Gen4 LLM Overview (LLaMA, Pythia, PaLM2 and More)

Large Language Models: Modern Gen4 LLM Overview (LLaMA, Pythia, PaLM2 and More)

Comments
12 min read
Deploy Your Own AI Chat Buddy - The Qwen Chat Model Deployment with HuggingFace Guide

Deploy Your Own AI Chat Buddy - The Qwen Chat Model Deployment with HuggingFace Guide

1
Comments 1
8 min read
Async AI Workflows with Graph Theory

Async AI Workflows with Graph Theory

6
Comments
2 min read
Are LLM's essentially Teenagers?

Are LLM's essentially Teenagers?

12
Comments 1
6 min read
Understanding RAG: A Deeper Dive into the Fusion of Retrieval and Generation

Understanding RAG: A Deeper Dive into the Fusion of Retrieval and Generation

56
Comments 7
4 min read
What is cosine similarity, and how is it useful for text embeddings?

What is cosine similarity, and how is it useful for text embeddings?

Comments
4 min read
How do you know that an LLM-generated response is factually correct? 🤔

How do you know that an LLM-generated response is factually correct? 🤔

7
Comments
2 min read
Build knowledge graphs with LLM-driven entity extraction

Build knowledge graphs with LLM-driven entity extraction

1
Comments
3 min read
What's new in txtai 7.0

What's new in txtai 7.0

1
Comments
6 min read
Advanced RAG with graph path traversal

Advanced RAG with graph path traversal

1
Comments
6 min read
LLM Evaluation Metrics for Labeled Data

LLM Evaluation Metrics for Labeled Data

Comments
5 min read
Add Generative AI to a JavaScript Web App

Add Generative AI to a JavaScript Web App

5
Comments 1
10 min read
Incorpora IA generativa a una aplicaciĂłn web de JavaScript

Incorpora IA generativa a una aplicaciĂłn web de JavaScript

11
Comments
11 min read
I know you want to build AI applications too : LangChain

I know you want to build AI applications too : LangChain

14
Comments 2
10 min read
Adding Embeddings to a Phoenix App

Adding Embeddings to a Phoenix App

1
Comments
2 min read
Rocket League BotChat powered by TensorRT-LLM: My submission for NVIDIA's Generative AI on RTX PCs Developer Contest

Rocket League BotChat powered by TensorRT-LLM: My submission for NVIDIA's Generative AI on RTX PCs Developer Contest

Comments
16 min read
Fine-Tuning LlaMa 2 on Custom Data!

Fine-Tuning LlaMa 2 on Custom Data!

13
Comments 3
6 min read
Using the Ollama API to run LLMs and generate responses locally

Using the Ollama API to run LLMs and generate responses locally

17
Comments
2 min read
Memory in LLM agents

Memory in LLM agents

9
Comments
14 min read
Choose Your Own Coding Assistant

Choose Your Own Coding Assistant

39
Comments 15
5 min read
Using LLM, Postgres VectorDB, and OpenAI to Perform Semantic Search on PDF Documents

Using LLM, Postgres VectorDB, and OpenAI to Perform Semantic Search on PDF Documents

Comments
2 min read
Installing LLMs locally using Ollama - Beginner's guide

Installing LLMs locally using Ollama - Beginner's guide

1
Comments 2
2 min read
Step-by-step Guidelines for Integrating GPT in Your Project: Create an API for Anything Using LangChain and FastAPI

Step-by-step Guidelines for Integrating GPT in Your Project: Create an API for Anything Using LangChain and FastAPI

Comments
3 min read
Deploying HuggingFace Chat UI with the Hugging Face Text Generation Inference Server

Deploying HuggingFace Chat UI with the Hugging Face Text Generation Inference Server

Comments
3 min read
#SemanticKernel – 📎Chat Service demo running Phi-2 LLM locally with #LMStudio

#SemanticKernel – 📎Chat Service demo running Phi-2 LLM locally with #LMStudio

7
Comments
3 min read
loading...