DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
What is RAG? A quick 101

What is RAG? A quick 101

9
Comments
3 min read
Starcoder2 vs Gemma:2b LLM

Starcoder2 vs Gemma:2b LLM

Comments
1 min read
All About Google Gemma

All About Google Gemma

Comments
2 min read
Make the OpenAI Function Calling Work Better and Cheaper with a Two-Step Function Call 🚀

Make the OpenAI Function Calling Work Better and Cheaper with a Two-Step Function Call 🚀

13
Comments 3
4 min read
Navigating the Future with AI Copilots: A Comprehensive Guide

Navigating the Future with AI Copilots: A Comprehensive Guide

1
Comments 1
12 min read
Generate your docstrings automatically with zero-docs

Generate your docstrings automatically with zero-docs

1
Comments
2 min read
The Rise of the 1-Bit LLM

The Rise of the 1-Bit LLM

11
Comments 5
19 min read
Automate Email Newsletter with Lyzr-Automata

Automate Email Newsletter with Lyzr-Automata

Comments
2 min read
Building an Advanced Streamlit Chatbot with OpenAI Integration: A Comprehensive Guide - Part 3

Building an Advanced Streamlit Chatbot with OpenAI Integration: A Comprehensive Guide - Part 3

22
Comments 1
14 min read
I'm making a huge database of recent LLM papers and repos

I'm making a huge database of recent LLM papers and repos

Comments
1 min read
Open Source Day 2024

Open Source Day 2024

1
Comments
5 min read
LLMs on your local Computer (Part 1)

LLMs on your local Computer (Part 1)

3
Comments
20 min read
GenLearn - Your Personalized Learning Assistant!

GenLearn - Your Personalized Learning Assistant!

8
Comments
5 min read
What are LLMs, Local LLMs and RAG?

What are LLMs, Local LLMs and RAG?

1
Comments
7 min read
Supercharging LLM Training with Groq and LPUs

Supercharging LLM Training with Groq and LPUs

5
Comments
21 min read
LLMs for Text-to-SQL problems: the benchmark vs real-world performance

LLMs for Text-to-SQL problems: the benchmark vs real-world performance

2
Comments
8 min read
AI Runner Dev Update: Text-to-speech, LLM and Stable Diffusion interruptions

AI Runner Dev Update: Text-to-speech, LLM and Stable Diffusion interruptions

1
Comments
1 min read
Unleashing the Power of Similarity Search: Top 5 Vector Databases for AI Applications

Unleashing the Power of Similarity Search: Top 5 Vector Databases for AI Applications

17
Comments 1
3 min read
Building a WhatsApp generative AI assistant with Amazon Bedrock and Python

Building a WhatsApp generative AI assistant with Amazon Bedrock and Python

25
Comments
8 min read
Real-time text to speech conversation about friends and Ray Bradbury with my computer

Real-time text to speech conversation about friends and Ray Bradbury with my computer

1
Comments
2 min read
How does the Groq's LPU work?

How does the Groq's LPU work?

Comments
7 min read
FLaNK 04 March 2024

FLaNK 04 March 2024

11
Comments 1
6 min read
How to setup your own ChatGPT with OpenLLaMA

How to setup your own ChatGPT with OpenLLaMA

2
Comments
1 min read
Generative AI in QuickSight

Generative AI in QuickSight

Comments
2 min read
Large Language Models: Library Overview for Training, Fine-Tuning, Intererence and More

Large Language Models: Library Overview for Training, Fine-Tuning, Intererence and More

5
Comments 1
6 min read
FLaNK Stack 29 Jan 2024

FLaNK Stack 29 Jan 2024

5
Comments
6 min read
Uncovering Generative Artificial Intelligence and LLMs: A Brief Introduction

Uncovering Generative Artificial Intelligence and LLMs: A Brief Introduction

Comments
2 min read
Training data poisoning to get what you want in LLMs, A Question

Training data poisoning to get what you want in LLMs, A Question

Comments
2 min read
What are Small language Models?

What are Small language Models?

1
Comments
10 min read
The problem plaguing LLMOps and Usage: Prompt and Vendor lock-ins

The problem plaguing LLMOps and Usage: Prompt and Vendor lock-ins

Comments
6 min read
Matryoshka Embeddings: The new kind of efficient embeddings

Matryoshka Embeddings: The new kind of efficient embeddings

1
Comments
13 min read
The Ultimate Guide to ML Model Deployment

The Ultimate Guide to ML Model Deployment

Comments
8 min read
AI Runner preview: improvement to real time voice chat

AI Runner preview: improvement to real time voice chat

Comments
1 min read
Extraction Matters Most

Extraction Matters Most

Comments
6 min read
OLLAMA with AMD GPU (ROCm)

OLLAMA with AMD GPU (ROCm)

4
Comments
2 min read
AI Runner multimodal preview

AI Runner multimodal preview

Comments
1 min read
AI Runner multimodal preview

AI Runner multimodal preview

Comments
1 min read
Deploy Mistral Large to Azure and create a conversation with Python and LangChain

Deploy Mistral Large to Azure and create a conversation with Python and LangChain

3
Comments
5 min read
Exploring the World of LLMs for SRE Powered by PartyRock (Claude, Jurassic-2, Titan, Command, Liama 2 & Stable Diffusion XL)

Exploring the World of LLMs for SRE Powered by PartyRock (Claude, Jurassic-2, Titan, Command, Liama 2 & Stable Diffusion XL)

7
Comments
7 min read
i built a CmdK widget, but it has very good AI too

i built a CmdK widget, but it has very good AI too

Comments
1 min read
Large Language Models: Modern Gen4 LLM Overview (LLaMA, Pythia, PaLM2 and More)

Large Language Models: Modern Gen4 LLM Overview (LLaMA, Pythia, PaLM2 and More)

1
Comments
12 min read
Deploy Your Own AI Chat Buddy - The Qwen Chat Model Deployment with HuggingFace Guide

Deploy Your Own AI Chat Buddy - The Qwen Chat Model Deployment with HuggingFace Guide

1
Comments 1
8 min read
Async AI Workflows with Graph Theory

Async AI Workflows with Graph Theory

6
Comments
2 min read
LoRA: A Breakdown of Low Rank Adaptation for Finetuning Large Models

LoRA: A Breakdown of Low Rank Adaptation for Finetuning Large Models

Comments
2 min read
Are LLM's essentially Teenagers?

Are LLM's essentially Teenagers?

12
Comments 1
6 min read
Understanding RAG: A Deeper Dive into the Fusion of Retrieval and Generation

Understanding RAG: A Deeper Dive into the Fusion of Retrieval and Generation

63
Comments 7
4 min read
How do you know that an LLM-generated response is factually correct? 🤔

How do you know that an LLM-generated response is factually correct? 🤔

7
Comments
2 min read
Evaluating LLM Models for Production Systems: Methods and Practices

Evaluating LLM Models for Production Systems: Methods and Practices

Comments
2 min read
Google Gemma first try

Google Gemma first try

Comments
3 min read
Build knowledge graphs with LLM-driven entity extraction

Build knowledge graphs with LLM-driven entity extraction

5
Comments
3 min read
Advanced RAG with graph path traversal

Advanced RAG with graph path traversal

1
Comments
6 min read
💡 What's new in txtai 7.0

💡 What's new in txtai 7.0

1
Comments
6 min read
LLM Evaluation Metrics for Labeled Data

LLM Evaluation Metrics for Labeled Data

1
Comments
5 min read
Add Generative AI to a JavaScript Web App

Add Generative AI to a JavaScript Web App

5
Comments 1
10 min read
Incorpora IA generativa a una aplicación web de JavaScript

Incorpora IA generativa a una aplicación web de JavaScript

14
Comments
11 min read
Creating an AI BLOGGER with Lyzr, LlamaIndex, Perplexity, GPT4

Creating an AI BLOGGER with Lyzr, LlamaIndex, Perplexity, GPT4

Comments 1
1 min read
Prompt Engineering for OpenAI Chat Completions

Prompt Engineering for OpenAI Chat Completions

8
Comments
7 min read
Adding Embeddings to a Phoenix App

Adding Embeddings to a Phoenix App

4
Comments
2 min read
I know you want to build AI applications too : LangChain

I know you want to build AI applications too : LangChain

10
Comments 4
10 min read
Rocket League BotChat powered by TensorRT-LLM: My submission for NVIDIA's Generative AI on RTX PCs Developer Contest

Rocket League BotChat powered by TensorRT-LLM: My submission for NVIDIA's Generative AI on RTX PCs Developer Contest

Comments
16 min read
loading...