DEV Community

# llm

Posts

Day 47: Model Compression for Deployment

11
Comments
2 min read
Day 26: Learning Rate Schedules

Comments
2 min read
Testing LLM Speed Across Cloud Providers: Groq, Cerebras, AWS & More

11
Comments 2
2 min read
Monitoring and Improving AI Model Performance with Handit.AI

Comments
4 min read
Boost Your Retrieval-Augmented Generation (RAG) with Vector Databases 🚀

9
Comments
1 min read
RAG Implementation with LangChain

7
Comments 1
9 min read
Announcing the MagicAPI AI Gateway: The Fastest AI Proxy for Developers!

2
Comments
2 min read
Text-to-SQL: Creating Embeddings with Nebius AI Studio (part 1)

3
Comments 1
4 min read
LLM APIs vs. Self-Hosted Models: Finding the Best Fit for Your Business Needs

9
Comments
6 min read
AI in Manufacturing: Transforming the Future of Production

5
Comments
7 min read
Understanding LLM Errors: What They Are and How to Address Them

Comments
4 min read
Day 46: Adversarial Attacks on LLMs

5
Comments
2 min read
Overcoming LLM Testing Challenges with Pytest and Trulens: Ensuring Reliable Responses

1
Comments
8 min read
Detecting Hallucinations in LLMs with Discrete Semantic Entropy and Perplexity

4
Comments
3 min read
Quick Paper Overview: More Agents Is All You Need

Comments
2 min read
AI in Education: Transforming Learning Experiences

6
Comments
6 min read
Local LLMs: The Future of Private AI Computing? A Complete Guide for 2024

6
Comments 5
3 min read
My first LLM dialog box project

6
Comments
1 min read
Making An LLM A Data Analysis Intern (Who Even Likes Reading Sustainability Reports!)

1
Comments
14 min read
Run Llama 3 Locally

Comments
2 min read
The Demise Of Human Coding, Rise Of AI, And Why It's Good For Devs Too

Comments
4 min read
AI in Healthcare: Transforming Patient Care and Operational Efficiency

5
Comments
5 min read
Retrieval-Augmented Generation

Comments
6 min read
Run LangTrace – Open Source Observability Tool for LLM Applications

11
Comments 2
10 min read
Generative AI in Video: Transforming Content Creation

6
Comments
6 min read
Comprehensive Guide to the Capabilities and Applications of Large Language Models (LLMs)

Comments
12 min read
Day 22: Distributed Training in Large Language Models

Comments
3 min read
Day 43: Evaluation Metrics for LLMs

2
Comments
3 min read
Noema – A Declarative AI Programming Library

Comments
2 min read
Fine-Tuning vs. Retrieval-Augmented Generation (RAG): Enhancing LLMs for Specific Tasks

6
Comments
2 min read
Fine-Tuning LLM: Transforming General Models into Specialized Experts

5
Comments
2 min read
Enhancing Language Models with Retrieval-Augmented Generation (RAG)

5
Comments
3 min read
Limitations of Large Language Models: Unpacking the Challenges

5
Comments
3 min read
Build a Competitive Intelligence Tool Powered by AI

79
Comments
9 min read
Ethical Considerations in LLM Development and Deployment

1
Comments
2 min read
The Surprising Benefits of Smaller Language Models

Comments
5 min read
What are we even doing

23
Comments 10
3 min read
Building Enterprise-Level Data Analysis Agent: Architecture Design and Implementation

5
Comments
9 min read
Building an Intelligent Customer Service Agent System from Scratch

5
Comments
5 min read
Agent Task Orchestration System: From Design to Production

Comments
4 min read
Building an Agent Tool Management Platform: A Practical Architecture Guide

Comments
3 min read
Ollama and Web-LLM: Building Your Own Local AI Search Assistant

13
Comments
8 min read
Using DSPy(COPRO) to refine prompt instructions

1
Comments
1 min read
Building Enterprise-Level Agent Systems: Core Component Design and Optimization

Comments
5 min read
Building Enterprise Agent Systems: Core Component Design and Optimization

Comments
4 min read
Exploring the Exciting Possibilities of NVIDIA Megatron LM: A Fun and Friendly Code Walkthrough with PyTorch & NVIDIA Apex!

Comments
5 min read
Day 36: Text Classification with LLMs

1
Comments
2 min read
Comprehensive list of dev tools for an AI Engineer

6
Comments 2
1 min read
How Small Language Models Are Redefining AI Efficiency

5
Comments 1
4 min read
Understanding the Evolution of Word Representation: Static vs. Dynamic Embeddings

4
Comments
2 min read
Building a Simple Chatbot with Llama2 [Chat with Excel]

2
Comments 1
3 min read
How to Simplify Your AI Development Process using Docker AI Catalog

Comments
6 min read
Day 39: Summarization with LLMs

2
Comments
2 min read
Local RAG in Microsoft Word: using AnythingLLM + LM Studio

3
Comments 1
3 min read
Unlock the Power of Meta Llama LLM: Easy Guide to Hosting in Your Local Dev Environment

45
Comments 10
4 min read
Edge Computing and Large Language Models (LLMs): What’s the Connection?

Comments
8 min read
Think Smarter, Not Harder: Meet RAG

Comments
6 min read
Innovation Graph Analytics Powered by Embeddings and LLM’s

6
Comments 1
7 min read
Claude MCP

3
Comments
1 min read
How to build a RAG model from scratch?

1
Comments
6 min read