DEV Community

# llm

Posts

AI Infrastructure on Consumer Hardware

5
Comments
9 min read
Model Collapse: The AI Feedback Loop Problem Nobody Wants to Talk About

Comments
5 min read
RAG is more than Vector Search

1
Comments
4 min read
Create Your First MCP App

1
Comments
6 min read
Understanding the Role of a Context Engineer

Comments
19 min read
2025 Complete Guide: In-Depth Analysis of ERNIE-4.5-VL-28B-A3B-Thinking Multimodal AI Model

Comments
12 min read
Master RAG Evaluation with RAGAS

1
Comments
3 min read
Base LLMs vs Instruction-Tuned LLMs: Understanding the Architecture Behind ChatGPT and Claude

Comments
3 min read
Build Better RAG Pipelines: Scraping Technical Docs to Clean Markdown

Comments 1
2 min read
Creating Personal AI Agents in Multiplayer Games with LoRA Adapters: An Efficient and Memory-Saving Solution

5
Comments
4 min read
TOON: Token-Oriented Object Notation – A Complete Guide for LLM Data Efficiency

Comments 1
3 min read
Deploying NVIDIA Dynamo & LMCache for LLMs: Installation, Containers, and Integration

4
Comments 2
2 min read
The Prompting Trick That Fixed My AI Image Generation

8
Comments
7 min read
🏠 Self-Hosted AI Code Generation: The Complete Guide to Building Your Private AI Coding Assistant

14
Comments
6 min read
The Poetic Hack: Exploiting LLMs with Verse by Arvind Sundararajan

Comments
2 min read
From 16-bit to 4-bit: The Architecture for Scalable Personalized LLM Deployment

5
Comments
6 min read
New AI web standards and scraping trends in 2026: rethinking robots.txt

60
Comments 1
5 min read
Context-Optimized APIs: Designing MCP Servers for LLMs

1
Comments
5 min read
Your Primary LLM Provider Failed? Enable Automatic Fallback with Bifrost

6
Comments
4 min read
Local LLM Hosting: Complete 2025 Guide - Ollama, vLLM, LocalAI, Jan, LM Studio & More

1
Comments
19 min read
Top 5 LiteLLM Alternatives in 2025

6
Comments
17 min read
A Financial MCP server with multi-provider orchestration (Open Source)

Comments
1 min read
How to Cut Your AI API Costs: Six Proven Strategies

1
Comments
5 min read
Utilizing RAG Techniques for Improved AI Agent Performance

Comments
8 min read
We built an LLM gateway 50x faster than LiteLLM (and it's open source)

1
Comments
3 min read