DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Local LLM Inference on Windows 11 and AMD GPU using WSL and llama.cpp

Local LLM Inference on Windows 11 and AMD GPU using WSL and llama.cpp

2
Comments
3 min read
I Got Tired of Googling kubectl Commands at 2 AM. So I Built a Local AI Agent That Does DevOps Safely. { pip install orbit-cli }

I Got Tired of Googling kubectl Commands at 2 AM. So I Built a Local AI Agent That Does DevOps Safely. { pip install orbit-cli }

2
Comments 1
8 min read
The Competition Over "Which AI Model Is Smartest" Is Over.

The Competition Over "Which AI Model Is Smartest" Is Over.

2
Comments 1
5 min read
Monitor LLM Inference in Production (2026): Prometheus & Grafana for vLLM, TGI, llama.cpp

Monitor LLM Inference in Production (2026): Prometheus & Grafana for vLLM, TGI, llama.cpp

1
Comments 1
9 min read
Standard AI Conversation Portability Does Not Exist Yet: Here Is Why That Should Bother You

Standard AI Conversation Portability Does Not Exist Yet: Here Is Why That Should Bother You

Comments 1
5 min read
Mastering Mistral API Costs: A Deep Dive into Pricing Models and Optimization Strategies

Mastering Mistral API Costs: A Deep Dive into Pricing Models and Optimization Strategies

Comments
6 min read
Private AI for Customer Support: Building LLM Helpdesks That Don’t Leak Customer Data

Private AI for Customer Support: Building LLM Helpdesks That Don’t Leak Customer Data

Comments 1
11 min read
Fine-tuning vs RAG: When to Use Each Approach for Production LLMs

Fine-tuning vs RAG: When to Use Each Approach for Production LLMs

Comments 1
8 min read
Software 3.1? - AI Functions

Software 3.1? - AI Functions

34
Comments 6
13 min read
Fine-tuning vs RAG: When to Use Each Approach for Production LLMs

Fine-tuning vs RAG: When to Use Each Approach for Production LLMs

Comments 1
8 min read
Building Production-Ready RAG Applications with Vector Databases

Building Production-Ready RAG Applications with Vector Databases

Comments 1
3 min read
A/B Testing LLM Systems

A/B Testing LLM Systems

1
Comments 1
7 min read
Scaling AI Memory: How I Tamed a 120k-Token Prompt with Deterministic GraphRAG

Scaling AI Memory: How I Tamed a 120k-Token Prompt with Deterministic GraphRAG

2
Comments
5 min read
Developing with Claude Code? You should be using Checkmate!

Developing with Claude Code? You should be using Checkmate!

Comments
3 min read
LangChain vs CrewAI vs AnythingLLM: ¿Qué Framework Deberías Elegir en 2026?

LangChain vs CrewAI vs AnythingLLM: ¿Qué Framework Deberías Elegir en 2026?

2
Comments
9 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.