DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Circuit Breakers for LLM APIs: Applying SRE Patterns to AI Infrastructure

Circuit Breakers for LLM APIs: Applying SRE Patterns to AI Infrastructure

Comments
6 min read
Fazendo um LLM do Zero — Sessão 06: Dando uma Profissão ao Modelo (Fine-Tuning) 🎯👨‍⚕️

Fazendo um LLM do Zero — Sessão 06: Dando uma Profissão ao Modelo (Fine-Tuning) 🎯👨‍⚕️

Comments
4 min read
LLM Router Benchmark: 46 Models, 8 Providers, Sub-1ms Routing

LLM Router Benchmark: 46 Models, 8 Providers, Sub-1ms Routing

Comments
9 min read
How to Train a Small Language Model: The Complete Guide for 2026

How to Train a Small Language Model: The Complete Guide for 2026

1
Comments
9 min read
How to Implement Prompt Caching on Amazon Bedrock and Cut Inference Costs in Half

How to Implement Prompt Caching on Amazon Bedrock and Cut Inference Costs in Half

Comments
12 min read
Best LLM Monitoring Tools for 2026

Best LLM Monitoring Tools for 2026

3
Comments 1
13 min read
Per-customer LLM cost attribution for multi-step agents (LangGraph, CrewAI)

Per-customer LLM cost attribution for multi-step agents (LangGraph, CrewAI)

Comments 1
2 min read
Securing AI-Powered Applications: A Comprehensive Guide to Protecting Your LLM-Integrated Web App

Securing AI-Powered Applications: A Comprehensive Guide to Protecting Your LLM-Integrated Web App

Comments
8 min read
AI Agent API Costs: How ClawRouter Cuts LLM Spending by 500x

AI Agent API Costs: How ClawRouter Cuts LLM Spending by 500x

Comments
8 min read
How I ran LLM + RAG fully offline on Android using MNN

How I ran LLM + RAG fully offline on Android using MNN

Comments
3 min read
5 Agent Design Patterns Every Developer Needs to Know in 2026

5 Agent Design Patterns Every Developer Needs to Know in 2026

2
Comments
15 min read
Giving Your AI the Right Context with Model Context Protocol (MCP)

Giving Your AI the Right Context with Model Context Protocol (MCP)

3
Comments
4 min read
Hermes Agent: Honest Review

Hermes Agent: Honest Review

3
Comments 1
4 min read
Your AI agent is wasting 90% of its tokens on field names"

Your AI agent is wasting 90% of its tokens on field names"

Comments
7 min read
Fundamental matters more in AI era

Fundamental matters more in AI era

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.