DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Local LLM Inference on Windows 11 and AMD GPU using WSL and llama.cpp

Local LLM Inference on Windows 11 and AMD GPU using WSL and llama.cpp

1
Comments
3 min read
Best Ways to Monitor Claude Code Token Usage and Costs in 2026

Best Ways to Monitor Claude Code Token Usage and Costs in 2026

Comments 1
6 min read
I built a “deterministic” LLM text rephraser with a validation pipeline - looking for architectural feedback

I built a “deterministic” LLM text rephraser with a validation pipeline - looking for architectural feedback

Comments
3 min read
Why I Replaced LangChain with 15KB of httpx

Why I Replaced LangChain with 15KB of httpx

Comments
6 min read
One command to add structured markup to your AI agent

One command to add structured markup to your AI agent

23
Comments 2
4 min read
I built memory decay for AI agents using the Ebbinghaus forgetting curve

I built memory decay for AI agents using the Ebbinghaus forgetting curve

24
Comments 2
2 min read
How to Design LLM Applications for Production: A System Design Guide

How to Design LLM Applications for Production: A System Design Guide

Comments
6 min read
OpenTelemetry for LLM Applications: A Practical Guide with LaunchDarkly and Langfuse

OpenTelemetry for LLM Applications: A Practical Guide with LaunchDarkly and Langfuse

1
Comments 1
14 min read
Realtime steering: interrupt, barge-in, redirect, and guide the AI

Realtime steering: interrupt, barge-in, redirect, and guide the AI

1
Comments
4 min read
Teaching an AI to Play Dwarf Fortress: The Idea

Teaching an AI to Play Dwarf Fortress: The Idea

Comments 2
9 min read
Building an AI Visibility Monitoring Tool: A Developer's Guide to Tracking LLM Citations

Building an AI Visibility Monitoring Tool: A Developer's Guide to Tracking LLM Citations

1
Comments
9 min read
Building an Autonomous AI Agent for a Social Network: Lessons from the Chaos

Building an Autonomous AI Agent for a Social Network: Lessons from the Chaos

Comments
5 min read
Caching Strategies for LLM Systems (Part 3): Multi-Query Attention and Memory-Efficient Decoding

Caching Strategies for LLM Systems (Part 3): Multi-Query Attention and Memory-Efficient Decoding

Comments
5 min read
LocalAI QuickStart: Run OpenAI-Compatible LLMs Locally

LocalAI QuickStart: Run OpenAI-Compatible LLMs Locally

1
Comments
9 min read
Building in Public: CV Analyzer - Closure

Building in Public: CV Analyzer - Closure

1
Comments
1 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.