DEV Community

# llm

Posts

I Tracked Every Token I Spent on LLM APIs for a Month — Here's What I Learned

Comments
2 min read
Reasoning Models Burn Tokens Filling Gaps You Left in Your Prompt

Comments 2
7 min read
I Built a Memory Layer for My AI Agents That Fixed the Context Forgetting Problem

Comments
2 min read
How Hindsight Exposed Our Keyword-Matching Chatbot Limits

1
Comments
5 min read
Equipping AIs with: V1 Enrich Legal (ETL-D API)

Comments
2 min read
Circuit breaker for LLM provider failure

Comments
5 min read
Stop Writing AI Agent Prompts Like It's 2023: The Framework That Makes Your OpenClaw Agent Actually Work

Comments
4 min read
I Built an LLM Drift Detector — It Caught GPT-4o Changing Behaviour in Production

Comments
2 min read
How to Monitor AI Agent Drift in Production

Comments
6 min read
A Quick Note on Gemma 4 Image Settings in Llama.cpp

1
Comments
2 min read
The LLM Monitoring Stack I Run in Production (It's 3 Tools, $50/mo)

Comments
2 min read
Running Gemma 4 Locally on an iPhone 13 Pro with Swift

Comments 1
2 min read
How to Add LLM Drift Monitoring to Your CI/CD Pipeline in 10 Minutes

Comments
2 min read
Why GenAI Isn't Ready for Prime Time

Comments
9 min read
Things You're Overengineering in Your AI Agent (The LLM Already Handles Them)

Tool naming tips and guardrail debates

40
Comments 25
6 min read