DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
What I Learned Building a Lightweight Local AI Agent

What I Learned Building a Lightweight Local AI Agent

1
Comments
9 min read
Anthropic prompt caching cut our RCA cost by 90%

Anthropic prompt caching cut our RCA cost by 90%

Comments
7 min read
Day 0: The Chat Box Era and Its Limits

Day 0: The Chat Box Era and Its Limits

1
Comments
6 min read
8GB to 70B: A Real Hardware Guide for Local LLMs

8GB to 70B: A Real Hardware Guide for Local LLMs

1
Comments
9 min read
AI Observability: Logs, Prompts, Tool Calls, And Cost

AI Observability: Logs, Prompts, Tool Calls, And Cost

8
Comments
15 min read
How a Morse Code Attack Bypassed Bankr's LLM Agent: T1027 Obfuscation in the Wild

How a Morse Code Attack Bypassed Bankr's LLM Agent: T1027 Obfuscation in the Wild

Comments
11 min read
Building a Serverless AI Model Evaluation Platform on AWS

Building a Serverless AI Model Evaluation Platform on AWS

1
Comments 2
6 min read
Prompt injection through website content: how AI agents can be manipulated by the pages they visit

Prompt injection through website content: how AI agents can be manipulated by the pages they visit

Comments
4 min read
Local AI Updates: llama.cpp MTP, vLLM Gemma 4 Speeds, Ollama Coder Benchmarks

Local AI Updates: llama.cpp MTP, vLLM Gemma 4 Speeds, Ollama Coder Benchmarks

Comments
3 min read
The Hidden Math Behind AI Agents: Why GPT-4o Can Be More Expensive Than Hiring a Human

The Hidden Math Behind AI Agents: Why GPT-4o Can Be More Expensive Than Hiring a Human

Comments
1 min read
"What Codex's 'sudo workaround' actually means for production agents"

"What Codex's 'sudo workaround' actually means for production agents"

Comments
5 min read
Your RL Agent Failed a 12-Step Task. Which Step Was Wrong? (The Supervision Problem in Agentic RL)

Your RL Agent Failed a 12-Step Task. Which Step Was Wrong? (The Supervision Problem in Agentic RL)

Comments 2
5 min read
Just joined the Gemma 4 Challenge by Google AI & DEV Community!

Just joined the Gemma 4 Challenge by Google AI & DEV Community!

Comments
1 min read
Evaluating RAG Systems: Measuring Retrieval Quality, Grounding, and Hallucinations

Evaluating RAG Systems: Measuring Retrieval Quality, Grounding, and Hallucinations

Comments
3 min read
Three of my agent's API calls were Opus. My logs said "200 OK" eight times.

Three of my agent's API calls were Opus. My logs said "200 OK" eight times.

Comments
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.