DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
The gap between detecting hallucinations and handling them

The gap between detecting hallucinations and handling them

2
Comments
2 min read
Harness Engineering: The Infrastructure Layer That Makes AI Agents Actually Work

Harness Engineering: The Infrastructure Layer That Makes AI Agents Actually Work

2
Comments 2
9 min read
How We Solved the Hidden Problem of Cheap LLMs

How We Solved the Hidden Problem of Cheap LLMs

3
Comments 2
8 min read
Sonnet 4.6 vs Haiku 4.5 vs Opus 4.6: I Tested 3 Claude Models on 10 Real Tasks

Sonnet 4.6 vs Haiku 4.5 vs Opus 4.6: I Tested 3 Claude Models on 10 Real Tasks

Comments
3 min read
Model Showdown: Benchmarking Local vs Cloud LLMs on a Real Coding Task

Model Showdown: Benchmarking Local vs Cloud LLMs on a Real Coding Task

1
Comments
14 min read
Why I Built TokenBar: AI Spend Should Not Be a Monthly Surprise

Why I Built TokenBar: AI Spend Should Not Be a Monthly Surprise

Comments
1 min read
How Top Companies Are Shipping AI Agents Today (Apr 15)

How Top Companies Are Shipping AI Agents Today (Apr 15)

Comments
3 min read
I built an open-source LLM eval framework as a BCA student — hallucination detection, red-teaming, regression tracking

I built an open-source LLM eval framework as a BCA student — hallucination detection, red-teaming, regression tracking

Comments
1 min read
🦊 GoClaw Deep Dive 🤖 — A Builder's Guide to a Multi-Tenant AI Agent Platform 📘

🦊 GoClaw Deep Dive 🤖 — A Builder's Guide to a Multi-Tenant AI Agent Platform 📘

5
Comments
23 min read
The day I realized AI costs need a warning light

The day I realized AI costs need a warning light

Comments
2 min read
🤖 nanobot: A Comprehensive Build-Your-Own Guide 📚

🤖 nanobot: A Comprehensive Build-Your-Own Guide 📚

7
Comments
18 min read
I was tired of losing track of my AI conversations, so I built a Chrome extension

I was tired of losing track of my AI conversations, so I built a Chrome extension

5
Comments
4 min read
Boosting llama.cpp with Auto-Tuning, Qwen Quantization Benchmarks, & Mobile Ollama AI Servers

Boosting llama.cpp with Auto-Tuning, Qwen Quantization Benchmarks, & Mobile Ollama AI Servers

Comments
3 min read
SafePaths: How We Reduced Token Consumption by 85% — The Benchmark Story

SafePaths: How We Reduced Token Consumption by 85% — The Benchmark Story

Comments
2 min read
I Built an AI Agent to Do My Pre-Refinement. It Turned Into a Mirror of How We Wrote Tickets.

I Built an AI Agent to Do My Pre-Refinement. It Turned Into a Mirror of How We Wrote Tickets.

1
Comments
10 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.