DEV Community

# llm

Posts

Cutting MCP Tool-Call Token Costs by 50%+ with Code Mode

1
Comments
7 min read
Why I Built TokenBar: Most AI Bills Are a Visibility Problem, Not a Billing Problem

Comments
2 min read
The Modular Mind

Comments
3 min read
I Ran a 2-Billion Parameter AI Model in a Browser Tab. No Server.

2
Comments 1
14 min read
I tried LangSmith, Langfuse, Helicone, and Phoenix — here's what each gets wrong

Comments
2 min read
Free AI-SEO score: see if ChatGPT, Claude, Gemini cite your site

1
Comments 1
2 min read
Most AI bills are a visibility problem, not a billing problem

Comments
2 min read
Vector Retrieval Quietly Replaced Keyword Match, and the SEO Stack Did Not Notice

1
Comments
16 min read
How to Lock Down an AI Agent Before It Goes Rogue

Comments 2
4 min read
How to Install Ollama on Linux and Windows: Complete Setup Guide

Comments
3 min read
Nine Search Backends, Nine Different Webs. Why AI Citations Diverge for the Same Query.

Comments
16 min read
Streaming SSE Proxying for LLM APIs: The Hard Parts

Comments
5 min read
The AI goblin problem: what GPT-5.5’s weird training bug tells us about alignment

2
Comments
4 min read
Building a Dependency Resolver ft. VibeCodeArena

Comments
5 min read
Gemma 4 GGUF Benchmarks, Open-Source Voice AI Platform, Qwen3.6 vs. Gemma4 Comparison

Comments
3 min read