DEV Community

# llm

Posts

Cut Claude Code Token Usage by Delegating to Cheaper Models with Boss Mode

4 min read
Why ChatGPT will silently lie about your bank statement (and how to catch it)

4 min read
MCP in Production Reality vs the Spec

3 min read
Who Owns the Code Claude Wrote? The Legal Mess No One's Talking About

3 min read
Building a Multi-Agent Travel Planner: From a One-Sentence Prompt to a Validated, Budget-Aware Itinerary

8 min read
How Much VRAM Do You *Actually* Need for Local LLMs?

2 min read
Building Reliable AI Systems: Why Prompting Isn’t Enough

3 min read
Achieving Maximum Throughput on vLLM with a Single RTX 3090: A Production Guide for 7B LLMs

4 min read
DeepSeek-V4 is Here, and Yes — 1M Context Is Finally for Everyone

5 min read
The Agentic AI Revolution: What's Actually Happening in April 2026

2 min read
Stop Getting Rate-Limited: Building Bulletproof LLM API Consumption Patterns

3 min read
I switched from OpenAI to z.ai for AI coding review and I'm genuinely happy with it: honest review

3 min read
SimCore: I built a social simulation engine where LLM agents live on a real map of your city

1 min read
7 Platforms That Turn Agent Evals Into RL Training Data

8 min read
Local LLMs & Multimodal: Qwen GGUF, Nemotron-3-Nano-Omni, MiMo V2.5-Pro Released

3 min read