DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
What MCP Really Is — A Demo You Can Run on Your Laptop in 5 Minutes

What MCP Really Is — A Demo You Can Run on Your Laptop in 5 Minutes

Comments
10 min read
Building AI-Powered Apps for Free in 2026 — The Complete Guide

Building AI-Powered Apps for Free in 2026 — The Complete Guide

Comments
2 min read
Local LLM vs Gemini API — Cost, Quality, Privacy Compared (2026)

Local LLM vs Gemini API — Cost, Quality, Privacy Compared (2026)

Comments
2 min read
About Sharing Local Inference: A Marketplace for Renting Idle GPUs with an OpenAI-Compatible Backend

About Sharing Local Inference: A Marketplace for Renting Idle GPUs with an OpenAI-Compatible Backend

Comments
8 min read
Cut Your AI Agent Token Costs by 75% With One Skill Plugin

Cut Your AI Agent Token Costs by 75% With One Skill Plugin

Comments
2 min read
Why most LLM API usage is quietly inefficient

Why most LLM API usage is quietly inefficient

Comments
4 min read
Qwen sky proof: compressed memory made a tiny model behave better — with the receipts

Qwen sky proof: compressed memory made a tiny model behave better — with the receipts

Comments
1 min read
We Built an AI CFO With $30B in Connected Assets. The Secret Was a Filesystem.

We Built an AI CFO With $30B in Connected Assets. The Secret Was a Filesystem.

3
Comments 1
7 min read
The 8B Model That Punches at 32B Weight

The 8B Model That Punches at 32B Weight

Comments
2 min read
Hermes Agent CLI cheat sheet — commands, flags, and slash shortcuts

Hermes Agent CLI cheat sheet — commands, flags, and slash shortcuts

1
Comments
8 min read
AI Isn’t “Inspired” by Human Writing. It Is Built on Unpaid Intellectual Labor

AI Isn’t “Inspired” by Human Writing. It Is Built on Unpaid Intellectual Labor

2
Comments
7 min read
The "Chat" API is a Token Tax: Why we must return to Stateless Completions

The "Chat" API is a Token Tax: Why we must return to Stateless Completions

Comments
2 min read
Behavioral Annotations: Why readonly and destructive guide LLM Planning

Behavioral Annotations: Why readonly and destructive guide LLM Planning

Comments
3 min read
KODA Format: A Schema-First Data Format to Reduce LLM Token Usage ( 40%)

KODA Format: A Schema-First Data Format to Reduce LLM Token Usage ( 40%)

1
Comments 1
3 min read
The AI Agent Destroyed Its Mail Server to Keep a Secret

The AI Agent Destroyed Its Mail Server to Keep a Secret

Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.