DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
How Stripe, Shopify, and Airbnb Build AI Harnesses

How Stripe, Shopify, and Airbnb Build AI Harnesses

Comments
3 min read
79% on LongMemEval: How We Beat Full-Context GPT-4 with a Local SQLite Database

79% on LongMemEval: How We Beat Full-Context GPT-4 with a Local SQLite Database

1
Comments 1
9 min read
Anthropic plugs into SpaceX's 220,000-GPU Colossus — and doubles Claude's rate limits

Anthropic plugs into SpaceX's 220,000-GPU Colossus — and doubles Claude's rate limits

1
Comments
3 min read
I ran local LLMs on my phone for a week, and now my desktop setup feels like overkill

I ran local LLMs on my phone for a week, and now my desktop setup feels like overkill

8
Comments 1
1 min read
I Trained an LLM on 75K of My Own Messages So It Would Stop Writing Like a Chatbot

I Trained an LLM on 75K of My Own Messages So It Would Stop Writing Like a Chatbot

Comments
8 min read
Claude Code Chose a Stock Ticker Over Someone's Life. We Investigated.

Claude Code Chose a Stock Ticker Over Someone's Life. We Investigated.

Comments 1
10 min read
Kong AI Gateway vs TrueFoundry: the honest version of this comparison

Kong AI Gateway vs TrueFoundry: the honest version of this comparison

2
Comments
7 min read
tierKV: A Distributed KV Cache That Makes Evicted Blocks Faster to Restore Than GPU Cache Hits

tierKV: A Distributed KV Cache That Makes Evicted Blocks Faster to Restore Than GPU Cache Hits

1
Comments
3 min read
Why I Built My Own AI Project Management Assistant – and What I Learned

Why I Built My Own AI Project Management Assistant – and What I Learned

Comments
4 min read
Turning Production Incidents Into Testing Postmortems — With a Local LLM and No API Key

Turning Production Incidents Into Testing Postmortems — With a Local LLM and No API Key

Comments
6 min read
Stop Guessing Your RAG Quality: Automating Faithfulness Metrics with Spring AI and LLM-as-a-Judge

Stop Guessing Your RAG Quality: Automating Faithfulness Metrics with Spring AI and LLM-as-a-Judge

Comments
2 min read
The Complete Guide to Running LLMs Locally in 2026: From Ollama to Production

The Complete Guide to Running LLMs Locally in 2026: From Ollama to Production

Comments
8 min read
What Is Generative UI? (And Why Text Output Is No Longer Enough)

What Is Generative UI? (And Why Text Output Is No Longer Enough)

1
Comments
9 min read
Why AI Code Review Tools Keep Commenting on Lines That Don’t Exist

Why AI Code Review Tools Keep Commenting on Lines That Don’t Exist

Comments
2 min read
How I Built a Red/Blue Team Loop That Teaches My AI Firewall to Defend Itself

How I Built a Red/Blue Team Loop That Teaches My AI Firewall to Defend Itself

Comments
8 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.