DEV Community

# llm

Posts

A six-concern production harness for Nemotron agents on Crusoe Managed Inference

4 min read
My LLM provider went down for 11 minutes. My code spent 4 of them in connect timeouts.

4 min read
I shipped eight agent-stack repos in eight hours. Here's what made it possible.

4 min read
Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift

5 min read
Is AI Getting Quietly Dumber? A 24/7 Benchmark That Catches LLM Degradation

6 min read
What 'Bring Your Own Model' (BYOK) Actually Means When You Adopt AI at Work

4 min read
.klickd v4.0.0 — Portable AI memory with constraints, strict schemas, and test vectors

6 min read
llama.cpp Checkpoint Fix, NuExtract3 VLM, & Qwen3.6 Local Inference Benchmarks

3 min read
Your GitHub Actions Logs Are Leaking LLM Keys and Your SIEM Isn't Catching It

3 min read
Auto-labelling 1.2M robotics frames with VLMs: a failover story

4 min read
We Audited Our Agent Tool-Call Traces. Half Our Eval Data Was Garbage.

4 min read
GLM-4: The Chinese-English Bilingual Workhorse You Didn't Know You Needed

3 min read
Gemma 4: Google's Lightweight Powerhouse — Run AI on Hardware You Already Own

3 min read
Preventing GPT hallucination in automated content pipelines: how I structure Make.com flows with data injection

8 min read
I ran Claude Code on a local LLM for 4 hours — 7M tokens, $0 (would have cost $94)

2 min read