DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Notes on Serving LLMs with TensorRT-LLM and Triton

Notes on Serving LLMs with TensorRT-LLM and Triton

Comments
4 min read
Why does AI stock analysis sound so responsible and stay so empty?

Why does AI stock analysis sound so responsible and stay so empty?

Comments
4 min read
Claude Opus 4.8: Parallel-Subagent Dynamic Workflows

Claude Opus 4.8: Parallel-Subagent Dynamic Workflows

Comments
6 min read
Opus 4.8 barely moved the leaderboard. It moved the one number that decides if your agents can be trusted.

Opus 4.8 barely moved the leaderboard. It moved the one number that decides if your agents can be trusted.

Comments
5 min read
Hallucinations Are Not Always Wrong Facts: Sometimes They're Wrong Interpretations

Hallucinations Are Not Always Wrong Facts: Sometimes They're Wrong Interpretations

1
Comments 1
2 min read
Opus 4.8, Qwen, DeepSeek, and a Claude Code Failure: What I Could Actually Reproduce

Opus 4.8, Qwen, DeepSeek, and a Claude Code Failure: What I Could Actually Reproduce

Comments
3 min read
AI Basics: Key Concepts Every Software Engineer Should Know

AI Basics: Key Concepts Every Software Engineer Should Know

Comments
18 min read
How We Cut Our AI Coding Bill by 65% Without Sacrificing Quality

How We Cut Our AI Coding Bill by 65% Without Sacrificing Quality

Comments
3 min read
KeyMesh: Zero-Runtime-Dependency API Key Rotation, Circuit Breaker and Failover for Production LLM Applications in Node.js

KeyMesh: Zero-Runtime-Dependency API Key Rotation, Circuit Breaker and Failover for Production LLM Applications in Node.js

Comments
3 min read
Microsoft MAI-Code-1-Flash: Adaptive Solution-Length Control

Microsoft MAI-Code-1-Flash: Adaptive Solution-Length Control

1
Comments
6 min read
Choosing the Right Scraping Interface for LLM Workflows

Choosing the Right Scraping Interface for LLM Workflows

1
Comments
4 min read
Rust RAG, Tokenizer-Free TTS (VoxCPM2), & Project NOMAD: Local AI & Offline Deployments

Rust RAG, Tokenizer-Free TTS (VoxCPM2), & Project NOMAD: Local AI & Offline Deployments

Comments
3 min read
I stopped letting AI review its own code

I stopped letting AI review its own code

Comments 1
6 min read
I got tired of LLM observability tools getting acquired. So I built one that can't be.

I got tired of LLM observability tools getting acquired. So I built one that can't be.

Comments
1 min read
Quantum Edge, LLM Leaders, and Hidden AI Traps

Quantum Edge, LLM Leaders, and Hidden AI Traps

Comments
2 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.