DEV Community

# llm

Posts

Voice-Controlled Local AI Agent

Comments
5 min read
A Gemini Deep Research Failure Mode: Refusal, Topic Drift, and Fabricated Charts

Comments
7 min read
I Was About to Rewrite My Chat Router. The Bug Was Two Lines in a Prompt.

Marketing copy mistaken for product catalogs

18
Comments 14
7 min read
Chronicle: Rethinking Codebase Context for AI Coding Agents

3
Comments
4 min read
Building a Voice-Controlled AI Agent using AssemblyAI and Groq

1
Comments
3 min read
I Built a Debugger for LLM Agents — Here's Why "Observability" Wasn't Enough

3
Comments 1
2 min read
Building KernelMind Part 2: Hybrid Retrieval, Reranking, and Actually Retrieving Useful Code

2
Comments 3
5 min read
A CLI tool to score fine-tuning dataset quality before training starts

2
Comments
3 min read
You Probably Don't Need a Custom Agent

Comments
3 min read
Stop Blaming Your Prompts. It’s the Architecture, Stup1d!

1
Comments 1
2 min read
20260324_snn_vs_gpu_en

Comments
6 min read
Open-Weight AI Model Licenses Compared: What MiniMax's Controversy Means for You

1
Comments
5 min read
llama.cpp Settings Can Change 8GB Performance by 5x: Finding the Optimal Values for the Main Options

Comments
4 min read
"Just Add More VRAM" Is Physically Wrong: What HBM, CXL, and Unified Memory Could Not Achieve

Comments
4 min read
What GenAI Actually Costs in Production

Comments 1
8 min read