DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Run Gemma 4 on Your Laptop — A Hands-On Guide to Google's Latest Open Multimodal LLM

Run Gemma 4 on Your Laptop — A Hands-On Guide to Google's Latest Open Multimodal LLM

Comments
10 min read
If You Can Survive a Toddler, You Can Ship LLMs in Production

If You Can Survive a Toddler, You Can Ship LLMs in Production

5
Comments 3
5 min read
Building an MCP Server with Common Lisp

Building an MCP Server with Common Lisp

Comments
3 min read
Build a RAG Pipeline from Scratch in Python: A Step-by-Step Guide

Build a RAG Pipeline from Scratch in Python: A Step-by-Step Guide

Comments
9 min read
I Tested 6 LLM Models on the Same 50 Production Prompts — Here’s What Actually Varies

I Tested 6 LLM Models on the Same 50 Production Prompts — Here’s What Actually Varies

2
Comments
12 min read
🤖 Building a Private, Local WhatsApp AI Assistant with Node.js & Ollama

🤖 Building a Private, Local WhatsApp AI Assistant with Node.js & Ollama

Comments
2 min read
Why your LLM agent drifts off-task by step 4 (and why prompts can't fix it)

Why your LLM agent drifts off-task by step 4 (and why prompts can't fix it)

2
Comments 2
3 min read
Why AI Teams Need a Unified Gateway Instead of More API Chaos

Why AI Teams Need a Unified Gateway Instead of More API Chaos

Comments
1 min read
Gemma4 Tool Calling Fixes in llama.cpp, RTX cuBLAS MatMul Bug, & Local Ollama + Whisper UI

Gemma4 Tool Calling Fixes in llama.cpp, RTX cuBLAS MatMul Bug, & Local Ollama + Whisper UI

Comments
3 min read
Five Atomic Skills, Two Approaches: Claude Code and a Paper

Five Atomic Skills, Two Approaches: Claude Code and a Paper

Comments
22 min read
Software Engineers Are Building Agents Wrong: Treat Agentic AI Like Distributed Systems, Not Prompt Chains

Software Engineers Are Building Agents Wrong: Treat Agentic AI Like Distributed Systems, Not Prompt Chains

Comments
4 min read
RAG Architecture — Prototype to Production in Three Stages

RAG Architecture — Prototype to Production in Three Stages

1
Comments
8 min read
Building Your Own "Google Maps for Codebases": A Guide to Codebase Q&A with LLMs

Building Your Own "Google Maps for Codebases": A Guide to Codebase Q&A with LLMs

Comments
6 min read
You Don't Have to Fine-Tune Your LLM to change it's Behavior. You Can Just… Steer It.

You Don't Have to Fine-Tune Your LLM to change it's Behavior. You Can Just… Steer It.

1
Comments 1
7 min read
Claude Code install and config for Ollama, llama.cpp, pricing

Claude Code install and config for Ollama, llama.cpp, pricing

Comments
9 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.