DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
A Picture Is Worth Ten Thousand Tokens

A Picture Is Worth Ten Thousand Tokens

Comments
6 min read
ElevenLabs Conversational AI survey bot — reducing latency and robotic feel, plus initial delay issue

ElevenLabs Conversational AI survey bot — reducing latency and robotic feel, plus initial delay issue

Comments
1 min read
Taming multi-invoice PDFs and building a customer dashboard

Taming multi-invoice PDFs and building a customer dashboard

Comments
2 min read
Stop Letting Your LLM Bill Spiral: Building a Multi-Tenant Gateway in Spring Boot

Stop Letting Your LLM Bill Spiral: Building a Multi-Tenant Gateway in Spring Boot

Comments
8 min read
LLM Study Diary #2: Tokenization

LLM Study Diary #2: Tokenization

Comments
2 min read
llama.cpp MTP Beta, Gemma GGUF Fixes, & Sentinel Local-First AI Coding App

llama.cpp MTP Beta, Gemma GGUF Fixes, & Sentinel Local-First AI Coding App

Comments
3 min read
You Vibe-Coded Your SaaS Landing Page — Google Can't See It

You Vibe-Coded Your SaaS Landing Page — Google Can't See It

Comments
2 min read
LLM Foundry on a tiny model: the stack still does the heavy lifting

LLM Foundry on a tiny model: the stack still does the heavy lifting

Comments
1 min read
Vision Models for OCR: When They Beat Tesseract and When They Don't

Vision Models for OCR: When They Beat Tesseract and When They Don't

Comments
7 min read
How Should We Evaluate AI Coding Tools in Real Engineering Environments

How Should We Evaluate AI Coding Tools in Real Engineering Environments

Comments
4 min read
The LLM-shaped hole in your XGBoost pipeline

The LLM-shaped hole in your XGBoost pipeline

Comments
1 min read
How I cut my multi-turn LLM API costs by 90% (O(N ) O(N))

How I cut my multi-turn LLM API costs by 90% (O(N ) O(N))

Comments
2 min read
Six Principles in Practice: How an Agentic E2E Found 11 Production Bugs in 8 Runs

Six Principles in Practice: How an Agentic E2E Found 11 Production Bugs in 8 Runs

Comments
13 min read
Chunking in RAG: why your splitter matters more than your embedding model

Chunking in RAG: why your splitter matters more than your embedding model

2
Comments
5 min read
What MCP Really Is — A Demo You Can Run on Your Laptop in 5 Minutes

What MCP Really Is — A Demo You Can Run on Your Laptop in 5 Minutes

Comments
10 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.