DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Use Claude long enough and you'll end up with Karpathy's LLM Wiki without doing much.

Use Claude long enough and you'll end up with Karpathy's LLM Wiki without doing much.

Comments
5 min read
Comparing Model Performance: Without MTP vs. With MTP vs. With MTP + QAT

Comparing Model Performance: Without MTP vs. With MTP vs. With MTP + QAT

Comments
8 min read
AutoLab Benchmarks Frontier Agents on Long-Horizon R&D Tasks: Iterative Experiment-Loop Evaluation

AutoLab Benchmarks Frontier Agents on Long-Horizon R&D Tasks: Iterative Experiment-Loop Evaluation

Comments
6 min read
Building AI agents with Vercel AI SDK

Building AI agents with Vercel AI SDK

Comments
6 min read
How to Scrape E-Commerce Sites for AI Agents Using Playwright and LLMs

How to Scrape E-Commerce Sites for AI Agents Using Playwright and LLMs

Comments
6 min read
LLM Spend Audit: The 45-Minute Diagnostic for Startups

LLM Spend Audit: The 45-Minute Diagnostic for Startups

Comments
3 min read
Your AI Agent Is Paying for HTML It Never Reads — I Measured the 7x Token Tax

Your AI Agent Is Paying for HTML It Never Reads — I Measured the 7x Token Tax

Comments
5 min read
Designing Generative AI Career Coaches that Meet EU AI Rules & Boost Remote‑Work Upskilling

Designing Generative AI Career Coaches that Meet EU AI Rules & Boost Remote‑Work Upskilling

Comments
9 min read
The Eval Gap: Your Agent Has Observability but No Idea If It's Any Good

The Eval Gap: Your Agent Has Observability but No Idea If It's Any Good

Comments
6 min read
Benchmarking a kill switch for runaway AI agents -- and why the real number is a ceiling, not a %

Benchmarking a kill switch for runaway AI agents -- and why the real number is a ceiling, not a %

Comments
4 min read
If an LLM Can Answer a Question, Why Does LangChain Need Chains?

If an LLM Can Answer a Question, Why Does LangChain Need Chains?

1
Comments
2 min read
Google Ships Gemma 4 QAT Checkpoints: Quantization-Aware Training

Google Ships Gemma 4 QAT Checkpoints: Quantization-Aware Training

1
Comments
6 min read
Datadog dashboards for prompt regression: the panels we actually keep

Datadog dashboards for prompt regression: the panels we actually keep

Comments
8 min read
AI Agents Are Not Just Prompts: What You Need to Understand First

AI Agents Are Not Just Prompts: What You Need to Understand First

1
Comments
3 min read
Telegram Integration - 0$ Personal Agentic AI Assistant - Part 5

Telegram Integration - 0$ Personal Agentic AI Assistant - Part 5

Comments
6 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.