DEV Community

# llm

Posts

Why most LLM VRAM calculators are wrong on modern models (and an open-source MIT fix)

Comments
2 min read
How to test an OpenAI-compatible AI API gateway without rewriting your app

Comments
4 min read
Tigergraph-MediGraph

Comments
2 min read
Auto-Generating JSON-LD: Page Signals, Type Heuristics, and a Careful Gemini Prompt

Comments
6 min read
Before the Pod Starts: GPU Node Setup for LLMs on Kubernetes

Comments
18 min read
Why Your Embedding Model Choice Matters More Than Your LLM Choice

Comments
5 min read
Parsing robots.txt for 10 AI Crawlers: Wildcards, Partial Blocks, Line Numbers

Comments
5 min read
How to Audit AI API Costs by Team and User in 2026

1
Comments
8 min read
AI Weekly: 2026-05-29 to 2026-06-05 | OpenAI Frontier Models Land on AWS: The Foundation Model Distribution War Begins

Comments
3 min read
The LLM failure mode nobody is monitoring: overconfident responses in high-stakes domains

Comments
1 min read
Our Client's In-House LLM Integration Failed in Production: Observability, Cost, Latency — What Went Wrong

Comments
7 min read
We May Be Building AI Development Tools Backwards

Comments
1 min read
Comparing LLM Fine-Tuning Methods: Full vs LoRA vs QLoRA Selection Guide 2026

Comments
1 min read
The check you can write is the check you can fool

1
Comments 1
5 min read
Agentic AI in software development: what's actually production-ready in 2026

Comments
3 min read