DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
The Multi-Provider LLM Problem: Why “One API” Is Not Enough

The Multi-Provider LLM Problem: Why “One API” Is Not Enough

1
Comments
1 min read
Modular LLM Inference Engine from Scratch

Modular LLM Inference Engine from Scratch

Comments
6 min read
The cheapest model call is the one you don't make

The cheapest model call is the one you don't make

Comments
6 min read
[Gemini API] Gemini Batch API and Webhook API practical usage on restaurant survey

[Gemini API] Gemini Batch API and Webhook API practical usage on restaurant survey

7
Comments
7 min read
How Transformers Architecture Powers Modern LLMs

How Transformers Architecture Powers Modern LLMs

Comments
8 min read
Local LLMs: Bytedance Lance 3B Multimodal, llama.cpp MTP, Ollama Client

Local LLMs: Bytedance Lance 3B Multimodal, llama.cpp MTP, Ollama Client

Comments
3 min read
We pre-registered, ran, and verified the macro ablation: information per joule, measured

We pre-registered, ran, and verified the macro ablation: information per joule, measured

Comments
3 min read
Lazy-Loading AI Skills in n8n with the Data Table Node

Lazy-Loading AI Skills in n8n with the Data Table Node

Comments
3 min read
I Spent 6 Months Fixing RAG. Here's What I Found (And Built)

I Spent 6 Months Fixing RAG. Here's What I Found (And Built)

1
Comments
5 min read
How to Reduce Agent Token Costs From the CLI (2026 Guide)

How to Reduce Agent Token Costs From the CLI (2026 Guide)

Comments
11 min read
Three Budget-Guardrail Failure Modes That Matter More Than Model Quality (May 2026)

Three Budget-Guardrail Failure Modes That Matter More Than Model Quality (May 2026)

Comments
2 min read
AgentThreatBench: The First OWASP Agentic Top 10 Security Benchmark

AgentThreatBench: The First OWASP Agentic Top 10 Security Benchmark

Comments
4 min read
О введении триады фильтров для нейросетей и LLM

О введении триады фильтров для нейросетей и LLM

1
Comments
1 min read
Loading Personality into AI: A Design Philosophy for Separating Memory and Persona

Loading Personality into AI: A Design Philosophy for Separating Memory and Persona

Comments
8 min read
Day 3: Prompting Techniques in AI

Day 3: Prompting Techniques in AI

Comments
2 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.