DEV Community

# llm

Posts

Beyond Prompts: Why Context Engineering Is the Real Future of Enterprise AI

3 min read
Codex custom provider: a practical base_url setup for cheaper AI coding runs

2 min read
LLM Application Development: A Complete Developer's Guide (2026)

2 min read
AI API gateway fallback policy template for production apps

3 min read
From a Gemma 4 Challenge Project to a Manufacturing Assistant App

2 min read
What Is Agentic Workflow Consulting? A Practical Guide for Data Leaders

7 min read
When an Actor Platform Is Too Much for an LLM Scraping Task

4 min read
A GitHub project claims 60-95% fewer tokens with the same answers. The number is real. The economics it implies for your agent fleet are uncomfortable.

14 min read
Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift

5 min read
Doubling Qwen3.6-27B on One RTX 3090: ollama llama.cpp + MTP, Lever by Lever (35.7 → ~75 tok/s)

7 min read
OpenAI-compatible AI API gateway migration checklist

4 min read
AI Weekly — 2026-05-29 to 2026-06-05 | The Gap Between Launch and Landing

4 min read
The Data Pipeline Problems Nobody Mentions in AI Architecture Discussions

3 min read
How to test an OpenAI-compatible AI API gateway without rewriting your app

4 min read
NousResearch Agent, Open-Source Notebook LM, & Local Multimodal OCR for Consumer GPUs

3 min read