DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Alibaba's Qwen3.6-Max-Preview Challenges GPT-5.4 on Agentic Coding

Alibaba's Qwen3.6-Max-Preview Challenges GPT-5.4 on Agentic Coding

Comments
7 min read
Why ChatGPT will silently lie about your bank statement (and how to catch it)

Why ChatGPT will silently lie about your bank statement (and how to catch it)

Comments
4 min read
MCP in Production Reality vs the Spec

MCP in Production Reality vs the Spec

Comments
3 min read
Who Owns the Code Claude Wrote? The Legal Mess No One's Talking About

Who Owns the Code Claude Wrote? The Legal Mess No One's Talking About

Comments
3 min read
Building a Multi-Agent Travel Planner: From a One-Sentence Prompt to a Validated, Budget-Aware Itinerary

Building a Multi-Agent Travel Planner: From a One-Sentence Prompt to a Validated, Budget-Aware Itinerary

1
Comments
8 min read
How Much VRAM Do You *Actually* Need for Local LLMs?

How Much VRAM Do You *Actually* Need for Local LLMs?

Comments
2 min read
Building Reliable AI Systems: Why Prompting Isn’t Enough

Building Reliable AI Systems: Why Prompting Isn’t Enough

Comments
3 min read
I Added Three Rules to Gemma 4. The MoE Searched. The Dense Model Refused.

Gemma 4 Challenge: Write about Gemma 4 Submission

I Added Three Rules to Gemma 4. The MoE Searched. The Dense Model Refused.

36
Comments 40
11 min read
Achieving Maximum Throughput on vLLM with a Single RTX 3090: A Production Guide for 7B LLMs

Achieving Maximum Throughput on vLLM with a Single RTX 3090: A Production Guide for 7B LLMs

1
Comments
4 min read
Building CineLog: What It Takes to Ship a Local-First, Real-Time Sync App as a Solo Developer

Building CineLog: What It Takes to Ship a Local-First, Real-Time Sync App as a Solo Developer

2
Comments
6 min read
Why your diffusion model is slow at batch size 1 (and what actually helps)

Why your diffusion model is slow at batch size 1 (and what actually helps)

Comments
4 min read
DeepSeek-V4 is Here, and Yes — 1M Context Is Finally for Everyone

DeepSeek-V4 is Here, and Yes — 1M Context Is Finally for Everyone

Comments
5 min read
Stop Getting Rate-Limited: Building Bulletproof LLM API Consumption Patterns

Stop Getting Rate-Limited: Building Bulletproof LLM API Consumption Patterns

Comments
3 min read
The Agentic AI Revolution: What's Actually Happening in April 2026

The Agentic AI Revolution: What's Actually Happening in April 2026

Comments
2 min read
I switched from OpenAI to z.ai for codiai coding review ng and I'm genuinely happy with it — honest review

I switched from OpenAI to z.ai for codiai coding review ng and I'm genuinely happy with it — honest review

Comments
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.