DEV Community

# llm

Posts

TurboQuant on a MacBook: building a one-command local stack with Ollama, MLX, and an automatic routing proxy

8
Comments 1
6 min read
I built an open-source LLM eval platform with a ReAct agent that diagnoses quality regressions

1
Comments
3 min read
I Needed Memory That Survives Context Windows. Memory That Moves Across Environments

1
Comments 2
2 min read
I benchmarked 24 local LLM models for OpenClaw agent tool calling on RTX 3090

1
Comments
4 min read
How Much VRAM Do You Need to Fine-Tune an LLM? Stop Guessing and Use This Tool.

1
Comments
3 min read
Enterprise AI Security: 12 Best Practices for Deploying LLMs in Production

Comments
13 min read
What the Data Act Misses: The Last Mile Between Regulation and Adoption

Comments
4 min read
When AI Models Expose Their Tools: The Transparency Pattern Changing Agent Development

1
Comments
3 min read
What I Learned Calling 4 Different LLM APIs From the Same Codebase

Comments
4 min read
I Know It's AI, But It Still Feels Real

15
Comments 9
3 min read
I Turned My M1 MacBook Into an Offline AI Coding Agent - $0 API Cost, Zero Cloud

3
Comments 1
9 min read
Context Engineering: How to Manage Context for AI Models and Agents

Comments 2
11 min read
I Ran Google's latest Gemma 4 Models on 48GB GPU. Here's What Actually Happened.

6
Comments 1
6 min read
There Is No Best AI Model in 2026 - And That's Actually Good News

Comments
6 min read
OpenAI Structured Outputs vs Zod: which to use for LLM response validation in 2026

1
Comments
5 min read