DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
GLM 5.1 just dropped — 754B open-weight MoE model under MIT license. here's how to run it

GLM 5.1 just dropped — 754B open-weight MoE model under MIT license. here's how to run it

Comments 2
3 min read
How to Fine-Tune GPT-4o-mini on Your Own Guardrail Failures (50 Lines of Python)

How to Fine-Tune GPT-4o-mini on Your Own Guardrail Failures (50 Lines of Python)

Comments
6 min read
The 5 Levels of RAG Maturity: How to Know When Your RAG Is Actually Production-Ready

The 5 Levels of RAG Maturity: How to Know When Your RAG Is Actually Production-Ready

4
Comments
8 min read
I built a client-side LLM token counter because I kept guessing at prompt costs

I built a client-side LLM token counter because I kept guessing at prompt costs

Comments 2
4 min read
Simplifying the AI Testing through Evaliphy

Simplifying the AI Testing through Evaliphy

1
Comments
5 min read
My AI pipeline had a 1M token context window. The output still got worse.

My AI pipeline had a 1M token context window. The output still got worse.

Comments
2 min read
AI Model Pricing Is a Mess — Here Is How We Track It

AI Model Pricing Is a Mess — Here Is How We Track It

1
Comments
2 min read
From ollama run to Tokens: What Really Happens When You Run an LLM Locally

From ollama run to Tokens: What Really Happens When You Run an LLM Locally

1
Comments
5 min read
Training Small LLMs to Edit Code Instead of Generating It

Training Small LLMs to Edit Code Instead of Generating It

Comments
4 min read
Is Brain Float (bf16) Worth it?

Gemma 4 Challenge: Build With Gemma 4 Submission

Is Brain Float (bf16) Worth it?

7
Comments 2
6 min read
How to Add Cost-Aware Model Selection to Your AI Agent

How to Add Cost-Aware Model Selection to Your AI Agent

1
Comments
2 min read
"Attention Is All You Need" Paper tahun 2017 yang mengubah dunia kecerdasan buatan, dijelaskan tanpa perlu latar belakang teknis.

"Attention Is All You Need" Paper tahun 2017 yang mengubah dunia kecerdasan buatan, dijelaskan tanpa perlu latar belakang teknis.

Comments
4 min read
The AI Development Stack: Fundamentals Every Developer Should Actually Understand

The AI Development Stack: Fundamentals Every Developer Should Actually Understand

2
Comments
8 min read
Don't Change the Topic With an LLM

Don't Change the Topic With an LLM

Comments
2 min read
Right Model, Right Time - Why Model Routing Is Becoming Core to GenAI Platforms

Right Model, Right Time - Why Model Routing Is Becoming Core to GenAI Platforms

Comments 2
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.