DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Dec 19, 2025 | The Tongyi Weekly: Your weekly dose of cutting-edge AI from Tongyi Lab

Dec 19, 2025 | The Tongyi Weekly: Your weekly dose of cutting-edge AI from Tongyi Lab

Comments
5 min read
📌 Most models use Grouped Query Attention. That doesn’t mean yours should.📌

📌 Most models use Grouped Query Attention. That doesn’t mean yours should.📌

1
Comments
1 min read
How to Evaluate Your RAG System: A Complete Guide to Metrics, Methods, and Best Practices

How to Evaluate Your RAG System: A Complete Guide to Metrics, Methods, and Best Practices

Comments
18 min read
Mooncake Memory Deep Dive: KVCache, Token Cost, DRAM Usage, and Saturation Analysis

Mooncake Memory Deep Dive: KVCache, Token Cost, DRAM Usage, and Saturation Analysis

Comments
5 min read
How to Use Synthetic Data to Evaluate LLM Prompts: A Step-by-Step Guide

How to Use Synthetic Data to Evaluate LLM Prompts: A Step-by-Step Guide

Comments
8 min read
I Didn’t Build a Chatbot — I Built an AI That Runs the System

I Didn’t Build a Chatbot — I Built an AI That Runs the System

Comments
2 min read
A/B Testing Prompts: A Complete Guide to Optimizing LLM Performance

A/B Testing Prompts: A Complete Guide to Optimizing LLM Performance

Comments
7 min read
Local Lock Down Lobe Chat Setup

Local Lock Down Lobe Chat Setup

Comments
4 min read
OWL-Aware Chunking Strategies: A Comprehensive Performance Analysis

OWL-Aware Chunking Strategies: A Comprehensive Performance Analysis

Comments
12 min read
Por Qué el 83% de Herramientas de Detección de Alucinaciones RAG Fallan en Producción

Por Qué el 83% de Herramientas de Detección de Alucinaciones RAG Fallan en Producción

Comments
3 min read
Beyond Sharper Images: How LLM-Guided Super-Resolution Transforms Geo-Spatial Analysis

Beyond Sharper Images: How LLM-Guided Super-Resolution Transforms Geo-Spatial Analysis

Comments
7 min read
ChatGPT Hacks: How Developers Actually Use It in Real Projects

ChatGPT Hacks: How Developers Actually Use It in Real Projects

1
Comments 2
3 min read
I Built an ETL Pipeline That Actually Thinks & And Cut Token Costs by 52% (And Here's What I Learned)

I Built an ETL Pipeline That Actually Thinks & And Cut Token Costs by 52% (And Here's What I Learned)

1
Comments
17 min read
How to Make Your AI Strictly Follow Rules: Building a Robust Rule System

How to Make Your AI Strictly Follow Rules: Building a Robust Rule System

Comments
3 min read
Por Qué el 47% de Empresas Están Migrando de GPT-4 a Open Source LLMs en 2025

Por Qué el 47% de Empresas Están Migrando de GPT-4 a Open Source LLMs en 2025

Comments
2 min read
From Reviewer to Architect: Escaping the AI Verification Trap

From Reviewer to Architect: Escaping the AI Verification Trap

1
Comments
5 min read
Top 5 AI Gateways for 2026: Building Reliable Multi-Provider AI Infrastructure

Top 5 AI Gateways for 2026: Building Reliable Multi-Provider AI Infrastructure

Comments
12 min read
When Announcements Replace Innovation: OpenAI’s Code Red 🚨

When Announcements Replace Innovation: OpenAI’s Code Red 🚨

Comments
3 min read
Why I Built a Spark-Native LLM Evaluation Framework

Why I Built a Spark-Native LLM Evaluation Framework

Comments
9 min read
Fine-Tuning Large Language Models with LoRA and QLoRA

Fine-Tuning Large Language Models with LoRA and QLoRA

Comments
2 min read
TOON vs JSON: When 60% Token Savings Becomes 1.8% - A Reality Check

TOON vs JSON: When 60% Token Savings Becomes 1.8% - A Reality Check

Comments
5 min read
Self-Host Your LLM Gateway or Try the Managed Version (Bifrost OSS & Enterprise)

Self-Host Your LLM Gateway or Try the Managed Version (Bifrost OSS & Enterprise)

5
Comments
2 min read
Lessons from wiring text, image, and audio into a single LLM gateway - Bifrost

Lessons from wiring text, image, and audio into a single LLM gateway - Bifrost

5
Comments
1 min read
Escape the Notebook: Build and Debug Deep LLM Agents Right in Your Terminal

Escape the Notebook: Build and Debug Deep LLM Agents Right in Your Terminal

Comments
3 min read
Code review with private LLM? In pipeline? Simple!

Code review with private LLM? In pipeline? Simple!

Comments
10 min read
loading...