DEV Community

# llm

Posts

The Orphan Axiom Problem in Ontology-Based RAG

Comments
6 min read
Giving a Chat App Operational Access to My Cloudflare Account with MCP

Comments
2 min read
Optimal Chunking for Ontology RAG: Empirical Analysis & Orphan Axiom Problem

Comments
12 min read
How to Evaluate Your RAG System: A Complete Guide to Metrics, Methods, and Best Practices

Comments
18 min read
Mooncake Memory Deep Dive: KVCache, Token Cost, DRAM Usage, and Saturation Analysis

Comments
5 min read
I Tested 7 Python PDF Extractors So You Don’t Have To (2025 Edition)

Comments
6 min read
Why 83% of RAG Hallucination Detection Tools Fail in Production

Comments
3 min read
OWL-Aware Chunking Strategies: A Comprehensive Performance Analysis

Comments
12 min read
Lessons Learned Deploying LLMs in Regulated Enterprise Environments

Comments
4 min read
RAG vs Document Injection: Why Your AI Document Chat Needs Smart Retrieval

Comments
6 min read
Beyond Sharper Images: How LLM-Guided Super-Resolution Transforms Geo-Spatial Analysis

Comments
7 min read
ChatGPT Hacks: How Developers Actually Use It in Real Projects

1
Comments 2
3 min read
I Built an ETL Pipeline That Actually Thinks and Cut Token Costs by 52% (And Here's What I Learned)

1
Comments
17 min read
How to Make Your AI Strictly Follow Rules: Building a Robust Rule System

Comments
3 min read
Why 47% of Companies Are Migrating from GPT-4 to Open Source LLMs in 2025

Comments
2 min read
Top 5 AI Gateways for 2026: Building Reliable Multi-Provider AI Infrastructure

Comments
12 min read
From Reviewer to Architect: Escaping the AI Verification Trap

1
Comments
5 min read
Why I Built a Spark-Native LLM Evaluation Framework

Comments
9 min read
Fine-Tuning Large Language Models with LoRA and QLoRA

Comments
2 min read
TOON vs JSON: When 60% Token Savings Becomes 1.8% - A Reality Check

Comments
5 min read
Self-Host Your LLM Gateway or Try the Managed Version (Bifrost OSS & Enterprise)

5
Comments
2 min read
Lessons from wiring text, image, and audio into a single LLM gateway - Bifrost

5
Comments
1 min read
Escape the Notebook: Build and Debug Deep LLM Agents Right in Your Terminal

Comments
3 min read
Code review with private LLM? In pipeline? Simple!

Comments
10 min read
Runners and Model Quantization

Comments
4 min read