DEV Community

# llm

Posts

LLPY-07: Integrating LLMs - OpenAI and Google Gemini

Comments
10 min read
How to serve Markdown to AI agents: Making your docs more AI-friendly

5
Comments 1
2 min read
Integrating Ollama with Python: REST API and Python Client Examples

1
Comments
4 min read
RAG LLM: Why Your AI Costs 10x More Than It Should (And How to Fix It)

Comments
5 min read
# Data Ingestion & Vector Store #llmszoomcamp

Comments
2 min read
Agentic AI: How LLMs Really Work Behind the Scenes

8
Comments
4 min read
I Tried Building a Whiteboard App with Claude 4.5 Sonnet

Comments
2 min read
Nov 7, 2025 | The Tongyi Weekly: Your weekly dose of cutting-edge AI from Tongyi Lab

1
Comments
3 min read
How To Run an Open-Source LLM on Your Personal Computer

4
Comments 1
6 min read
TOON Benchmarks: A Critical Analysis of Different Results

2
Comments 1
7 min read
Prompt Caching Slashed My AI Bills by 90%. Here's What Nobody Tells You.

Comments
5 min read
LLPY-14: Evaluation and Quality Metrics - Measuring RAG Success

Comments
12 min read
Train it or feed it? Teaching LLMs your data the smart way

Comments
4 min read
Understanding RAG: How AI Models Learn to Search Before They Speak

1
Comments
3 min read
🧑‍🚀 Choosing the Right Engine to Launch Your LLM (LM Studio, Ollama, and vLLM)

Comments 3
3 min read
AI Security Tools Find Critical curl Vulnerabilities

Comments
9 min read
Why Claude Code's Unix Philosophy Beats Other AI Assistants

Comments
8 min read
AutoAgents – a Rust-Based Multi-Agent Framework for LLM-Powered Intelligence

6
Comments
1 min read
Step-by-Step: Manual vLLM Setup on Google Cloud L4 (Debian)

Comments
2 min read
Structured prompts: how YAML cut my LLM costs by 30%

Comments
3 min read
🧩 Runtime Snapshots #3 — QA That Speaks JSON

4
Comments
1 min read
Gemini 2.5 Flash-Lite: Speed > Scale — 887 TPS, 50% Less Verbosity, Real-World Wins

Comments
1 min read
About context and LLM

2
Comments
7 min read
Building Effective Prompt Engineering Strategies for AI Agents

Comments 1
7 min read
How to Build Developer Trust in AI‑Powered Code Generation Through Data‑Driven Feedback and Evaluation

1
Comments 1
8 min read