DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Why I Used SHA-256 to Solve a Problem Most RAG Tutorials Pretend Doesn't Exist

Why I Used SHA-256 to Solve a Problem Most RAG Tutorials Pretend Doesn't Exist

Comments
4 min read
How ChatGPT/Gemini/MS Copilot Understands Your Question: A Step-by-Step Journey from Input to Response

How ChatGPT/Gemini/MS Copilot Understands Your Question: A Step-by-Step Journey from Input to Response

Comments
2 min read
AWS Tools, AI Reliability, and Prompt Engineering Hacks

AWS Tools, AI Reliability, and Prompt Engineering Hacks

Comments
3 min read
Android Device Cloud for LLM Agents

Android Device Cloud for LLM Agents

Comments
4 min read
Build a per-locale red-team harness for your LLM agent (before you trust the English number)

Build a per-locale red-team harness for your LLM agent (before you trust the English number)

Comments
3 min read
Cursor Composer 2.5: Targeted Textual Feedback RL

Cursor Composer 2.5: Targeted Textual Feedback RL

Comments
8 min read
There Is No Single "Best Model"

There Is No Single "Best Model"

Comments
2 min read
What's the best way to access DeepSeek and Qwen in production without managing separate API keys for each provider

What's the best way to access DeepSeek and Qwen in production without managing separate API keys for each provider

Comments
1 min read
Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift

Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift

Comments
5 min read
How to add automatic LLM fallbacks to your voice pipeline

How to add automatic LLM fallbacks to your voice pipeline

Comments
9 min read
AssemblyAI LLM Gateway vs. OpenRouter vs. LLM Gateway.io: Pricing, security, and reliability compared

AssemblyAI LLM Gateway vs. OpenRouter vs. LLM Gateway.io: Pricing, security, and reliability compared

Comments
10 min read
Production Reranker Layer for RAG in Python: Cross-Encoder, Cohere Fallback, and Reciprocal Rank Fusion (Runnable Code)

Production Reranker Layer for RAG in Python: Cross-Encoder, Cohere Fallback, and Reciprocal Rank Fusion (Runnable Code)

Comments
10 min read
Building an Agent Harness from Scratch: The Loop and the Tools

Building an Agent Harness from Scratch: The Loop and the Tools

1
Comments
10 min read
From Simple LLMs to Intelligent AI Agents

From Simple LLMs to Intelligent AI Agents

Comments
3 min read
Semantic Caching with Spring AI and PgVector: Reduce LLM Costs and Improve Response Time by 90%

Semantic Caching with Spring AI and PgVector: Reduce LLM Costs and Improve Response Time by 90%

Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.