DEV Community

# llm

Posts

Integrating LLMs into a Go service without losing your mind (or adding 550ms latency)

Comments
5 min read
OpenModels: Explore LLM Models and Inference Providers

3
Comments
5 min read
Agentic AI in chemistry

Comments
1 min read
Adding a Free Overflow Model to Your MCP Server: Gemma via the Gemini API

Comments
3 min read
LLaMA 3.3 AI Assistant to My Spring Boot WebSocket App

Comments
3 min read
LLM Guardrails in Production and How Bifrost Protects Your AI Agents at the Gateway Level

10
Comments
9 min read
ML-based LLM request classifier for cost-optimized routing (~2ms inference)

Comments
1 min read
Lorem Ipsum Makes LLMs Smarter. No, Seriously.

2
Comments
4 min read
Your RAG works on Claude. Does it work on Gemma 4? Drift detection across model families.

Comments 2
7 min read
Four Write Tools, Zero Confirmation, What Could Go Wrong

Comments
5 min read
How I Cut My AI API Costs by 60%: A Data-Driven Approach to LLM Model Selection

1
Comments 2
2 min read
Architecture Over Model: How We Got 13/13 Bug Detection Without Upgrading to a Stronger AI

Comments
13 min read
AI workshop platform for real human questions

Comments
1 min read
Monitoring LLM API Calls in Python: Latency, Token Usage, and Cost Tracking With OpenTelemetry

1
Comments 1
9 min read
pip-guardian on Pypi

Comments
2 min read