DEV Community

# llm

Posts

Kong AI Gateway vs TrueFoundry: the honest version of this comparison

Comments
7 min read
How AI Chat Platforms Actually Implement Content Moderation (and Why "Uncensored" Models Aren't Just "GPT Without Filters")

Comments
3 min read
Structuring Raw Interaction Data in AI Agents using Weaviate Engram

1
Comments
3 min read
8 of the World's Top-10 Open-Source LLMs Are Chinese. Here's How to Use Them All with One OpenAI-Compatible Key.

Comments
3 min read
What Actually Runs Well on a GTX 1080 Ti in 2026 (Measured)

Comments
3 min read
79% on LongMemEval: How We Beat Full-Context GPT-4 with a Local SQLite Database

1
Comments
9 min read
MiniMax M3 Ships Open-Weight 1M Context: MiniMax Sparse Attention (MSA)

Comments
6 min read
DiffusionGemma: How Google's New Open LLM Hits 1,000 Tokens/sec and Changes Inference Economics

5
Comments
4 min read
I stopped trusting “same answers, fewer tokens” after watching an agent lose 1 field name and burn 3 hours

Comments
7 min read
Postmortem of the 3-Hour Global Claude Outage in June 2026: Is Your AI Agent Still Relying on a Single Point of Failure?

Comments
1 min read
Postmortem of the 3-Hour Global Claude Outage: Why Your AI Agent Can't Depend on Just One LLM Provider

Comments
1 min read
LiteLLM vs Embedded Self-Healing: 3 Reasons Agent Architecture Is Not the Endgame

Comments
1 min read
How Many Times a Month Do AI Agents Crash in Production? The Truth Behind LLM API Reliability Data

Comments
1 min read
How to Build Production-Ready Generative AI Development Services for Enterprise Applications

Comments
4 min read
Stop explaining yourself to Claude

Comments
3 min read