DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Implementing Claude Code's Memory Model as a Dreaming Layer on 58 Articles

Implementing Claude Code's Memory Model as a Dreaming Layer on 58 Articles

Comments
7 min read
AI Evals, Explained: How We Actually Know Our AI Is Any Good

AI Evals, Explained: How We Actually Know Our AI Is Any Good

Comments
6 min read
AI 週報 — 2026-06-05 to 2026-06-11 | OpenAI 掛牌倒數:秘密 S-1 提交背後的三個技術訊號

AI 週報 — 2026-06-05 to 2026-06-11 | OpenAI 掛牌倒數:秘密 S-1 提交背後的三個技術訊號

Comments
2 min read
Token-Based Pricing Doesn't Survive Adoption Curves

Token-Based Pricing Doesn't Survive Adoption Curves

Comments
7 min read
Ollama Cloud Free vs Pro — Usage Limits, Pricing & What You Actually Get (2026)

Ollama Cloud Free vs Pro — Usage Limits, Pricing & What You Actually Get (2026)

Comments
3 min read
Konsey: a multi-LLM council where a model can't verify its own output

Konsey: a multi-LLM council where a model can't verify its own output

Comments
1 min read
Citation-Guard: Production RAG Patterns for Regulated Fintech

Citation-Guard: Production RAG Patterns for Regulated Fintech

Comments
4 min read
The Three-Layer Architecture of AI Tokens: Why the Middle Is Eating the Stack

The Three-Layer Architecture of AI Tokens: Why the Middle Is Eating the Stack

Comments
9 min read
TradeMemory An AI-Powered Persistence Layer for Disciplined Trading

TradeMemory An AI-Powered Persistence Layer for Disciplined Trading

Comments
3 min read
Pitfalls of Testing LLM Long-Term Memory: A 3‑Day Debugging Saga

Pitfalls of Testing LLM Long-Term Memory: A 3‑Day Debugging Saga

Comments
4 min read
How do Chinese access Claude/GPT API at 0.2x pricing?

How do Chinese access Claude/GPT API at 0.2x pricing?

1
Comments 2
13 min read
Flash Attention: what it does and why it matters

Flash Attention: what it does and why it matters

Comments
8 min read
Agent-Harness Scaling Law: Feedback Quality Predicts Success, Not Raw Compute: Effective Feedback Compute (EFC)

Agent-Harness Scaling Law: Feedback Quality Predicts Success, Not Raw Compute: Effective Feedback Compute (EFC)

Comments
7 min read
Same Lever, Opposite Intent: When Shared Agent Memory Backfires

Same Lever, Opposite Intent: When Shared Agent Memory Backfires

1
Comments 4
2 min read
GeekNews AI Weekly Deep Dive - 2026-06-15

GeekNews AI Weekly Deep Dive - 2026-06-15

Comments
1 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.