DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
BeeLlama.cpp enhances llama.cpp, Qwen 35B hits 128K context, iOS local LLMs with Ollama

BeeLlama.cpp enhances llama.cpp, Qwen 35B hits 128K context, iOS local LLMs with Ollama

Comments
3 min read
Deterministic reliability stack for LLM pipelines

Deterministic reliability stack for LLM pipelines

Comments
1 min read
The 7-Layer Memory Architecture Behind Modern AI Agents

The 7-Layer Memory Architecture Behind Modern AI Agents

Comments
7 min read
LLM Token Counting and Cost Optimization: A Practical Guide

LLM Token Counting and Cost Optimization: A Practical Guide

1
Comments
5 min read
Integrating LLMs Into Playwright Testing Workflows

Integrating LLMs Into Playwright Testing Workflows

1
Comments 1
5 min read
Generation 1 — Standalone Models (2018–2022)

Generation 1 — Standalone Models (2018–2022)

Comments
5 min read
Why Most WordPress SEO Plugins Are Not Ready for AI Search Yet

Why Most WordPress SEO Plugins Are Not Ready for AI Search Yet

Comments
5 min read
A Survey of LLM-based Deep Search Agents Adaptive Path Planning via Weighted A* and Heuristic Rewards

A Survey of LLM-based Deep Search Agents Adaptive Path Planning via Weighted A* and Heuristic Rewards

Comments
4 min read
How Stripe, Shopify, and Airbnb Build AI Harnesses

How Stripe, Shopify, and Airbnb Build AI Harnesses

Comments
3 min read
Evaluating LLM code reviewers: an offline harness for precision, recall, and routing"

Evaluating LLM code reviewers: an offline harness for precision, recall, and routing"

2
Comments
5 min read
Anthropic plugs into SpaceX's 220,000-GPU Colossus — and doubles Claude's rate limits

Anthropic plugs into SpaceX's 220,000-GPU Colossus — and doubles Claude's rate limits

1
Comments
3 min read
I Trained an LLM on 75K of My Own Messages So It Would Stop Writing Like a Chatbot

I Trained an LLM on 75K of My Own Messages So It Would Stop Writing Like a Chatbot

Comments
8 min read
What 11 big tech companies actually do with AI in 2026

What 11 big tech companies actually do with AI in 2026

Comments
23 min read
tierKV: A Distributed KV Cache That Makes Evicted Blocks Faster to Restore Than GPU Cache Hits

tierKV: A Distributed KV Cache That Makes Evicted Blocks Faster to Restore Than GPU Cache Hits

1
Comments
3 min read
Why I Built My Own AI Project Management Assistant – and What I Learned

Why I Built My Own AI Project Management Assistant – and What I Learned

Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.