DEV Community
#llm
Posts
Running OpenAI's gpt-oss-20b with 128k Context on a Single L4 GPU
Oleksii Nizhegolenko
May 19
#ai #llm #kubernetes #devops
13 min read
Your AI speed benchmark is measuring the one workload you don't run
Thousand Miles AI
May 19
#discuss #ai #llm #inference
3 min read
Your Tech Stack Has an AI Problem: How to Audit and Fix It in 2026
Lycore Development
May 19
#ai #architecture #llm #softwareengineering
8 min read
RAG Series (21): Performance Optimization — Faster and Cheaper
WonderLab
May 19
#ai #rag #llm #performance
7 min read
How the itrstats tax assistant works: one query, every layer
kartikey rajvaidya
May 18
#python #llm #agents #rag
10 min read
The Shai-Hulud Worm Is Now Open Source — Here's How to Stop Self-Replicating Prompts Before They Reach Your LLM
Cor E
May 19
#security #llm #appsec #cybersecurity
1 reaction
5 min read
The Hype Correction
Vilius
May 23
#ai #agents #llm #devtools
2 reactions
4 min read
Building llama.cpp from source on a Dell Precision T5820 with an RTX 3090 Ti (after seven power cycles)
Ian L. Paterson
May 18
#ai #llm #gpu #linux
16 min read
The LLM Kept Saying “Fixed.” For Three Months, It Wasn’t.
Ian L. Paterson
May 18
#ai #llm #testing #programming
7 min read
Inference Arbitrage: How I Route 200+ Daily LLM Calls Across Five Models
Ian L. Paterson
May 18
#ai #llm #devops #python
10 min read
Three Months of Speed-Up Experiments on a 3090 Ti: Autoregressive DFlash MTP for Qwen3.6-27B
Ian L. Paterson
May 18
#ai #llm #gpu #performance
18 min read
LLM Benchmark Rankings 2026: 15 Models Tested on 38 Real Coding Tasks
Ian L. Paterson
May 18
#ai #llm #programming #benchmarking
28 min read
How I Track Claude, Codex, and Gemini Quotas from One Script
Ian L. Paterson
May 18
#ai #llm #productivity #bash
6 min read
Designing a Multi-Agent AI System for Content Analysis and Recommendations
Nagashree Bhat
May 18
#systemdesign #llm #backend #ai
7 min read
A Practical Model Selection Matrix for Multi-Model AI Apps
Ye Allen
May 19
#api #ai #openai #llm
1 comment
2 min read