Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
llm
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Qwen3.6 MoE, WritHer Offline AI, & llama.cpp Benchmarks Lead Local AI News
soy
soy
soy
Follow
Apr 16
Qwen3.6 MoE, WritHer Offline AI, & llama.cpp Benchmarks Lead Local AI News
#
ai
#
llm
#
selfhosted
Comments
Add Comment
3 min read
How to Detect If Your LLM Proxy Is Silently Eating Your Tokens
Alan West
Alan West
Alan West
Follow
Apr 16
How to Detect If Your LLM Proxy Is Silently Eating Your Tokens
#
llm
#
ai
#
security
#
openai
Comments
Add Comment
5 min read
Subliminal Learning and the Hidden Channel Problem in LLM Training
Maurizio Morri
Maurizio Morri
Maurizio Morri
Follow
Apr 16
Subliminal Learning and the Hidden Channel Problem in LLM Training
#
ai
#
llm
#
machinelearning
#
security
Comments
Add Comment
2 min read
Why Your 5-Agent System Forgets State (And How to Fix It)
Luis Gerardo Rodriguez Garcia
Luis Gerardo Rodriguez Garcia
Luis Gerardo Rodriguez Garcia
Follow
Apr 17
Why Your 5-Agent System Forgets State (And How to Fix It)
#
showdev
#
ai
#
opensource
#
llm
Comments
Add Comment
1 min read
qwen3.6 scores 73.4 on SWE-bench with only 3B active parameters. here's why that matters.
David
David
David
Follow
Apr 16
qwen3.6 scores 73.4 on SWE-bench with only 3B active parameters. here's why that matters.
#
ai
#
llm
#
opensource
#
coding
Comments
Add Comment
4 min read
10 Ways To Reduce Your LLM API Costs
Bruno Pérez
Bruno Pérez
Bruno Pérez
Follow
May 20
10 Ways To Reduce Your LLM API Costs
#
ai
#
llm
#
money
#
models
8
reactions
Comments
1
comment
7 min read
How Agentic Search Actually Works: The Research Loop Link-Fetching Agents Miss
tokozen
tokozen
tokozen
Follow
Apr 16
How Agentic Search Actually Works: The Research Loop Link-Fetching Agents Miss
#
ai
#
rag
#
scraping
#
llm
Comments
Add Comment
4 min read
Gemma 4 wrote three summaries in one response. The middle one was a self-disclaimer.
thehwang
thehwang
thehwang
Follow
May 20
Gemma 4 wrote three summaries in one response. The middle one was a self-disclaimer.
#
gemma
#
llm
#
ollama
#
ablation
2
reactions
Comments
1
comment
6 min read
Prompt Hashing for Duplicate Detection: Cutting LLM Waste With SHA-256
gauravdagde
gauravdagde
gauravdagde
Follow
Apr 16
Prompt Hashing for Duplicate Detection: Cutting LLM Waste With SHA-256
#
go
#
llm
#
ai
#
backend
Comments
Add Comment
4 min read
Bringing Generative AI to Microcontrollers: Introducing NocLLM
Muhammad Ikhwan Fathulloh
Muhammad Ikhwan Fathulloh
Muhammad Ikhwan Fathulloh
Follow
Apr 21
Bringing Generative AI to Microcontrollers: Introducing NocLLM
#
arduino
#
iot
#
cpp
#
llm
1
reaction
Comments
Add Comment
3 min read
AI Red-Teaming for Beginners: Where to Start and What to Test
Charles Givre
Charles Givre
Charles Givre
Follow
Apr 16
AI Red-Teaming for Beginners: Where to Start and What to Test
#
ai
#
beginners
#
llm
#
security
Comments
Add Comment
5 min read
Runware: One API for All AI Modalities — AI University Update (77 Providers)
kanta13jp1
kanta13jp1
kanta13jp1
Follow
Apr 17
Runware: One API for All AI Modalities — AI University Update (77 Providers)
#
ai
#
llm
#
buildinpublic
#
webdev
1
reaction
Comments
1
comment
2 min read
How to Compare AI Models Without Getting Fooled by Benchmarks
BenchGecko
BenchGecko
BenchGecko
Follow
Apr 21
How to Compare AI Models Without Getting Fooled by Benchmarks
#
ai
#
machinelearning
#
llm
#
webdev
10
reactions
Comments
Add Comment
2 min read
AI 週報 — 2026/04/10–2026/04/17 模型封鎖潮來了,但工具鏈才是真戰場
Yang Goufang
Yang Goufang
Yang Goufang
Follow
Apr 17
AI 週報 — 2026/04/10–2026/04/17 模型封鎖潮來了,但工具鏈才是真戰場
#
ai
#
machinelearning
#
tech
#
llm
Comments
Add Comment
1 min read
Google I/O Review (1/5) — Gemini 3.5 'Flash' Costs 15x More Than Flash 2.0. It's Pro in Disguise
ww-w.ai
ww-w.ai
ww-w.ai
Follow
May 20
Google I/O Review (1/5) — Gemini 3.5 'Flash' Costs 15x More Than Flash 2.0. It's Pro in Disguise
#
ai
#
google
#
llm
#
pricing
1
reaction
Comments
Add Comment
5 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account