DEV Community
#llm
Posts
I Replaced $800/mo in API Costs with a Local Llama 4 Setup for E-Commerce
doltter, Apr 23
#ai #ecommerce #llm #opensource
4 min read
GPU cloud servers for AI workloads: how to choose the right instance and deploy without waste
Damaso Sanoja, May 7
#ai #cloud #infrastructure #llm
1 reaction, 15 min read
Qwen 3.6, llama.cpp Speculative Decoding, Deepseek TileKernels for Local AI on Consumer GPUs
soy, Apr 23
#ai #llm #selfhosted
3 min read
I built a new file format to cut AI token costs by 70% — here's how it works
Javier Castillo, Apr 23
#ai #data #llm #performance
1 reaction, 5 min read
I evaluated the leaked system prompts of the biggest AI coding tools. Here's what I found.
Francisco Ferreira, Apr 23
#promptengineering #llm #ai #webdev
4 min read
Doby: How I Cut Claude Code's Navigation Tokens by 95% with a Spec-First Workflow
changmyoungkim, Apr 24
#architecture #claude #llm #productivity
1 min read
LocalForge: I built a self-hosted LLM control plane with intelligent routing and LoRA finetuning
Muhammad Ali Nasir, Apr 23
#python #ai #llm #agents
2 min read
48 Hours After Publishing: Second-Order Injection Field Notes
GnomeMan4201, Apr 23
#security #llm #ai #cybersecurity
1 reaction, 2 min read
The Actual Cost of Self-Hosting Your LLM (Nobody Does This Math First)
claire nguyen, Apr 23
#llm #ai #devops #sre
4 min read
A Minimal ~9M Parameter Transformer LLM Trained from Scratch
Mudasir Habib, Apr 23
#challenge #ai #opensource #llm
2 min read
LLM Observability tool
unni mana, Apr 25
#showdev #java #llm #monitoring
1 min read
AI Duel on Building Retro RPG Quest Journal
YASHWANTH REDDY K, Apr 23
#vibecodearena #hackerearth #ai #llm
3 min read
Qwen3.6-Plus Benchmark: It Is Trying to Finish the Job, Not Just Win Chat Scores
Super Jarvis, Apr 23
#agents #ai #llm #performance
1 reaction, 5 min read
Context Compression and Persistent Memory Design for Terminal AI Assistants
Joel Alan, Apr 23
#agents #ai #cli #llm
1 reaction, 7 min read
qwen3.6-27b scores 77.2% on SWE-bench. the dense model is winning against MoE.
David, Apr 23
#ai #llm #opensource #coding
3 min read