DEV Community
Python
import antigravity
Posts
How Much GPU Memory Does NexusQuant Actually Save?
João André Gomes Marques
Apr 7
#machinelearning #gpu #llm #python
4 min read
What I Learned Testing 12 Compression Approaches That Failed
João André Gomes Marques
Apr 7
#machinelearning #llm #research #python
6 min read
The Math Behind E8 Lattice Quantization (with Code)
João André Gomes Marques
Apr 7
#machinelearning #math #python #llm
6 min read
Why Your RAG System Returns Garbage (And How to Actually Fix It)
Alan West
Mar 27
#rag #llm #python #ai
5 min read
Why Python's sorted() Is Safer Than list.sort() in Production Systems
Emmimal P Alexander
Mar 4
#python #programming #beginners #tutorial
11 min read
Building Privacy-Preserving Machine Learning: A Practical Guide to Federated Learning
Dinesh Garikapati
Mar 27
#machinelearning #privacy #python #tutorial
2 reactions
4 min read
I Built a Semantic Cache That Cuts LLM API Costs by 72% - What Actually Worked and What Didn't
Vinay Kumar Reddy Budideti
Mar 4
#ai #python #opensource #machinelearning
6 min read
How to deploy NexusQuant in production (and what's missing)
João André Gomes Marques
Apr 7
#machinelearning #llm #production #python
4 min read
Build and deploy a RAG pipeline as a REST API in under 5 minutes with RAGLight
Bessouat40
Mar 4
#python #ai #rag #opensource
3 min read
Why Your AI Agents Are Burning Cash and How to Fix It
Alan West
Mar 27
#ai #llm #agents #python
5 min read
5 open source tools for AI agent governance in 2026
João André Gomes Marques
Apr 7
#ai #security #opensource #python
1 reaction
3 comments
1 min read
Compress your LLM's KV cache 33x with zero training
João André Gomes Marques
Apr 7
#python #machinelearning #llm #opensource
2 min read
Longer contexts are easier to compress (not harder)
João André Gomes Marques
Apr 7
#python #machinelearning #llm #performance
2 min read
Why E8 lattice quantization beats scalar quantization for KV caches
João André Gomes Marques
Apr 7
#python #machinelearning #math #llm
2 min read
Building the Trust Layer for AI Trading Agents
laguia
Mar 4
#ai #mcp #api #python
1 reaction
2 min read