Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
llm
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Speculative decoding shifted our output distribution and evals missed it
Marcus Chen
Marcus Chen
Marcus Chen
Follow
Jun 18
Speculative decoding shifted our output distribution and evals missed it
#
machinelearning
#
llm
#
mlops
#
pytorch
1
reaction
Comments
1
comment
4 min read
How to Extract Business Rules from Legacy COBOL Code
Michel Ozzello
Michel Ozzello
Michel Ozzello
Follow
for
CoreStory
May 15
How to Extract Business Rules from Legacy COBOL Code
#
ai
#
llm
#
programming
#
softwareengineering
Comments
Add Comment
8 min read
How I Reduced Prompt Injection Attacks by 86% With My Own Framework (And What Went Wrong the First Time)
Gustavo Viana
Gustavo Viana
Gustavo Viana
Follow
May 15
How I Reduced Prompt Injection Attacks by 86% With My Own Framework (And What Went Wrong the First Time)
#
ai
#
security
#
python
#
llm
Comments
Add Comment
5 min read
Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift
chunxiaoxx
chunxiaoxx
chunxiaoxx
Follow
May 19
Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift
#
llm
#
memory
#
mcp
#
agents
1
reaction
Comments
Add Comment
5 min read
RAG SOTA: I Tested 7 Pipelines and Built SEQUOIA (Open Source)
Ai developer
Ai developer
Ai developer
Follow
May 28
RAG SOTA: I Tested 7 Pipelines and Built SEQUOIA (Open Source)
#
ai
#
machinelearning
#
rag
#
llm
Comments
2
comments
2 min read
Is AI Getting Quietly Dumber? A 24/7 Benchmark That Catches LLM Degradation
Ray
Ray
Ray
Follow
Jun 18
Is AI Getting Quietly Dumber? A 24/7 Benchmark That Catches LLM Degradation
#
ai
#
llm
#
openai
#
devtools
Comments
2
comments
6 min read
Your LLM bill is not your capacity plan. Here's the math that pages you at 2am.
SolvoHQ
SolvoHQ
SolvoHQ
Follow
May 15
Your LLM bill is not your capacity plan. Here's the math that pages you at 2am.
#
ai
#
llm
#
webdev
#
programming
Comments
Add Comment
4 min read
Beyond Pay-Per-Token: How Enterprises Barter Architecture for AI Access
Vikrant Shukla
Vikrant Shukla
Vikrant Shukla
Follow
May 15
Beyond Pay-Per-Token: How Enterprises Barter Architecture for AI Access
#
ai
#
llm
#
cloud
Comments
Add Comment
3 min read
Push vs Pull Memory: A Better Way to Think About AI Agent Memory
Todd Hendricks
Todd Hendricks
Todd Hendricks
Follow
Jun 18
Push vs Pull Memory: A Better Way to Think About AI Agent Memory
#
ai
#
llm
#
agents
#
memory
Comments
Add Comment
5 min read
Why RAG Fails in Enterprise R&D (And What Actually Works)
Gilad Salinger
Gilad Salinger
Gilad Salinger
Follow
May 19
Why RAG Fails in Enterprise R&D (And What Actually Works)
#
ai
#
rag
#
llm
#
enterprise
Comments
1
comment
5 min read
LLM Structured Output Validation in Python That Holds Up
Rost
Rost
Rost
Follow
May 15
LLM Structured Output Validation in Python That Holds Up
#
architecture
#
llm
#
ai
#
aicoding
Comments
Add Comment
14 min read
Agents need a black box recorder, not more memory
Morgan
Morgan
Morgan
Follow
May 14
Agents need a black box recorder, not more memory
#
llm
#
ai
#
agents
#
devtools
Comments
Add Comment
3 min read
AI Reliability: What It Is, Why It Matters, and How to Fix It
Megha Chouhan
Megha Chouhan
Megha Chouhan
Follow
May 15
AI Reliability: What It Is, Why It Matters, and How to Fix It
#
ai
#
llm
#
monitoring
#
testing
Comments
Add Comment
9 min read
What is Agent Memory and why does it matter?
Anil Murty
Anil Murty
Anil Murty
Follow
May 14
What is Agent Memory and why does it matter?
#
agents
#
ai
#
llm
#
rag
Comments
Add Comment
7 min read
LLaMA.cpp Gets Qwen MTP Boost, Ring-2.6-1T for Ollama, AMD GPU Fixes
soy
soy
soy
Follow
May 14
LLaMA.cpp Gets Qwen MTP Boost, Ring-2.6-1T for Ollama, AMD GPU Fixes
#
ai
#
llm
#
selfhosted
Comments
Add Comment
3 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account