Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
inference
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
ReFlect: Training-Free Error Recovery for Long-Horizon LLM Reasoning
Jangwook Kim
Jangwook Kim
Jangwook Kim
Follow
May 11
ReFlect: Training-Free Error Recovery for Long-Horizon LLM Reasoning
#
llmreasoning
#
agents
#
inference
#
arxiv2026
Comments
Add Comment
4 min read
Why Most Browser AI Demos Fail on Real Hardware
Bruno Juca
Bruno Juca
Bruno Juca
Follow
May 10
Why Most Browser AI Demos Fail on Real Hardware
#
ai
#
inference
#
hardware
#
benchmark
Comments
Add Comment
4 min read
The Inference Inversion
David Aronchick
David Aronchick
David Aronchick
Follow
May 5
The Inference Inversion
#
distributedcomputing
#
edgecomputing
#
nvidia
#
inference
Comments
Add Comment
7 min read
First Confirmed Directional Move on the AI Inference Frontier Index in 2026
Steriani Karamanlis
Steriani Karamanlis
Steriani Karamanlis
Follow
May 12
First Confirmed Directional Move on the AI Inference Frontier Index in 2026
#
ai
#
llm
#
inference
#
pricing
Comments
Add Comment
4 min read
Muse Spark beats Llama 4 with 10x less compute. Here's how.
Gabriel Anhaia
Gabriel Anhaia
Gabriel Anhaia
Follow
Apr 26
Muse Spark beats Llama 4 with 10x less compute. Here's how.
#
ai
#
llm
#
architecture
#
inference
Comments
Add Comment
7 min read
First Words: LLM Inference on RISC-V
Bruno Verachten
Bruno Verachten
Bruno Verachten
Follow
Apr 22
First Words: LLM Inference on RISC-V
#
bananapi
#
benchmark
#
inference
#
llamacpp
Comments
Add Comment
9 min read
Gaussian Process Regression: The Bayesian Approach to Curve Fitting
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Apr 13
Gaussian Process Regression: The Bayesian Approach to Curve Fitting
#
bayesian
#
supervisedlearning
#
probabilistic
#
inference
Comments
Add Comment
13 min read
Google Dropped TurboQuant Two Weeks Ago. The Community Already Made It Usable.
Alan West
Alan West
Alan West
Follow
Apr 7
Google Dropped TurboQuant Two Weeks Ago. The Community Already Made It Usable.
#
turboquant
#
locallm
#
inference
#
opensource
1
 reaction
Comments
Add Comment
6 min read
Hierarchical Bayesian Regression with PyMC: When Groups Share Strength
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Apr 26
Hierarchical Bayesian Regression with PyMC: When Groups Share Strength
#
bayesian
#
probabilistic
#
inference
#
pymc
1
 reaction
Comments
Add Comment
13 min read
From MLE to Bayesian Inference: Why Your Estimate Needs a Prior
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Mar 29
From MLE to Bayesian Inference: Why Your Estimate Needs a Prior
#
bayesian
#
inference
#
statistics
#
probabilistic
Comments
Add Comment
15 min read
The EM Algorithm: An Intuitive Guide with the Coin Toss Example
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Mar 27
The EM Algorithm: An Intuitive Guide with the Coin Toss Example
#
unsupervisedlearning
#
inference
#
optimisation
#
probabilistic
Comments
Add Comment
10 min read
Maximum Likelihood Estimation from Scratch: From Coin Flips to Gaussians
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Mar 26
Maximum Likelihood Estimation from Scratch: From Coin Flips to Gaussians
#
statistics
#
inference
#
optimisation
#
probabilistic
Comments
Add Comment
13 min read
Estimating Operational Costs for CLIP-Based Image Search on 1 Million Images: Infrastructure Expenses Focused
Artyom Kornilov
Artyom Kornilov
Artyom Kornilov
Follow
Mar 10
Estimating Operational Costs for CLIP-Based Image Search on 1 Million Images: Infrastructure Expenses Focused
#
clip
#
gpu
#
inference
#
cost
Comments
Add Comment
12 min read
I built an Ollama alternative with TurboQuant, model groups, and multi-GPU support
deharoalexandre-cyber
deharoalexandre-cyber
deharoalexandre-cyber
Follow
Apr 8
I built an Ollama alternative with TurboQuant, model groups, and multi-GPU support
#
ai
#
llm
#
cpp
#
inference
1
 reaction
Comments
1
 comment
4 min read
How to Optimize AI Agent Costs — Inference, API Calls, and Infrastructure
Custodia-Admin
Custodia-Admin
Custodia-Admin
Follow
Mar 13
How to Optimize AI Agent Costs — Inference, API Calls, and Infrastructure
#
agents
#
costs
#
optimization
#
inference
Comments
1
 comment
3 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account