DEV Community

Discussion on: Running Local LLMs, CPU vs. GPU - a Quick Speed Test

 
Maxim Saplin

It seems the threads param is ignored; I saw the same behaviour when testing CPU inference.