Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
gpu
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Rodei IA de 35B na minha GPU velha e me surpreendi!
Marcelo Cabral Ghilardi
Marcelo Cabral Ghilardi
Marcelo Cabral Ghilardi
Follow
Jun 3
Rodei IA de 35B na minha GPU velha e me surpreendi!
#
ai
#
gpu
#
llm
#
quantizacao
Comments
Add Comment
4 min read
Which serverless GPU platform has the fastest cold start for inference — I tested five and tracked p99 specifically
yukixing6-star
yukixing6-star
yukixing6-star
Follow
Jun 3
Which serverless GPU platform has the fastest cold start for inference — I tested five and tracked p99 specifically
#
gpu
#
machinelearning
Comments
Add Comment
2 min read
An AMD GPU Beat My Mac on Llama 8B. The Same GPU Lost on Phi-3.
Rob
Rob
Rob
Follow
Jun 2
An AMD GPU Beat My Mac on Llama 8B. The Same GPU Lost on Phi-3.
#
performance
#
benchmarks
#
machinelearning
#
gpu
Comments
Add Comment
5 min read
The cheapest way to run RTX 5090 and H200 inference without AWS — a real cost comparison
yukixing6-star
yukixing6-star
yukixing6-star
Follow
Jun 3
The cheapest way to run RTX 5090 and H200 inference without AWS — a real cost comparison
#
gpu
#
machinelearning
#
cloudcomputing
Comments
Add Comment
1 min read
Fleet 1.0: Finding the One Slow Rank in a 64-GPU Job From the Cluster Side
Ingero Team
Ingero Team
Ingero Team
Follow
Jun 2
Fleet 1.0: Finding the One Slow Rank in a 64-GPU Job From the Cluster Side
#
ebpf
#
gpu
#
kubernetes
#
observability
Comments
Add Comment
6 min read
Best Local AI Models for Each VRAM Tier (4 GB to 80 GB) in 2026
Jovan Chan
Jovan Chan
Jovan Chan
Follow
Jun 2
Best Local AI Models for Each VRAM Tier (4 GB to 80 GB) in 2026
#
localai
#
vram
#
hardware
#
gpu
Comments
Add Comment
6 min read
$20K local AI coding workstation in 2026: what hardware actually runs agentic workflows
Jovan Chan
Jovan Chan
Jovan Chan
Follow
Jun 2
$20K local AI coding workstation in 2026: what hardware actually runs agentic workflows
#
hardware
#
gpu
#
localai
#
agenticcoding
Comments
Add Comment
6 min read
Intel Arc B580 for Local AI: 12 GB at $249, With a Software Tax
Jovan Chan
Jovan Chan
Jovan Chan
Follow
Jun 2
Intel Arc B580 for Local AI: 12 GB at $249, With a Software Tax
#
intelarc
#
gpu
#
localai
#
llm
Comments
Add Comment
5 min read
Mistral Small 4 for Local AI in 2026: The 119B MoE Hardware Reality
Jovan Chan
Jovan Chan
Jovan Chan
Follow
Jun 2
Mistral Small 4 for Local AI in 2026: The 119B MoE Hardware Reality
#
mistral
#
localai
#
hardware
#
gpu
Comments
Add Comment
6 min read
RTX 5060 for Local AI in 2026: When 448 GB/s Hits an 8GB Wall
Jovan Chan
Jovan Chan
Jovan Chan
Follow
Jun 2
RTX 5060 for Local AI in 2026: When 448 GB/s Hits an 8GB Wall
#
gpu
#
nvidia
#
rtx5060
#
localllm
Comments
Add Comment
6 min read
NVIDIA's NVK Vulkan Driver Boosts Mesh Shaders; Wayland Dominates Linux Desktops; Jetson Updates for Physical AI
soy
soy
soy
Follow
Jun 2
NVIDIA's NVK Vulkan Driver Boosts Mesh Shaders; Wayland Dominates Linux Desktops; Jetson Updates for Physical AI
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
Docker vs Podman for AI/ML Workloads in 2026: A Technical Comparison
Pavan Madduri
Pavan Madduri
Pavan Madduri
Follow
Jun 2
Docker vs Podman for AI/ML Workloads in 2026: A Technical Comparison
#
docker
#
gpu
#
agents
#
ai
1
 reaction
Comments
Add Comment
6 min read
NVIDIA RTX Spark Superchip Unveiled, NBD-VRAM for GPU Swap, Local AI on RTX
soy
soy
soy
Follow
Jun 1
NVIDIA RTX Spark Superchip Unveiled, NBD-VRAM for GPU Swap, Local AI on RTX
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
Auto-Generated CUDA Kernels Need Kernel-Level Validation
Ingero Team
Ingero Team
Ingero Team
Follow
Jun 1
Auto-Generated CUDA Kernels Need Kernel-Level Validation
#
ai
#
machinelearning
#
gpu
#
performance
Comments
Add Comment
5 min read
Notes on CUDA Tensor Core GEMM (WMMA)
member_2e5ba30f
member_2e5ba30f
member_2e5ba30f
Follow
May 31
Notes on CUDA Tensor Core GEMM (WMMA)
#
cuda
#
gpu
#
cpp
#
performance
Comments
Add Comment
4 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account