DEV Community

Ivan Borshchov

Self-Hosted GPT-OSS-20B: Real Response Time, Token/s Throughput & Cost on L4, L40S and H100

