soy
Posted on • Originally published at media.patentllm.org

RTX 5090 LLM 100 tps Benchmarks, RTX 5060 Ti eGPU with TBT5/OCuLink, NVIDIA Frame Gen

Today's Highlights

Today's hardware news covers three GPU stories: a user-reported 100 tps with a 256k context for Qwen3.6-27B-INT4 on an RTX 5090 via vLLM, MOREFINE's new RTX 5060 Ti eGPU with Thunderbolt 5 and OCuLink, and a report of "black magic" FPS gains from NVIDIA's Multi Frame Generation on an RTX 5050 laptop.

Qwen3.6-27B-INT4 Achieves 100 tps with 256k Context on RTX 5090 via vLLM 0.19 (r/LocalLLaMA)

Source: https://reddit.com/r/LocalLLaMA/comments/1sw21op/qwen3627bint4_clocking_100_tps_with_256k_context/

This report highlights the Qwen3.6-27B-INT4 model reaching 100 tokens per second (tps) with a 256,000-token context length on a single NVIDIA RTX 5090. The benchmark used vLLM version 0.19, a popular open-source library for high-throughput LLM inference. Fitting a 27B-parameter model and a context window of that size on one consumer card depends on INT4 weight quantization and careful VRAM budgeting, since the weights and the KV cache have to share the card's 32 GB. The RTX 5090 is NVIDIA's flagship consumer GPU of the Blackwell generation, so the result is a useful indicator of what local LLM inference can look like on a single high-end card.
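To make the VRAM budgeting concrete, here is a rough back-of-the-envelope sketch. It assumes exactly 27 billion parameters stored at a flat 4 bits each and ignores quantization scales, activations, and runtime overhead, none of which are given in the report, so the result is only an upper bound on what is left for the KV cache.

```python
# Back-of-the-envelope VRAM budget; every figure here is an approximation.
GB = 1e9

def weight_bytes(num_params: float, bits_per_weight: float) -> float:
    """Approximate storage for the weights alone, ignoring quantization scales."""
    return num_params * bits_per_weight / 8

vram_bytes = 32 * GB              # RTX 5090 memory capacity
weights = weight_bytes(27e9, 4)   # assumption: 27B parameters at a flat 4 bits each
remaining = vram_bytes - weights  # upper bound for KV cache plus everything else
context_tokens = 256_000          # context length quoted in the report
per_token_budget = remaining / context_tokens

print(f"weights: {weights / GB:.1f} GB")
print(f"left over: {remaining / GB:.1f} GB")
print(f"KV budget per token: {per_token_budget / 1024:.1f} KiB at most")
```

Under these assumptions the weights take about 13.5 GB, which leaves at most 18.5 GB, or roughly 70 KiB per token of context. That is why the KV cache footprint per token matters as much as the weight quantization at this context length.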

Comment: Achieving 100 tps on an RTX 5090 with a 256k context window using vLLM and INT4 quantization is a powerful example of real-world VRAM and throughput optimization. This is exactly what we look for when deploying large models for demanding applications.

MOREFINE Unveils Compact RTX 5060 Ti eGPU with Thunderbolt 5 & OCuLink (r/nvidia)

Source: https://reddit.com/r/nvidia/comments/1sw2gq7/morefine_launches_compact_geforce_rtx_5060_ti/

MOREFINE has introduced a compact external GPU (eGPU) enclosure featuring the GeForce RTX 5060 Ti. Priced at $1099, it offers both Thunderbolt 5 and OCuLink connectivity. Thunderbolt 5 provides 80 Gbps of bidirectional bandwidth, with an asymmetric mode of up to 120 Gbps in one direction for display-heavy workloads, which eases the host-to-GPU transfer bottleneck that limited earlier Thunderbolt eGPUs. OCuLink (Optical-Copper Link) carries PCIe lanes directly over a cable and so avoids Thunderbolt's protocol tunneling; its nominal link rate is typically not higher than Thunderbolt 5's, but the direct connection tends to give lower latency and less overhead. The eGPU targets users who need portable, high-performance graphics and aims to approach desktop GPU performance in a compact form factor.
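For a rough sense of the link rates involved, the sketch below compares Thunderbolt 5's symmetric link rate with a PCIe 4.0 x4 connection. The PCIe 4.0 x4 figure for OCuLink is an assumption (it is a common OCuLink configuration, but the source does not state the lane count or generation for this enclosure), and the Thunderbolt number is the raw link rate, not the PCIe throughput a GPU would actually see.

```python
def pcie_gbps(gt_per_s: float, lanes: int) -> float:
    """Line rate in Gbps after 128b/130b encoding (used by PCIe 3.0 to 5.0)."""
    return gt_per_s * lanes * 128 / 130

links_gbps = {
    "Thunderbolt 5 (symmetric link rate)": 80.0,
    # Assumption: the OCuLink port runs PCIe 4.0 (16 GT/s per lane) over 4 lanes.
    "OCuLink, assumed PCIe 4.0 x4": pcie_gbps(16, 4),
}

for name, gbps in links_gbps.items():
    print(f"{name}: {gbps:.1f} Gbps, about {gbps / 8:.2f} GB/s")
```

The point of the comparison is that the two options are in the same range on paper, so the practical difference comes mostly from protocol overhead and latency, not from headline bandwidth.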

Comment: The integration of Thunderbolt 5 and OCuLink in this RTX 5060 Ti eGPU marks a significant step forward for external graphics, directly addressing bandwidth limitations that have historically plagued eGPU performance. This provides compelling options for engineers and researchers requiring portable yet powerful GPU compute.

NVIDIA MFG (Multi Frame Generation) Delivers "Black Magic" FPS Boost on RTX 5050 Laptops (r/nvidia)

Source: https://reddit.com/r/nvidia/comments/1swel85/mfg_is_actually_black_magic_at_least_for_star/

A user reports large performance gains using NVIDIA's Multi Frame Generation (MFG), alongside DLSS, on an RTX 5050 laptop. In "Star Wars Outlaws" at 1080p, DLSS Quality (preset K) combined with 3x or 4x MFG produced frame rates of 130-160 FPS. MFG is part of DLSS 4 and inserts AI-generated frames between rendered ones, so it multiplies the displayed frame rate and improves perceived fluidity even on lower-tier laptop GPUs. It does not raise the underlying rendered frame rate, which still governs input responsiveness. Used with that caveat in mind, these technologies can extend the useful lifespan of a GPU and enable smoother output at higher resolutions or refresh rates than native rendering could achieve alone.
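As a rough illustration of what those numbers imply, the sketch below divides the displayed frame rate by the MFG multiplier to estimate how many frames are actually rendered each second. It assumes an ideal multiplier with no generation overhead, and the report does not say which displayed rate went with which multiplier, so all four combinations are shown.

```python
def rendered_fps(displayed_fps: float, multiplier: int) -> float:
    """Estimate rendered frames per second, assuming an ideal MFG multiplier."""
    return displayed_fps / multiplier

for displayed in (130, 160):   # displayed FPS range from the report
    for multiplier in (3, 4):  # MFG modes mentioned in the report
        base = rendered_fps(displayed, multiplier)
        print(f"{displayed} FPS at {multiplier}x MFG -> about {base:.1f} rendered FPS")
```

Under this simplification the rendered rate falls somewhere between about 32 and 53 FPS, which is the figure that determines how responsive the game feels.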

Comment: NVIDIA's Multi Frame Generation, coupled with DLSS, continues to be a game-changer, especially for laptop GPUs like the RTX 5050. Reaching 130-160 FPS shows how much displayed smoothness these rendering technologies can add without top-tier hardware.
