GPU Hardware & Driver Update: RTX 5090 Benchmarks, llama.cpp MTP, Windows 11 Fix
Today's Highlights
This week's top GPU news features practical performance optimization on NVIDIA's RTX 5090, a critical driver fix for Windows 11 users, and deep dives into multi-tensor processing for local LLM inference.
Testing llama.cpp MTP Support on RTX 5090 (r/LocalLLaMA)
Source: https://reddit.com/r/LocalLLaMA/comments/1tfgxc8/testing_llamacpp_mtp_support_on_qwen36_rtx_5090/
This report details an experimental setup to test Multi-Tensor Processing (MTP) support within llama.cpp on an NVIDIA RTX 5090 GPU running Linux. The user built llama.cpp from a specific commit to leverage the latest MTP optimizations, which are designed to improve efficiency and performance for large language model inference by better utilizing GPU resources. The test involves running the Qwen 3.6 model, highlighting how developers and enthusiasts can benchmark and optimize their local LLM setups on cutting-edge hardware.
The findings provide insights into the practical application of llama.cpp’s advanced features on high-VRAM GPUs. By focusing on specific hardware and software configurations (RTX 5090, 32 GB VRAM, Linux, custom llama.cpp build), the post demonstrates a hands-on approach to performance tuning. This kind of user-driven testing is invaluable for the open-source community, offering real-world data on the effectiveness of new features and helping to identify potential bottlenecks or areas for further optimization in GPU-accelerated ML inference.
Comment: This is a great real-world test, showing how the llama.cpp community is pushing the limits of local LLM inference on the latest NVIDIA hardware. Leveraging MTP support on a powerful GPU like the RTX 5090 is key for future efficiency gains.
RTX 5090 Overclock & Undervolt for 7% Performance Gain (r/nvidia)
Source: https://reddit.com/r/nvidia/comments/1tfxrfq/aorus_master_5090_975mv_3000_mem_2950mhz/
An NVIDIA RTX 5090 Aorus Master owner shared impressive results from optimizing their GPU through a combination of undervolting and memory overclocking. By reducing the voltage to 975mV while boosting memory clock speeds by an impressive +3000MHz and maintaining a core clock of 2950MHz, the user achieved a reported 7% performance increase. This technical tuning demonstrates that significant gains in both performance and potentially power efficiency can be extracted from the latest high-end GPUs beyond their factory settings.
This type of optimization is crucial for enthusiasts and professionals looking to maximize their hardware investment. Undervolting helps manage power consumption and heat output, contributing to a more stable and potentially longer-lasting card, while memory overclocking directly improves bandwidth, which is critical for many GPU-intensive workloads, including gaming and AI. The 7% performance uplift from this specific tuning offers a tangible benchmark for other RTX 5090 owners to aim for, providing concrete evidence of the benefits of manual GPU adjustments.
Comment: Achieving a 7% performance boost on an RTX 5090 through careful undervolting and memory OC is excellent. This shows the headroom these cards have for advanced users to fine-tune for better power efficiency and speed.
Microsoft Confirms Windows 11 Downgrading Graphics Drivers (r/Amd)
Source: https://reddit.com/r/Amd/comments/1te13zm/microsoft_confirms_windows_11_has_been/
Microsoft has officially acknowledged a significant issue within Windows 11 where the operating system has been automatically downgrading graphics drivers, leading to potential performance degradation, instability, and compatibility problems for users. This affects both NVIDIA and AMD GPU owners, as Windows Update inadvertently replaces newer, manually installed drivers with older versions. The confirmation from Microsoft includes a commitment to release a fix, addressing a widespread frustration among PC users and gamers who rely on the latest drivers for optimal GPU performance and feature support.
The problem highlights a critical flaw in Windows Update's driver management logic, particularly for graphics components where frequent updates are common and often essential for new game releases or software optimizations. Users have often found themselves in a continuous loop of reinstalling their preferred drivers, only for Windows to revert them. The upcoming fix is expected to prevent these unrequested downgrades, ensuring that user-installed drivers persist and providing a more stable and predictable experience for maintaining GPU software on Windows 11 systems.
Comment: This is huge for anyone on Windows 11. Automatic driver downgrades have been a persistent headache, impacting performance and features. A fix from Microsoft is long overdue and critical for GPU stability.
Top comments (0)