keeper

Posted on May 17

I squeezed my iGPU dry, then added an eGPU — a GPU buying guide for AI on mini PCs

#ai #llm #machinelearning #hardware

Last month, I hit a wall with my local LLM setup. Here's the full story — from software optimization to OCuLink eGPU to picking the right RTX 5060 Ti 16GB, with real pricing and brand teardown data.

Not a review. A decision log.

The problem

My machine — call it T2 — is a Minisforum AI X1 Pro (AMD Ryzen AI 9 HX 370, 96GB RAM). It runs LM Studio with Gemma 4 E4B and Peach 2.0 for local inference.

The Radeon 890M iGPU is decent. But shared memory architecture is a hard ceiling:

Bandwidth: ~120 GB/s vs 448 GB/s on a dedicated GPU
Long contexts (32K+) fight the CPU for memory bandwidth
Two models can't stay loaded simultaneously without painful reload times

Software optimizations helped (multi-model loading, continuous batching, KV cache quantization) but couldn't break the physical bottleneck. Time for a discrete GPU.

The approach: OCuLink + RTX 5060 Ti 16GB

For HX370 mini PCs, OCuLink (PCIe 4.0 x4) is the only reasonable expansion path — 2x the bandwidth of USB4, and the eGPU dock costs ¥200–400 (~$30–60).

The bandwidth myth

"Won't PCIe 4.0 x4 bottleneck the 5060 Ti?" For LLM inference, the impact is < 5%. The model loads into VRAM once, then inference is compute-bound. Bandwidth only matters during those few seconds of model loading.

Why the 5060 Ti 16GB?

Option	VRAM	TDP	Used Price	Verdict
RTX 5060	8GB	150W	~$450	❌ Can't run 14B models
5060 Ti 16GB	16GB	180W	~$460	✅ Sweet spot
RTX 4070	12GB	220W	~$490	❌ 12GB insufficient, more power
RTX 5070	12GB	250W	~$630+	❌ Less VRAM for more money
RX 9070 XT	16GB	260W	~$490+	❌ Too hot for OCuLink PSU

16GB is the real entry point for local AI. 180W means a single 8-pin connector — no PSU upgrade needed for the eGPU dock.

AI card selection criteria

Gamers look at frame rates and ray tracing. AI inference needs a different priority list:

VRAM > Cooling (baseplate type > heatpipe count > fan count) > VRM > Brand > RGB

Baseplate hierarchy (this matters more than most people realize):

Vapor chamber > Nickel-plated copper > Tinned copper > Untinned copper > Copper-aluminum > Aluminum > ❌ Heatpipe direct touch (HDT)

HDT baseplates have uneven contact surfaces. Thermal performance degrades under sustained load — an absolute no-go for AI workloads running 24/7.

Brand comparison (16GB models only)

ASUS

Model	VRM	Heatpipes	Baseplate	Rating
DUAL OC	5+2 50A	4×6mm	⚠️ Non-plated copper	⭐⭐
TUF Gaming	7+2 50A	5×6mm	Nickel-plated	⭐⭐⭐⭐⭐

The TUF is the most overbuilt 5060 Ti — 7+2 phase VRM is overkill for 180W, but great for 24/7 reliability. Also the most expensive at ~$560+.

MSI

Model	VRM	Heatpipes	Baseplate	Rating
❌ Ventus	5+2 50A	2×6mm	❌ HDT	❌
Gaming Trio	6+2 50A	3×6mm	Nickel-plated	⭐⭐⭐

The Ventus series uses HDT + plastic backplate across the board. Hard pass. Only consider MSI from Gaming Trio and up.

Gigabyte

Model	VRM	Heatpipes	Baseplate	Rating
❌ Windforce	5+2 50A	3×6mm	❌ Untinned copper	❌
Gaming OC	6+2 50A	5×6mm	Tinned copper	⭐⭐⭐

The Windforce is severely cut down. Brand reputation is mixed in the community.

Colorful (七彩虹)

Model	VRM	Heatpipes	Baseplate	Rating
Battle Axe DUO	5+2 50A	2×8mm	Nickel-plated	⭐⭐⭐⭐
Ultra W OC	6+2 50A	4×6mm	Nickel-plated	⭐⭐⭐⭐⭐
Advanced OC	8+2 50A	5×6mm	Nickel-plated	⭐⭐⭐⭐⭐

Colorful is the most consistently built brand across their entire lineup — everything from the budget Battle Axe to the Advanced has nickel-plated copper baseplates. The Ultra W OC is the best all-rounder.

GALAX (影驰)

Model	VRM	Heatpipes	Baseplate	Rating
❌ FIRE	6+2 50A	3×6mm	❌ HDT	❌
Metal Master	6+2 50A	3×6mm	Nickel-plated	⭐⭐⭐⭐

The Metal Master is all-metal, no RGB — ideal for headless AI servers where lights are just noise.

Quick look at other brands

Brand	Model	VRM	Heatpipes	Baseplate	Price
Maxsun	iCraft OC	5+2	3×6 nickel	Plated	~$460 used
Inno3D	Twin X2	5+2	4×6mm	Tinned	~$475
Yeston	Gaia	5+2	4×6mm	Tinned	~$450+
Gainward	Python III	6+2	3×6mm	Tinned	~$490
Zotac	X-GAMING	5+2	3×6mm	Tinned	~$490+

Models to avoid

❌ MSI Ventus — HDT + plastic backplate
❌ Gigabyte Windforce — Untinned copper baseplate
❌ GALAX FIRE — HDT
❌ Any 8GB model — can't run 14B models

Rule of thumb: No HDT, no untinned baseplates, no plastic backplates, and never 8GB VRAM.

Purchase ranking (May 2026, China pricing)

Rank	Model	Price	Condition	Why
🥇	Maxsun iCraft OC 16G	~$460	Used	No competition at this price
🥇'	Colorful Ultra W OC 16G	~$530	New	Best all-rounder, buy new
🥈	GALAX Metal Master 16G	~$500	New	All-metal, no RGB, quiet
🥉	Inno3D Twin X2 16G	~$475	New	Cheapest reliable new card
💎	ASUS TUF 16G	~$560+	New	Overbuilt, most expensive

When to buy: 618 shopping festival predictions

China's 618 (June 18) sale is the biggest mid-year shopping event, running May 13 – June 20.

Current prices (May 17)

JD.com lowest: ~$490 (Zotac X-Gaming)
Channel wholesale: ~$510-525
Secondary market (Xianyu): ~$420-460

Key factors

GDDR7 price hike won't affect 5060 Ti. Nvidia raised GDDR7 costs for the 5090 only — all other GDDR7 models are unaffected. The 5060 Ti uses 28Gbps modules with much looser supply constraints than the 5090's 32Gbps.

618 has three waves:

Phase	Date	Expected discount
Current	May 13–31	Platform coupons
Main event	June 1–3	Direct brand cuts + stackable coupons
Final	June 15–20	Clearance pricing

Price forecast

May   → New $490-545 / Used $420-460
June  → New $460-500
July  → New $450-490
Nov   → New $420-460 (Singles' Day, theoretical floor)

Three scenarios

🔴 Need it now → Buy used at ~$460. The 180W thermal design means low failure risk on the used market.
🟡 Can wait, want new → Watch JD.com June 1–3. Zotac and Colorful will likely drop below $460.
🟢 Targeting $420 → Wait until Singles Day (Nov 11). The card will be a year old by then with well-released pricing.

Final build reference

Host:  Minisforum AI X1 Pro (HX370 / 96GB)
Link:  OCuLink eGPU dock (~$40)
GPU:   Maxsun iCraft OC 16GB (used, ~$460)
Stack: LM Studio → Gemma 4 E4B + Peach 2.0

Takeaways

Optimize software first (free) — multi-model loading, continuous batching, KV cache quantization
Only upgrade hardware when you hit the physical ceiling
Best path for mini PC AI inference: OCuLink + 5060 Ti 16GB
Selection priority: VRAM > cooling > VRM > brand > RGB

Local AI inference is still at the "find your bottleneck and patch the cheapest one" stage. The most cost-effective solution is always the one that's just enough.

DEV Community