DEV Community

AI Tech Connect
AI Tech Connect

Posted on • Originally published at aitechconnect.in

The Great TPU Migration: Anthropic and Meta Cut Inference 65%

Originally published on AI Tech Connect.

What's actually happening For most of the current AI cycle, the default answer to "what runs your model?" has been a single word: Nvidia. That is now loosening. Reporting indicates that three high-profile operators — Midjourney, Anthropic and Meta — are migrating inference workloads from Nvidia GPUs toward Google's tensor processing units. The same reporting attributes cost cuts of roughly 65% to the move. That 65% figure deserves an immediate flag, and we will keep flagging it through this piece because it is the number most likely to be misused. It comes from secondary reporting, not from a primary disclosure by any of the three companies. Nobody has published an audited before-and-after. So read it as "cost cuts of roughly 65%, according to reporting" — an indicative figure for very…


Read the full article on AI Tech Connect →

Top comments (0)