Originally published on AI Tech Connect.
What's actually happening For most of the current AI cycle, the default answer to "what runs your model?" has been a single word: Nvidia. That is now loosening. Reporting indicates that three high-profile operators — Midjourney, Anthropic and Meta — are migrating inference workloads from Nvidia GPUs toward Google's tensor processing units. The same reporting attributes cost cuts of roughly 65% to the move. That 65% figure deserves an immediate flag, and we will keep flagging it through this piece because it is the number most likely to be misused. It comes from secondary reporting, not from a primary disclosure by any of the three companies. Nobody has published an audited before-and-after. So read it as "cost cuts of roughly 65%, according to reporting" — an indicative figure for very…
Top comments (0)