Sakana's new model isn't a model -- it's a committee of models behind one door

#orchestration #multiagent #sakana #agents

Sakana AI released Fugu, a coordinator system that sits in front of multiple frontier models, picks the best one (or combination) for each request, and returns a single answer through one connection point (sakana.ai/fugu). It looks like any other AI model to call; internally it runs a committee. The code and technical report are public (repo; technical report).

Key facts

What: Fugu routes each request across several frontier AIs and answers through a single endpoint, pitched explicitly as a hedge against depending on any one provider.
When: 2026-06-22
Primary source: read the source

No single model excels at everything. A model strong at math can struggle with creative writing; a careful, literal one can miss what a more freewheeling model catches. Fugu's approach: dispatch the math to the math specialist, the writing to the writer, and sometimes ask two or more and reconcile their outputs. Sakana says this produces better results than any single model alone. The system rests on two pieces of published research: a coordinator that manages the team, and a method for steering that team with natural-language instructions rather than rigid rules.

Fugu's launch messaging leans into the word "collective" and frames the product as a hedge against depending on a single provider -- a direct nod to the same week's defining event, when a single lab's top models were switched off by government order. The argument is straightforward: if your AI draws on a rotating panel of several models, no single shutdown, price hike, or outage can take you down. Sakana notes that Fugu reaches frontier-level results without even including the suspended models in its panel, because those models are currently inaccessible.

Think of Fugu as a general contractor rather than a single tradesperson. You don't hire the contractor to pour the concrete and wire the house personally; you hire them because they know which specialist to call for each job and how to make the pieces fit. The contractor's value is their judgment about who to call and how to combine the work -- and that judgment is the hard, valuable part. For the broader pattern of AI systems that act and coordinate rather than just answer, see our explainer on AI agents.

This release is part of a larger shift: multi-agent setups -- several AIs working together -- are collapsing from do-it-yourself science projects into single products you can just call. If that pattern holds, the unit of competition moves up a level. Instead of labs fighting to have the single best model, a layer on top treats all models as interchangeable parts and competes on how cleverly it combines them. That's good for buyers, who gain resilience and the best tool for each job by default, and unsettling for any one lab hoping to lock customers in.

The caveats are the usual ones for a fresh, self-launched product, plus one specific to this design. The performance numbers come from Sakana itself and haven't been independently checked, so the "matches the frontier" claim remains a vendor claim for now. And there's a cost question critics raised immediately: if one convenient endpoint secretly calls several paid models behind the scenes, you may pay multiple vendors at once for a single request. The convenience could carry a quiet premium. A committee gives you resilience and breadth; it can also give you a bigger bill and a coordinator whose judgment you must trust as much as you'd trust any single model.

Originally published on Ground Truth, where every claim is checked against the primary source.