That's a really good point. It's easy to overlook how much processing power is involved in training the network. I'm also really impressed by how DeepMind were able to break the problem down into tasks that could be distributed across many processing units and run in parallel.