Adding support for new hardware to PyTorch is actually quite convenient. I did that with WebGPU using the same PrivateUse1 mechanism TorchTPU uses. Every backend has its own slot and identifier, and when you want to add support for a new one without merging it into PyTorch, PrivateUse1 works essentially like a plug-in slot.
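For anyone curious what "plug-in slot" means in practice, here's a minimal sketch of claiming the PrivateUse1 slot from Python. The `"webgpu"` name and the stub module are just illustrations; a real backend also registers actual kernels under the PrivateUse1 dispatch key (typically from C++), so creating tensors on the device would still fail with only this:

```python
import torch

# Claim the PrivateUse1 slot under a custom name. Any unused name works;
# this can only be done once per process.
torch.utils.rename_privateuse1_backend("webgpu")

# Register a device module so torch.webgpu.* calls have somewhere to land.
# A real backend would implement is_available, device_count, streams, etc.
class _WebGPUModule:
    @staticmethod
    def is_available():
        return True

    @staticmethod
    def device_count():
        return 1

torch._register_device_module("webgpu", _WebGPUModule)

# The custom name is now a first-class device type string.
dev = torch.device("webgpu:0")
print(dev.type, dev.index)  # webgpu 0
```

Since there's exactly one PrivateUse1 key, two out-of-tree backends can't coexist in the same process, which is the main limitation of the mechanism.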
https://github.com/jmaczan/torch-webgpu
This is great to see.
I trained some research models using the existing PyTorch/XLA on TPUs, and it was a mess of undocumented behavior and bugs (it would silently hang after 8 hours of training!).
If anyone is trying to use PyTorch on TPU before TorchTPU is released, you can check out the training pipeline that I ended up building to support my research: https://github.com/aklein4/easy-torch-tpu
The pitch basically boils down to "just change one line and it works", which sounds too good to be true, but if they actually pull it off at 100k-chip scale, that's genuinely a big deal.
Sounds good, but my main question is: is this a fork, or a new backend they're building in (like MPS)?
I attended the related session at Next'26 yesterday. From my understanding it is a new backend, and they will release the TorchTPU source on GitHub in one or two months. It will not support all ops initially, but they are moving fast. Still, for a while torchax is the more mature way to run torch models on TPUs, by translating to JAX.
They write that they use PrivateUse1, so it's a custom out-of-tree backend.
Very excited for this.
Now all that’s missing is an actual chip that can be purchased. Any ideas?
I'm thinking of picking up some used Gaudis from eBay; they're pretty TPU-like. But other than oddball hardware like that, it's just the GPU duopoly and the proprietary bespoke stuff the hyperscalers have built for themselves.
shit, maybe China will start selling Huawei Ascend chips internationally.