
wip: use torch from a wheel #9340


Draft · wants to merge 1 commit into master

Conversation

@rickeylev (Contributor) commented Jun 11, 2025

Using torch from a wheel will eliminate the ~20 minutes it takes to build from source.

A custom repo rule replicates the structure that python_init_repositories
expects (a directory with a dist/ folder containing a wheel) and adds the whl file
to the requirements.txt files.
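
For illustration, a minimal sketch of what such a repo rule could look like. The rule name, attributes, and exact file layout here are assumptions for the sketch, not the PR's actual code:

```starlark
# Hypothetical sketch only: names, attributes, and layout are illustrative.
def _torch_wheel_repo_impl(rctx):
    wheel_name = rctx.attr.url.rsplit("/", 1)[-1]

    # Download the prebuilt wheel into a dist/ directory, mirroring the layout
    # python_init_repositories expects from a local torch source build.
    rctx.download(
        url = rctx.attr.url,
        output = "dist/" + wheel_name,
        sha256 = rctx.attr.sha256,
    )

    # Point the requirements file at the downloaded wheel so the pip
    # integration installs it instead of building torch from source.
    rctx.file("requirements.txt", "./dist/{}\n".format(wheel_name))
    rctx.file("BUILD.bazel", 'exports_files(["requirements.txt"])\n')

torch_wheel_repo = repository_rule(
    implementation = _torch_wheel_repo_impl,
    attrs = {
        "url": attr.string(mandatory = True),
        "sha256": attr.string(default = ""),
    },
)
```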

TODO:

  • Handle different builds (cuda, nightly, etc.). There are thousands of pytorch wheels;
    it's unclear which combinations need to be pulled.

Work towards #9173

@bhavya01 (Collaborator)

Thanks for looking into this. It will be helpful for #9173.

@rickeylev (Contributor, PR author)

Thanks! There are a couple of questions whose answers would help me figure out the next steps:

  1. Can the ability to use a pytorch source checkout be removed entirely? It's easy to still allow using a locally built wheel.
  2. The ts_native_functions yaml and cpp files aren't in the wheel. Are they actually needed? From my local building, their absence hasn't resulted in any errors.
  3. Which versions and configurations of pytorch are needed? In order to use wheels, we'll need to specify them by URL (either via environment variables, or via a list in a .bzl file; see the sketch after this list).
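
As a hypothetical example of the ".bzl list" option from question 3, something like the following pin file could work. The file name, URL, and hash are placeholders, not real pins, and torch_wheel_repo refers to the rule sketched in the PR description above:

```starlark
# Hypothetical torch_wheels.bzl: one pinned wheel URL per build flavor.
TORCH_WHEEL_PINS = {
    "cpu": struct(
        url = "https://download.pytorch.org/whl/cpu/torch-<version>-<python-tag>-linux_x86_64.whl",
        sha256 = "<sha256-of-the-wheel>",
    ),
    # "cuda": struct(url = "...", sha256 = "..."),
    # "nightly-cpu": struct(url = "...", sha256 = "..."),
}

def register_torch_wheel(build = "cpu"):
    pin = TORCH_WHEEL_PINS[build]

    # torch_wheel_repo is the hypothetical repo rule from the sketch above.
    torch_wheel_repo(
        name = "torch_wheel",
        url = pin.url,
        sha256 = pin.sha256,
    )
```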

@iwknow (Contributor) commented Jun 16, 2025

I am also curious about which versions and configurations of pytorch this will pin to. Currently, pytorch/xla assumes the HEAD of torch's main branch, and some features rely on "unreleased" code (e.g. #8632 (comment)). Pinning pytorch to any released version will break that feature because a hashable TreeSpec is not included in any release. I believe there are many instances of this, which makes the choice of pytorch versions very limited.

@bhavya01 (Collaborator)

  1. Can the ability to use a pytorch source checkout be removed entirely? It's easy to still allow using a locally built wheel.
     I think that lazy_tensor_generator.py needs the pytorch source. I am trying to remove that dependency by copying the generated code into my repository rather than having it codegen'd, until I figure out a better solution.
  2. The ts_native_functions yaml and cpp files aren't in the wheel. Are they actually needed? From my local building, their absence hasn't resulted in any errors.
     Not sure if these are actually needed. I think we just need these generated files: https://github.com/bhavya01/playground/tree/main/torch_xla_generated_04232025/csrc
  3. Which versions and configurations of pytorch are needed? In order to use wheels, we'll need to specify them by URL (either via environment variables, or via a list in a .bzl file).
     We just need the CPU versions. For the nightly builds, we should use the nightly torch wheels. Each torch_xla stable release depends on the corresponding torch stable release.

@rickeylev (Contributor, PR author)

Thanks for the info, @bhavya01 and @iwknow

what versions of torch would this pin to

Whichever you want, for the most part. We could probably even have it automatically discover the latest nightlies and use those (I'm pretty sure, anyway; it's a bit more complicated, but I think I see a way to do it).
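
For what it's worth, a rough sketch of how a repo rule might auto-discover the newest CPU nightly from the pip-style wheel index at download.pytorch.org. The index layout and the naive string parsing here are assumptions, and Starlark has no regex, so this just scans hrefs:

```starlark
# Rough, hypothetical sketch: scan the nightly CPU index for wheel hrefs and
# download the newest matching one.
def _latest_torch_nightly_impl(rctx):
    index = "https://download.pytorch.org/whl/nightly/cpu/torch/"
    rctx.download(url = index, output = "index.html")
    html = rctx.read("index.html")

    # Keep hrefs that look like linux x86_64 wheels for one Python version,
    # dropping any #sha256=... fragment.
    candidates = sorted([
        part.split('"')[0].split("#")[0]
        for part in html.split('href="')[1:]
        if "linux_x86_64.whl" in part and "cp311" in part
    ])
    if not candidates:
        fail("no matching nightly torch wheel found in the index")

    # Nightly versions embed a date, so the lexically largest entry is the
    # newest within a given release series.
    latest = candidates[-1]

    # Hrefs may be relative to the index page or absolute paths on the host.
    url = ("https://download.pytorch.org" + latest) if latest.startswith("/") else (index + latest)
    rctx.download(url = url, output = "dist/" + latest.split("/")[-1])

latest_torch_nightly = repository_rule(implementation = _latest_torch_nightly_impl)
```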

I am also curious about which versions and configurations of pytorch this will pin to. Currently, pytorch/xla assumes the HEAD of torch's main branch, and some features rely on "unreleased" code (e.g. #8632 (comment)). Pinning pytorch to any released version will break that feature because a hashable TreeSpec is not included in any release. I believe there are many instances of this, which makes the choice of pytorch versions very limited.

oh hm, this is concerning for a couple reasons.

First, it simply prevents saving the ~20 minutes spent building torch in CI, which is about a third of the CI time. As a rule of thumb, productivity tends to plummet once presubmit checks take more than 10 minutes. The only other option is to fully bazelify torch itself -- possible in theory, but I'm skeptical of its feasibility in practice. Our experience building torch from source within Google has been hard and brittle (and my experience building torch from source outside Google isn't much better); that's unsurprising given the size and complexity of torch.

Second, if you're developing against torch head, then you're approximately locked to torch's release schedule. That seems to be what you want as a project anyway? ("Each stable torch-xla release depends on the corresponding stable torch release.")

I just want to make clear that this makes for a tough route. The problem with tracking head is that torch is very active, so it's constantly changing and almost every CI run essentially has to start from scratch. Similarly, you, as developers, pay that same large tax in your edit-run flow and are disincentivized from syncing, and it isn't obvious when you must update torch and when you shouldn't. On the infra side, it makes hopping in to address things harder because the setup is complex and volatile.
