Simplify device count external API calls

Currently there are many external APIs related getting the number of devices associate with PyTorch XLA. Those that I could find were:

- "global_runtime_device_count": returns the total number of devices across all processes/hosts, but it has "@functools.lru_cache()"
- "global_device_count": returns the total number of devices across all processes/hosts, but it has "@functools.lru_cache()"
- "addressable_runtime_device_count": Access number of [addressable devices](https://github.com/pytorch/xla/blob/r2.7/torch_xla/csrc/init_python_bindings.cpp#L15026) visible to a process.
- "addressable_device_count": Access number of [addressable devices](https://github.com/pytorch/xla/blob/r2.7/torch_xla/csrc/init_python_bindings.cpp#L1481) visible to a process. It specifically returns 1 in case of SPMD.
- "local_device_count": takes the number of [addressable devices](https://github.com/pytorch/xla/blob/01b5408dded9bf5bdea3e59c387b3b201a2bdab9/torch_xla/csrc/init_python_bindings.cpp#L1486) and multiplies it by the number of local [process counts](https://github.com/pytorch/xla/blob/r2.7/torch_xla/runtime.py#L129). Equivalent of the answer of the number of devices running on a host.

From these, some existing observations are:
- `addressable_runtime_device_count` and `addressable_device_count` are extremely similar in implementation and name. Perhaps we should make the distinction more clear. Perhaps there is some context around `addressable_device_count` particular I don't fully grasp.
- `local_device_count` terminology can be confusing when compared with JAX's concept for local devices for [jax.local_devices](https://docs.jax.dev/en/latest/_autosummary/jax.local_devices.html). `local_device_count` being the number of devices in the host, while JAX's definition is of devices in the process
- We should deduplicate `global_runtime_device_count` and `global_device_count`, just have one reference the other to remove multiple calls

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Simplify device count external API calls #9199

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Simplify device count external API calls #9199

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions