[XLA:GPU] Use DeviceDescription instead of hard-coding warp size as 32 #2938

amd-jianli12 · 2025-04-22T05:06:40Z

We should query the hardware to discover its warp size.

PiperOrigin-RevId: 700787004

tensorflow/tf-build-actions@600513b [ROCm] Fix flaky gpu compiler test when building with rocm tensorflow/tf-build-actions@a35cf48 [XLA:GPU] Use DeviceDescription instead of hard-coding warp size as 32 xla@e849446 [ROCm] Pass correct warp size to Triton pipeline xla@3e7b0fe cherry-picked warp size passing to triton calls, and globally enabled warpsize=64 xla@750ad89 Fixes.

amd-jianli12 force-pushed the r2.18-rocm-enhanced-warpsize branch from c76562c to 5e84717 Compare April 23, 2025 04:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[XLA:GPU] Use DeviceDescription instead of hard-coding warp size as 32 #2938

[XLA:GPU] Use DeviceDescription instead of hard-coding warp size as 32 #2938

amd-jianli12 commented Apr 22, 2025

[XLA:GPU] Use DeviceDescription instead of hard-coding warp size as 32 #2938

Are you sure you want to change the base?

[XLA:GPU] Use DeviceDescription instead of hard-coding warp size as 32 #2938

Conversation

amd-jianli12 commented Apr 22, 2025