forked from pytorch/pytorch
-
Notifications
You must be signed in to change notification settings - Fork 57
Pull requests: ROCm/pytorch
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Enable input vectorization in ewk for input tensors with heterogeneou…
#1906
opened Feb 15, 2025 by
carlobertolli
Loading…
[release/2.5] [CP] Respect ROCR_VISIBLE_DEVICES on AMD GPU device discovery (#144026)
#1895
opened Feb 12, 2025 by
jataylo
Loading…
Enable load-compute-store interleaving for unrolled elementwise kernel.
#1886
opened Feb 6, 2025 by
carlobertolli
•
Draft
[rocm6.4_internal_testing] [NAVI32] Skipped sdpa_2 test in test_aot_inductor for Navi32
#1882
opened Feb 5, 2025 by
iupaikov-amd
Loading…
[release/2.6] [ROCm] Improvements for vectorized elementwise kernels (#143269)
#1878
opened Feb 3, 2025 by
jerrymannil
Loading…
Revert "[release/2.4] fix test_pointwise_op_fusion_post_grad (#1763)"
#1865
opened Jan 30, 2025 by
dnikolaev-amd
Loading…
[Do NOT MERGE] [release/2.5] Enable tf32 testing on test_nn
#1859
opened Jan 27, 2025 by
jagadish-amd
Loading…
[ROCm] Eliminate the need for divisions in layernorm for default vector size.
#1850
opened Jan 22, 2025 by
doru1004
Loading…
[release/2.4] Update numpy versions to fix PyTorch wheel build issues
#1822
opened Jan 8, 2025 by
jithunnair-amd
•
Draft
[ROCm][WIP] Improve performance of casted elementwise add operations
#1805
opened Dec 20, 2024 by
doru1004
Loading…
[WIP][release/2.5] refactor condition to use miopen for batchnorm
#1787
opened Dec 13, 2024 by
dnikolaev-amd
•
Draft
Previous Next
ProTip!
no:milestone will show everything without a milestone.