Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sync main into release/2.6 branch #1117

Merged
merged 5 commits into from
Nov 22, 2024
Merged

Sync main into release/2.6 branch #1117

merged 5 commits into from
Nov 22, 2024

Conversation

xytintel
Copy link
Contributor

Reset to bfdbaf4

mengfei25 and others added 5 commits November 21, 2024 10:45
Fix the bug where shared memory initialization is missing in the foreach
backbone

---------

Co-authored-by: Yutao Xu <[email protected]>
Primarily adopt better tuning for `scatter-gather` kernel launch
configurations.
torch_xpu_ops_sycl_kernels leads to around 1.83GB in size on windows,
splitting it to reduce the lib size.

New libs introduced in this PR:

torch_xpu_ops_sycl_tensor_srcs
torch_xpu_ops_sycl_norm_loss_srcs
torch_xpu_ops_sycl_poly_srcs
torch_xpu_ops_sycl_dist_srcs

---------

Co-authored-by: Feng Yuan <[email protected]>
@xytintel xytintel requested a review from chuanqi129 November 22, 2024 06:14
@xytintel xytintel merged commit 1e32bbc into release/2.6 Nov 22, 2024
0 of 3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants