[Release/2.6] Fix batch norm vectorize path accuracy issue by enforcing shape alignment #1241

Merged
xytintel merged 2 commits into release/2.6 from xyt/fix_bn_vec_acc_error
Jan 3, 2025

Conversation

xytintel
Contributor

@xytintel xytintel commented Jan 2, 2025

To implement a kernel that covers all shapes with vectorized LD/ST, we have to handle both a non-aligned head (base address) and a short tail (tail < vector size). Before this commit, head handling was missing, which led to vectorized STs at addresses that were not aligned to the vector size.
This fix enforces shape alignment for the vectorized path: the vectorized kernel can be called only when the feature dim is divisible by the vector size.
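
For reviewers, the alignment arithmetic behind this restriction can be sketched as follows. This is a minimal illustration, not the kernel code from this PR: the helper names `split_head_body_tail` and `can_use_vectorized_path` are hypothetical, offsets are counted in elements, and the sketch assumes a contiguous layout with the feature dim innermost and a vector-aligned tensor base address.

```python
def split_head_body_tail(base_offset: int, numel: int, vec_size: int):
    """Split a run of `numel` elements starting at `base_offset` into
    (head, body, tail) element counts.

    head: scalar elements before the first vector-aligned offset
    body: elements that can be covered by aligned vector LD/ST
    tail: leftover scalar elements (fewer than vec_size)
    """
    # Distance from base_offset up to the next multiple of vec_size.
    head = min((-base_offset) % vec_size, numel)
    body = ((numel - head) // vec_size) * vec_size
    tail = numel - head - body
    return head, body, tail


def can_use_vectorized_path(feature_dim: int, vec_size: int) -> bool:
    """Hypothetical dispatch guard: take the vectorized path only when every
    row starts on a vector boundary and has no short tail."""
    return feature_dim % vec_size == 0


if __name__ == "__main__":
    vec_size = 4
    for feature_dim in (6, 8):
        print(f"feature_dim={feature_dim}, "
              f"vectorizable={can_use_vectorized_path(feature_dim, vec_size)}")
        for row in range(3):
            # Row `row` of a contiguous [N, feature_dim] tensor starts here.
            offset = row * feature_dim
            print("  row", row,
                  split_head_body_tail(offset, feature_dim, vec_size))
```

With `feature_dim=6` and `vec_size=4`, row 1 starts at element offset 6 and needs a 2-element scalar head before any aligned vector store is possible, which is the case the old vectorized path did not handle. With `feature_dim=8`, every row has an empty head and tail, so the guard lets the vectorized kernel run without extra handling.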

@xytintel xytintel requested a review from fengyuan14 January 2, 2025 07:45
@xytintel xytintel requested a review from ZhiweiYan-96 January 2, 2025 08:20
@xytintel xytintel merged commit fe4ac6a into release/2.6 Jan 3, 2025
2 of 3 checks passed
@xytintel xytintel deleted the xyt/fix_bn_vec_acc_error branch January 3, 2025 01:59
2 participants