-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cpu: aarch64: optimising memory/thread utilization in BRGEMM Matmul #2103
cpu: aarch64: optimising memory/thread utilization in BRGEMM Matmul #2103
Conversation
@Sqvid can you please review this for the aarch64? Thanks. |
@Shreyas-fuj Thanks for your contribution. In addition to some of the comments could you also squash and re-push with a commit message that follows the conventions outlined here: https://github.com/oneapi-src/oneDNN/blob/main/CONTRIBUTING.md#code-contribution-guidelines. Thanks! |
We need a commit message linter... |
99af259
to
af69a0b
Compare
Hi @Sqvid , |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @Sqvid , I have squashed the commits in to a single one by following the guidelines. Thanks.
Thanks. (nit) commit message: hueristics -> heuristics
.
@vpirogov we can have a python script called by a dedicated github action, that fails a PR if the start of the PR does not start with a word followed by a semi colon. I can help with this, we can create an issue and I can work on it if you think it is worth it. Thoughts? |
Sounds like a plan. In ideal world I would use an off the shelf action for this, but I don't see any. The only tool I was able to find is CommitLint. |
CommitLint looks very capable for the task. The only issue I could see (which is not necessarily a bad thing) is that it allows the sub-category to be in parenthesis Either way a custom parser should be fairly easy to throw together with some regex. |
af69a0b
to
80175f7
Compare
Hi @Shreyas-fuj. This looks fine on my side. |
@theComputeKid, @Sqvid requesting for review and approval of the PR, thanks. |
@Shreyas-fuj Yes looking into now. Will revert in ~48 hours. Thanks. |
@Shreyas-fuj performance looks good to me. Just a couple of nitpicks I'd appreciate if you could address: Other than this I'm happy to approve the changes. Thanks. |
@Sqvid , I have applied the suggested changes, please have a look, thanks. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
@Shreyas-fuj Could you please rebase the changes on top of main and squash the commits? Or change the PR name to match the desired commit title. If you could do both it would be even better - merging PR with multiple commits squashes the PR (we want to avoid merge commits on release branches) and takes the commit message from PR title, which currently is too long and not in sync with our guidelines. |
df7e752
to
a957b2f
Compare
@spalicki , I have rebased and squashed the commits, and renamed the PR as suggested, thanks. |
Description
This PR brings some optimizations to brgemm matmul operator by improving memory utilization and multithreading capabilities.
This PR contains the following changes:
fadd()
instruction before storing the accumulator results in destination matrix.General
make test
andmake test_benchdnn_*
) pass locally for each commit?