[Dev] Bring Block Reduction into our search space and policy #132

Merged
LeiWang1999 merged 151 commits into microsoft:main on Aug 5, 2024

LeiWang1999
Contributor

This pull request makes several changes to the bitblas module, covering new functionality, performance optimizations, and refactoring. The most important changes are a new deprecation decorator, modified tensorcore policies, enhancements to the general_matmul module, and updates to the ladder_permutate and lop3_permutate modules.

Code Enhancements and New Features:

  • bitblas/__init__.py: Added a deprecation decorator that marks functions as deprecated and emits a warning when they are called.
  • bitblas/module/__init__.py: Introduced the unpack_qweight function for unpacking quantized weights and updated the load_and_transform_weight method to use it.
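
For readers unfamiliar with the pattern, a deprecation decorator of the kind described in the first bullet can be sketched as follows. This is a minimal illustration built on the standard `warnings` module, not the code added in this PR: the decorator name `deprecated`, its `reason` parameter, and the example function `old_api` are all assumptions made for the sketch.

```python
import functools
import warnings


def deprecated(reason):
    """Return a decorator that marks a function as deprecated.

    NOTE: the name `deprecated` and the `reason` parameter are assumptions
    for this sketch; the decorator in bitblas/__init__.py may differ.
    """

    def decorator(func):
        @functools.wraps(func)  # keep the wrapped function's name and docstring
        def wrapper(*args, **kwargs):
            # stacklevel=2 attributes the warning to the caller, not the wrapper
            warnings.warn(
                f"Call to deprecated function {func.__name__} ({reason}).",
                category=DeprecationWarning,
                stacklevel=2,
            )
            return func(*args, **kwargs)

        return wrapper

    return decorator


# Hypothetical function, used only to show the decorator in action.
@deprecated("use new_api instead")
def old_api(x):
    return x * 2
```

Calling `old_api(21)` still returns the function's normal result, and each call also emits a `DeprecationWarning`. Python hides this category by default outside `__main__` and test runners, so it may need to be enabled, for example with `warnings.simplefilter("always")`, before it becomes visible.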

Tensorcore Policy Optimizations:

General Matmul Module Enhancements:

Ladder Permutate and LOP3 Permutate Module Updates:

@LeiWang1999 LeiWang1999 merged commit 2e60d2b into microsoft:main Aug 5, 2024
6 checks passed