-
Notifications
You must be signed in to change notification settings - Fork 34
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[Dev] Append Efficient CUDA test for low precision batch decoding (#80)
* Refactor BatchMatMulEmitter and BatchMatMulSelector for improved readability and maintainability * Refactor import statements for improved readability and maintainability * Refactor import statements for improved readability and maintainability * disable failure email for ci * remove email notifications. * move relax pass from testing to mlc_llm * Refactor scripts with se check_eual_ref_scripts_with_emitter function * Lint Fix * Refactor scripts with se check_eual_ref_scripts_with_emitter function * bug fix in test * lint fix. * test cuda i4 kernel * Refactor copyright notice in i4matmul.hpp * Refactor BitBLASLinear test module for improved readability and maintainability * refactor test as version below python 3.9 cannot handle int32 overflow. * format lint for test * Refactor test_int4b_fp16_convert.py for improved readability and maintainability
- Loading branch information
1 parent
541839b
commit c6f3ca5
Showing
8 changed files
with
1,563 additions
and
58 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
# Copyright (c) Microsoft Corporation. | ||
# Licensed under the MIT License. | ||
function (ADD_CUDA_TEST_EXECUTABLE name) | ||
add_executable(${name} ${name}.cu) | ||
set_target_properties(${name} PROPERTIES CUDA_ARCHITECTURES 80) | ||
# add flags | ||
target_compile_options(${name} PRIVATE --expt-relaxed-constexpr) | ||
set_target_properties(${name} PROPERTIES | ||
CUDA_SEPARABLE_COMPILATION ON) | ||
target_link_libraries(${name} gtest gtest_main) | ||
endfunction(ADD_CUDA_TEST_EXECUTABLE) | ||
|
||
ADD_CUDA_TEST_EXECUTABLE(efficient_i4) | ||
|
||
function (ADD_CPP_TEST_EXECUTABLE name) | ||
add_executable(${name} ${name}.cpp) | ||
target_link_libraries(${name} gtest gtest_main pthread) | ||
endfunction(ADD_CPP_TEST_EXECUTABLE) | ||
|
||
ADD_CPP_TEST_EXECUTABLE(param_permutate) |
Oops, something went wrong.