Add necessary compile flag
rocking5566 committed Jul 10, 2024
1 parent b7ffdbf commit 3b62d48
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions setup.py
@@ -317,6 +317,7 @@ def validate_and_update_archs(archs):
"nvcc":
[
"-O3","-std=c++17",
"-mllvm", "-enable-post-misched=0",
"-DCK_TILE_FMHA_FWD_FAST_EXP2=1",
"-fgpu-flush-denormals-to-zero",
"-DCK_ENABLE_BF16",
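
Note on the flag list above: `-mllvm` takes the argument that follows it and forwards it to LLVM, so `"-mllvm"` and `"-enable-post-misched=0"` have to stay adjacent and in that order. The following is a minimal sketch of one way to add such a pair to a flag list without duplicating it. The helper `append_flag_group` and the variable names are hypothetical and do not come from setup.py, and the position of the pair within the list is chosen only for illustration.

```python
# Hypothetical sketch: the helper and variable names are illustrative only,
# they are not taken from the repository's setup.py.
def append_flag_group(flags, group):
    """Return a copy of `flags` with `group` appended, unless `group`
    already appears in `flags` as a contiguous run."""
    flags = list(flags)
    group = list(group)
    n = len(group)
    for i in range(len(flags) - n + 1):
        if flags[i:i + n] == group:
            return flags  # the group is already present, leave the list as-is
    return flags + group


# Flags copied from the hunk, minus the "-mllvm" pair.
nvcc_flags = [
    "-O3", "-std=c++17",
    "-DCK_TILE_FMHA_FWD_FAST_EXP2=1",
    "-fgpu-flush-denormals-to-zero",
    "-DCK_ENABLE_BF16",
]

# "-mllvm" consumes the next argument, so the two strings are added together.
nvcc_flags = append_flag_group(nvcc_flags, ["-mllvm", "-enable-post-misched=0"])

# Same dict shape as the hunk: flags keyed by compiler name.
extra_compile_args = {"nvcc": nvcc_flags}
```

Calling the helper a second time with the same pair leaves the list unchanged, which keeps repeated configuration passes from piling up duplicate options.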
