Jetson (aarch64) support #724

jasl · 2023-12-14T10:18:19Z

I refactored setup.py to make it work on my Jetson AGX Orin, I think it also helps for future ARM + GPU platforms

I don't want to make it complex so I just allow to set CUDA gencode from ENV, Jetson is compute_87, sm_87

setup.py

Co-authored-by: Aaron Gokaslan <[email protected]>

FenardH · 2024-05-02T09:00:39Z

Hello, thanks for the amazing job.

I installed flash attention from source using your committed setup.py with commit hash 0097ec4 for jetson Orin. The installation was completed without error and I can successfully import it in python. However, it returns all Failures when I run the unit tests with test_flash_attn.py. I don' t know if this is normal? Do we have other ways to test/check if flash attention works on Orin? Thank you.

jasl · 2024-05-02T12:04:57Z

Hello, thanks for the amazing job.

I installed flash attention from source using your committed setup.py with commit hash 0097ec4 for jetson Orin. The installation was completed without error and I can successfully import it in python. However, it returns all Failures when I run the unit tests with test_flash_attn.py. I don' t know if this is normal? Do we have other ways to test/check if flash attention works on Orin? Thank you.

Which version of Jetpack you're using? I just tried on JP 6.0 DP

jasl · 2024-05-02T12:33:43Z

I'm waiting for the JP 6.0 production release, I guess we just need to let the setup.py support sm_87

FenardH · 2024-05-02T14:37:36Z

Hello, thanks for the amazing job.
I installed flash attention from source using your committed setup.py with commit hash 0097ec4 for jetson Orin. The installation was completed without error and I can successfully import it in python. However, it returns all Failures when I run the unit tests with test_flash_attn.py. I don' t know if this is normal? Do we have other ways to test/check if flash attention works on Orin? Thank you.

Which version of Jetpack you're using? I just tried on JP 6.0 DP

Hello,

Thank you for the quick reply.

Mine is JP 5.1.2. Couple months ago, just back to the moment of release of flash attention 2, I tried to install it with setting compute_87 or sm_87 but both attmpts were failed with the same JP. Do you have any ideas about what's wrong here? Thank you again.

Best regards,

jasl · 2024-05-02T15:03:26Z

Hello, thanks for the amazing job.
I installed flash attention from source using your committed setup.py with commit hash 0097ec4 for jetson Orin. The installation was completed without error and I can successfully import it in python. However, it returns all Failures when I run the unit tests with test_flash_attn.py. I don' t know if this is normal? Do we have other ways to test/check if flash attention works on Orin? Thank you.

Which version of Jetpack you're using? I just tried on JP 6.0 DP

Hello,

Thank you for the quick reply.

Mine is JP 5.1.2. Couple months ago, just back to the moment of release of flash attention 2, I tried to install it with setting compute_87 or sm_87 but both attmpts were failed with the same JP. Do you have any ideas about what's wrong here? Thank you again.

Best regards,

I haven't tried on 5.1.x. I guess the reason is that the CUDA is too old. I have to upgrade to 6.0 because Ubuntu 18.04, CUDA 11.4, and Python 3.7 are too old to run recent versions of LLM and Stable Diffusion

FenardH · 2024-05-03T15:44:12Z

Hello, thanks for the amazing job.
I installed flash attention from source using your committed setup.py with commit hash 0097ec4 for jetson Orin. The installation was completed without error and I can successfully import it in python. However, it returns all Failures when I run the unit tests with test_flash_attn.py. I don' t know if this is normal? Do we have other ways to test/check if flash attention works on Orin? Thank you.

Which version of Jetpack you're using? I just tried on JP 6.0 DP

Hello,
Thank you for the quick reply.
Mine is JP 5.1.2. Couple months ago, just back to the moment of release of flash attention 2, I tried to install it with setting compute_87 or sm_87 but both attmpts were failed with the same JP. Do you have any ideas about what's wrong here? Thank you again.
Best regards,

I haven't tried on 5.1.x. I guess the reason is that the CUDA is too old. I have to upgrade to 6.0 because Ubuntu 18.04, CUDA 11.4, and Python 3.7 are too old to run recent versions of LLM and Stable Diffusion

Hello again, I upgraded Orin to JP 6.0 DP today and tried to install flash_attn 2 again with your fork (branch aarch64). The upgraded JP eventually did not help for the correct installation. I noticed that the CUDA gencode was with compute_90 and sm_90 while compiling instead of 87 for Orin. Could you please share more info how you install the package from source? Thank you.

jasl · 2024-05-03T15:47:56Z

Hello, thanks for the amazing job.
I installed flash attention from source using your committed setup.py with commit hash 0097ec4 for jetson Orin. The installation was completed without error and I can successfully import it in python. However, it returns all Failures when I run the unit tests with test_flash_attn.py. I don' t know if this is normal? Do we have other ways to test/check if flash attention works on Orin? Thank you.

Which version of Jetpack you're using? I just tried on JP 6.0 DP

Hello,
Thank you for the quick reply.
Mine is JP 5.1.2. Couple months ago, just back to the moment of release of flash attention 2, I tried to install it with setting compute_87 or sm_87 but both attmpts were failed with the same JP. Do you have any ideas about what's wrong here? Thank you again.
Best regards,

I haven't tried on 5.1.x. I guess the reason is that the CUDA is too old. I have to upgrade to 6.0 because Ubuntu 18.04, CUDA 11.4, and Python 3.7 are too old to run recent versions of LLM and Stable Diffusion

Hello again, I upgraded Orin to JP 6.0 DP today and tried to install flash_attn 2 again with your fork (branch aarch64). The upgraded JP eventually did not help for the correct installation. I noticed that the CUDA gencode was with compute_90 and sm_90 while compiling instead of 87 for Orin. Could you please share more info how you install the package from source? Thank you.

I don't want to make it complex (Jetson isn't popular) so the PR actually introduces an env CUDA_GENCODE so you can override it when compiling on the Jetson platform

You can use this command

MAX_JOBS=8 FORCE_BUILD=True CUDA_GENCODE='arch=compute_87,code=sm_87' pip3 wheel --wheel-dir=dist --no-deps --verbose .

jasl added 2 commits December 14, 2023 17:03

refactor setup.py get_platform() for support aarch64

a187679

Allow specify CUDA gencode from env

badd388

jasl changed the title ~~Jetson support~~ Jetson (aarch64) support Dec 14, 2023

Skylion007 reviewed Dec 29, 2023

View reviewed changes

setup.py Outdated Show resolved Hide resolved

Skylion007 reviewed Dec 29, 2023

View reviewed changes

setup.py Outdated Show resolved Hide resolved

Skylion007 reviewed Dec 29, 2023

View reviewed changes

setup.py Outdated Show resolved Hide resolved

jasl and others added 4 commits December 30, 2023 18:05

Update setup.py

1b10b26

Co-authored-by: Aaron Gokaslan <[email protected]>

Update setup.py

870a07f

Co-authored-by: Aaron Gokaslan <[email protected]>

Update setup.py

fb6e446

Co-authored-by: Aaron Gokaslan <[email protected]>

fix

0097ec4

jasl requested a review from Skylion007 December 30, 2023 10:07

tridao mentioned this pull request Mar 7, 2024

Has anyone successfully compiled this on ARM Linux (aarch64)? #879

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jetson (aarch64) support #724

Jetson (aarch64) support #724

jasl commented Dec 14, 2023 •

edited

Loading

FenardH commented May 2, 2024

jasl commented May 2, 2024

jasl commented May 2, 2024

FenardH commented May 2, 2024

jasl commented May 2, 2024 •

edited

Loading

FenardH commented May 3, 2024

jasl commented May 3, 2024

Jetson (aarch64) support #724

Are you sure you want to change the base?

Jetson (aarch64) support #724

Conversation

jasl commented Dec 14, 2023 • edited Loading

FenardH commented May 2, 2024

jasl commented May 2, 2024

jasl commented May 2, 2024

FenardH commented May 2, 2024

jasl commented May 2, 2024 • edited Loading

FenardH commented May 3, 2024

jasl commented May 3, 2024

jasl commented Dec 14, 2023 •

edited

Loading

jasl commented May 2, 2024 •

edited

Loading