Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Device kernels using FP8 #2510

Merged
merged 107 commits into from
Dec 6, 2023
Merged

Device kernels using FP8 #2510

merged 107 commits into from
Dec 6, 2023

Conversation

umangyadav
Copy link
Member

depends on #2473

Device kernels doesn't have support for FP8. therefore this PR makes use of eliminate_fp8 pass to convert inputs to float and converts back final result to fp8.

I've kept target branch to #2473 PR for now for easy review but that should change to develop once #2473 gets merged.

@umangyadav umangyadav marked this pull request as ready for review December 5, 2023 01:55
Base automatically changed from rocblas_fp8 to develop December 6, 2023 01:20
@causten causten merged commit 98ef0ab into develop Dec 6, 2023
14 of 15 checks passed
@causten causten deleted the device_fp8 branch December 6, 2023 01:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
FP8 issues related to FP8 implemenation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants