float8 upcoming feature tracker #556

Open
vkuzo opened this issue Jul 30, 2024 · 0 comments
vkuzo commented Jul 30, 2024

configurability

  • [planned] support rowwise/blockwise scaling granularity, configurable separately for each gemm
  • [planned] configure settings for each of the three gemms in linear fwd/bwd separately
  • [planned] support more fine-grained configuration of how to apply Float8Linear to individual modules
  • [planned] inference support (see [RFC] Float8 Inference pytorch-labs/float8_experimental#314)
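
To make the scaling-granularity items above concrete, here is a minimal plain-Python sketch of how tensorwise and rowwise scales differ, and of what a per-gemm setting could look like. It is illustrative only and not the library's API: `tensorwise_scale`, `rowwise_scales`, `scales_for`, and `GEMM_GRANULARITY` are hypothetical names, and the sketch assumes the e4m3 format, whose largest finite value is 448.

```python
# Illustrative sketch only; none of these names come from the float8 library.
E4M3_MAX = 448.0  # largest finite value representable in float8 e4m3fn


def _scale_from_amax(amax, fp8_max=E4M3_MAX):
    # Map the largest observed magnitude onto the largest float8 value.
    # Falling back to 1.0 for an all-zero input is an assumption of this sketch.
    return fp8_max / amax if amax > 0 else 1.0


def tensorwise_scale(x, fp8_max=E4M3_MAX):
    """One scale shared by the whole 2D tensor (given as a list of rows)."""
    amax = max(abs(v) for row in x for v in row)
    return _scale_from_amax(amax, fp8_max)


def rowwise_scales(x, fp8_max=E4M3_MAX):
    """One scale per row, so a large value in one row does not affect the others."""
    return [_scale_from_amax(max(abs(v) for v in row), fp8_max) for row in x]


# The three gemms of a linear layer's forward and backward passes:
#   output      = input @ weight^T
#   grad_input  = grad_output @ weight
#   grad_weight = grad_output^T @ input
# Hypothetical per-gemm configuration; the keys and values are assumptions.
GEMM_GRANULARITY = {
    "output": "rowwise",
    "grad_input": "tensorwise",
    "grad_weight": "tensorwise",
}


def scales_for(x, gemm_name, config=GEMM_GRANULARITY):
    """Pick the scaling routine configured for the named gemm."""
    if config[gemm_name] == "rowwise":
        return rowwise_scales(x)
    return [tensorwise_scale(x)]


if __name__ == "__main__":
    x = [[1.0, -2.0], [0.5, 4.0]]
    print(scales_for(x, "output"))      # per-row scales
    print(scales_for(x, "grad_input"))  # a single shared scale
```

Blockwise scaling would follow the same pattern with one scale per block of elements; the block shape is left out here because the tracker does not specify it.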

performance

distributed

  • [in progress] integrate with FSDP2 using 16-bit or 8-bit all-gather, with delayed scaling for weights
    • POC is done, performance optimizations are ongoing
  • [planned] verify integration with pipeline parallelism (PP)
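
For readers unfamiliar with delayed scaling, the general idea can be sketched as follows: the scale used in the current step comes from absolute-maximum (amax) values recorded in earlier steps, not from the current tensor. This is a plain-Python sketch of the general technique, not the library's implementation; `DelayedScaler`, its `history_len` parameter, and the first-step fallback of 1.0 are all assumptions made for illustration.

```python
from collections import deque

E4M3_MAX = 448.0  # largest finite value representable in float8 e4m3fn


class DelayedScaler:
    """Hypothetical helper: derives each step's scale from past amax values."""

    def __init__(self, history_len=16, fp8_max=E4M3_MAX):
        # Bounded window of amax values observed in previous steps.
        self.history = deque(maxlen=history_len)
        self.fp8_max = fp8_max

    def scale_for_step(self, values):
        # Use only amax values from earlier steps to pick the scale.
        # With no usable history yet, fall back to 1.0 (an assumption of this sketch).
        past_amax = max(self.history) if self.history else 0.0
        scale = self.fp8_max / past_amax if past_amax > 0 else 1.0
        # Record the current amax so later steps can use it.
        self.history.append(max(abs(v) for v in values))
        return scale


if __name__ == "__main__":
    scaler = DelayedScaler(history_len=2)
    for step_values in ([1.0, -4.0], [2.0], [8.0], [1.0]):
        print(scaler.scale_for_step(step_values))
```

Because the scale does not depend on the current step's values, it is available before the weights are gathered, which is what makes it a natural fit for casting weights ahead of an 8-bit all-gather.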

other

copied from pytorch-labs/float8_experimental#187
