Remove model move to support low mem usage in SPMD-FSDP #13

Merged (1 commit) on Sep 3, 2024

lausannel
Contributor

In the current SPMD-FSDP implementation, the model is moved by the following code regardless of whether low memory usage mode is enabled:

https://github.com/AlibabaPAI/xla/blob/63e20fb8e02243a27fc2486670a85d7b4ab03c1b/torch_xla/experimental/spmd_fully_sharded_data_parallel.py#L100

However, moving the model may cause materialize_module to fail when low memory usage mode is enabled, so this PR removes the model move.
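
To make the control flow concrete, here is a minimal toy sketch in plain Python. It does not use torch or torch_xla, and every name in it (ToyModule, materialize, wrap_old, wrap_new) is hypothetical. The failure condition simply encodes the statement above that moving the model before materialization can make it fail; it is not meant to describe the real mechanism inside materialize_module. The shape of wrap_new (the wrapper no longer moves the model) is also an assumption based on the PR title.

```python
class ToyModule:
    """Hypothetical stand-in for a model; not a torch.nn.Module."""

    def __init__(self, materialized):
        self.materialized = materialized
        # Assumption: an unmaterialized model has no real storage yet.
        self.device = "cpu" if materialized else "meta"
        self.moved_before_materialize = False

    def to(self, device):
        # Remember a move that happened before parameters were materialized.
        if not self.materialized:
            self.moved_before_materialize = True
        self.device = device
        return self


def materialize(module, device):
    """Toy analogue of materialize_module; fails if the module was moved too early."""
    if module.moved_before_materialize:
        raise RuntimeError("module was moved before materialization")
    module.materialized = True
    module.device = device
    return module


def wrap_old(module, device, low_mem):
    # Behaviour described in this PR: the move happens unconditionally.
    module.to(device)
    if low_mem:
        materialize(module, device)
    return module


def wrap_new(module, device, low_mem):
    # Assumed behaviour after this PR: the wrapper does not move the model.
    if low_mem:
        materialize(module, device)
    return module
```

In this toy, wrap_old with low_mem=True raises because the move precedes materialization, while wrap_new with low_mem=True materializes successfully. In the non-low-memory case, wrap_new leaves device placement to the caller, which is again an assumption of the sketch.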

lausannel merged commit a0c893b into main on Sep 3, 2024
3 checks passed