Skip to content

HF2UCP: Converting a pytorch_model.bin or .safetensors checkpoint to UCP #7212

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 11 commits into
base: master
Choose a base branch
from

Conversation

Schwidola0607
Copy link

@Schwidola0607 Schwidola0607 commented Apr 10, 2025

PR for HF2UCP feature

Converting a pytorch_model.bin or .safetensors checkpoint to UCP will

  • zero initialize optimizer states (exp_avg_sq.pt and exp_avg.pt)
  • skip over copying _model_states.pt and optimizer_state.pt files as those are not available to a HF checkpoint

Schwidola0607 and others added 5 commits April 10, 2025 05:08
Signed-off-by: Schwidola0607 <[email protected]>
Signed-off-by: Schwidola0607 <[email protected]>
Signed-off-by: Schwidola0607 <[email protected]>
Signed-off-by: Schwidola0607 <[email protected]>
Signed-off-by: Schwidola0607 <[email protected]>
@Schwidola0607 Schwidola0607 marked this pull request as ready for review April 13, 2025 08:34
Schwidola0607 and others added 4 commits April 15, 2025 03:34
Signed-off-by: Schwidola0607 <[email protected]>
Signed-off-by: Schwidola0607 <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants