Skip to content
This repository has been archived by the owner on Nov 1, 2024. It is now read-only.

Re-release consolidated OPT / OPT-IML checkpoints #625

Open
suchenzang opened this issue Feb 1, 2023 · 2 comments
Open

Re-release consolidated OPT / OPT-IML checkpoints #625

suchenzang opened this issue Feb 1, 2023 · 2 comments
Assignees
Labels
enhancement New feature or request

Comments

@suchenzang
Copy link
Contributor

After #459 and #556, we can now release updated checkpoints that are consolidated from FSDP shards with different model parallelism as well. We should update all of our checkpoints as a start to help address some of the following painpoints that users are facing:

and previous issues:

We have internal consolidated versions for 2.7B and 30B to check against, and will also need to confirm that generation looks roughly sane after consolidation.

@suchenzang suchenzang added the enhancement New feature or request label Feb 1, 2023
@EIFY
Copy link

EIFY commented Feb 2, 2023

I can work around it, but could the following issue be considered related as well?

@andrewPoulton
Copy link
Contributor

Yeah looks like it, at least tangentially - the loading logic there could probably do with simplifying. It should be possible to identify naming convention by just reading the checkpoint directory.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

6 participants