-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FLUX error when loading with low_cpu_mem_usage=False and ignore_mismatched_sizes=True #9343
Comments
Why would you want to not use |
Thanks for your reply. |
Will take a look tomorrow. |
cc @SunMarc |
I have the same question, is there any progress on this issue? |
There is a simple method that is save_pretrained FluxTransformer2DModel in one big *.safetensors file. |
Please try to understand scope of the issue, first. Here we're talking about supporting the loading of big checkpoints (which should usually be sharded) with A checkpoint that is sharded means it's usually big and hence it's sharded. So, setting
This effectively doubles the model loading time and is also prone to OOMs. |
it can be an issue when accelerate was not installed. as in that case, diffusers/src/diffusers/models/modeling_utils.py Lines 533 to 540 in 86bcbc3
|
Same problem here. Fixed after install |
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored. |
@PromeAIpro @zodiacg sorry for the delay. Would you maybe open to contributing a PR? Cc: @SunMarc here too. #10013 is highly relevant I guess. |
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored. |
Let's keep a close eye on #10604 |
Describe the bug
I'd like to change the input layers of FLUX for training some img2img stuff, but got:
TypeError: expected str, bytes or os.PathLike object, not NoneType
when loading
FluxTransformer2DModel
withlow_cpu_mem_usage=False, ignore_mismatched_sizes=True
.Reproduction
Logs
System Info
NVIDIA A100 80GB PCIe, 81920 MiB
NVIDIA A100 80GB PCIe, 81920 MiB
NVIDIA A100 80GB PCIe, 81920 MiB
NVIDIA A100 80GB PCIe, 81920 MiB
NVIDIA A100 80GB PCIe, 81920 MiB
NVIDIA A100 80GB PCIe, 81920 MiB
NVIDIA A100 80GB PCIe, 81920 MiB
Who can help?
@sayakpaul @DN6
The text was updated successfully, but these errors were encountered: