Add _no_split_modules
to some models
#10308
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Adds some of the missing
_no_split_modules
for internal blocks to Allegro, Hunyuan, CogVideoX and CogView. The tests seem to be passing for the ones modified here. Will look into the remaining in a separate PR - it seems to be happening on models that have ascale_shift_table
or similar learnednn.Parameter
The number of layers in the test was modified because if we don't, all layers end up on same device and the test fails (because it expects both devices to be used)