How to load the old model to new #42

Open
zhrli opened this issue Jan 13, 2023 · 0 comments
Labels
enhancement New feature or request

zhrli commented Jan 13, 2023

About half a year ago, I trained this model on an A100. The current version seems to have fewer parameters, which should make it easier to train. I checked the generator code and found no changes at all, but when I load the old model into the new one, the two do not match.
The parameter summaries of the two models are listed below.

Is there a recommended way to transfer the old weights to the new model efficiently?

old one:

| Name | Type | Params

0 | discriminator_loss | NowcastingLoss | 0
1 | grid_regularizer | GridCellLoss | 0
2 | conditioning_stack | ContextConditioningStack | 4.2 M
3 | latent_stack | LatentConditioningStack | 7.2 M
4 | sampler | Sampler | 42.1 M
5 | generator | Generator | 53.6 M
6 | discriminator | Discriminator | 44.7 M

98.3 M Trainable params
0 Non-trainable params
98.3 M Total params
393.086 Total estimated model params size (MB)

new one:

| Name | Type | Params

0 | discriminator_loss | NowcastingLoss | 0
1 | grid_regularizer | GridCellLoss | 0
2 | conditioning_stack | ContextConditioningStack | 1.1 M
3 | latent_stack | LatentConditioningStack | 1.8 M
4 | sampler | Sampler | 10.5 M
5 | generator | Generator | 13.4 M
6 | discriminator | Discriminator | 44.7 M

58.1 M Trainable params
0 Non-trainable params
58.1 M Total params
232.417 Total estimated model params size (MB)
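
To make the difference between the two summaries easier to read, here is a small sketch that computes the old/new parameter ratio per module. The numbers are copied from the two summaries above; the dictionary names are only illustrative. The generator sub-modules all shrink by roughly 4x while the discriminator is unchanged. One possible explanation (an assumption, not confirmed from the code) is that a channel-width hyperparameter passed to the model differs between the two runs, since convolution parameter counts scale with in_channels x out_channels.

```python
# Parameter counts in millions, copied from the two model summaries.
old_params = {
    "conditioning_stack": 4.2,
    "latent_stack": 7.2,
    "sampler": 42.1,
    "generator": 53.6,
    "discriminator": 44.7,
}
new_params = {
    "conditioning_stack": 1.1,
    "latent_stack": 1.8,
    "sampler": 10.5,
    "generator": 13.4,
    "discriminator": 44.7,
}

# Ratio of old to new size for every module present in both summaries.
ratios = {name: round(old_params[name] / new_params[name], 2) for name in old_params}

for name, ratio in ratios.items():
    print(f"{name:20s} old/new = {ratio}")
```

If the ratio really comes from a changed default, constructing the new model with the old hyperparameter values should reproduce the 53.6 M generator, and the old checkpoint would then load without any shape mismatch.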


Possible Implementation
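
One possible approach, as a minimal sketch rather than a confirmed solution: compare the old checkpoint with the new model entry by entry and reuse only the tensors whose names and shapes match. The helper `split_by_shape` below is hypothetical (it is not part of this repository) and works on any mapping whose values expose a `.shape`, which includes the dictionaries returned by PyTorch's `state_dict()`.

```python
from typing import Any, Dict, List, Mapping, Tuple


def split_by_shape(
    old_state: Mapping[str, Any],
    new_state: Mapping[str, Any],
) -> Tuple[Dict[str, Any], List[Tuple[str, tuple, tuple]], List[str]]:
    """Partition the new model's entries by whether the old checkpoint fits.

    Returns (compatible, mismatched, missing):
      compatible: name -> old tensor, for entries with identical shapes
      mismatched: (name, old_shape, new_shape) for same name, different shape
      missing:    names in the new model that the old checkpoint lacks
    """
    compatible: Dict[str, Any] = {}
    mismatched: List[Tuple[str, tuple, tuple]] = []
    missing: List[str] = []
    for name, new_tensor in new_state.items():
        if name not in old_state:
            missing.append(name)
            continue
        old_shape = tuple(old_state[name].shape)
        new_shape = tuple(new_tensor.shape)
        if old_shape == new_shape:
            compatible[name] = old_state[name]
        else:
            mismatched.append((name, old_shape, new_shape))
    return compatible, mismatched, missing


# Hypothetical usage with a PyTorch Lightning checkpoint. The path and the
# `new_model` object are assumptions, so this part is left commented out:
#
#   import torch
#   old_state = torch.load("old.ckpt", map_location="cpu")["state_dict"]
#   compatible, mismatched, missing = split_by_shape(old_state, new_model.state_dict())
#   new_model.load_state_dict(compatible, strict=False)
#   for name, old_shape, new_shape in mismatched:
#       print(name, old_shape, "->", new_shape)
```

Since the discriminator is 44.7 M in both summaries, its weights will probably end up in `compatible`, while most generator tensors will likely be reported as `mismatched`. Those cannot be copied directly; they would need retraining, or the new model would have to be built with the same sizes as the old one.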

zhrli added the enhancement label Jan 13, 2023