-
-
Notifications
You must be signed in to change notification settings - Fork 142
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Any suggestion to modify the arch based on the gan training result? #161
Comments
I plan to try again in this field in the next month or so. In my application, the generator (SR model) has VRAM usage and inference time limitations so the model cannot be made complex. The other problem I see is that the disc_weight keeps decreasing, I plan to add an option to use a fixed weight instead of using adaptive weight. Also, because of these problems, I have only run up to epoch=40-80(with --num-samples 25000) in my current release of the model. |
Could you briefly explain what l3v1 discriminator is? |
It is defined in |
About the adaptive weight. When adaptive weight is small, does it mean that the gradient of the generator loss is much greater than the gradient of the reconstruction loss? So its purpose is to limit the gan so that it does not change too fast to break the psnr model, right? |
Ideally, yes.
However, in practice, when the discriminator gets stronger, weight decrease and psnr increases. I referenced taming-transformers for GAN loss (L1+LPIPS+GAN), so I adopted adaptive weight. |
I see you use Hinge loss for discriminator. Have you tried other loss like Wasserstein Loss? |
No, I do not have much GAN experience. |
Does waifu2x support self-supervised discriminator? |
I have tried an autoencoder type self supervised discriminator(refer FastGAN) but I have not commit the model code. |
I'm training a photo swin_unet_2x model using gan. I use a cosine lr scheduler with init lr = 1e-5. After some tries I found that the discriminator loss fluctuated around 0.8 (the threshold for generator training to begin), so I increased discriminator lr to 5e-5. And this is the result.
After ~160 epochs the gen loss start to increase which trade off with the decrease of discr loss, which is not good. My guess is that maybe the gen model is undergoing some underfitting? So could you give me some suggestion to modify the arch to make the model more complex? Or should I try with a different arch?
The text was updated successfully, but these errors were encountered: