-
Notifications
You must be signed in to change notification settings - Fork 192
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
My train model using train_celeba.yml #12
Comments
As I kown ,batch size is related to the learning rate,maybe you can try to decrese your learning rate as the half. @fashionguy |
Hmmm, interesting! This is a scale ambiguity issue (bas-relief ambiguity), but the current parameter setting should converge to a good solution. I have tested it with several random seeds, and it should be rather stable. But let me try to run it again and see if there's any discrepency. Previously I have found the border depth clamping correlated with the scale. If border depth is clamped too far, it seems to drive the model into predicting a stretched face, like your results. But I can tell this is not case with your results. |
Hi, @dafuny! I have double checked the parameters and tested the code on two different machines. The results are consistent with those released. Would like to share your training details in order to investigate the issue? |
Hi,@elliottwu,it's really kind of you giving me so detailed reply.I have figured out the problem that my conduct environment cuased.Previously,I used cuda10.2,but rencently I installed cuda9.2 and set the same environment as yours.As a results,it can get a good result!The first is your result,and the following is my trained result. |
Thanks @dafuny for sharing your results! It is very interesting that CUDA 10.2 gives completely different convergence! I have never tested it on CUDA 10.2, and was not really aware of it. I might take a closer look into this at some point. Thanks for flagging this! |
These is no network initialization in this repo. Probably, this is the reason why we get totally diffrient results by using CUDA10.2 and CUDA 9.2 |
Hi!
I train a model using trian_celeba.yml provided by you without other changes, only change the batch size from 64 to 32.(because my memory has only 8G)
Now I get the trained model after 30 epoches. I run demo.py and get results using my trained model and your model respectively.
But the two results are not exactly the same(eg. the eyes). I don't know why and hope get some advices from you.
(The right is my trained model.)
The text was updated successfully, but these errors were encountered: