adam optimize #1
Comments
I think β1,t ← 1 − (1 − β1)λ^(t−1) becomes close to 1 and makes the momentum degenerate (β1 = 0.1 here). This particular ADAM code was based on version 2 of the paper, which had β1,t ← 1 − (1 − β1)λ^(t−1).
But I think λ should be 1 − 1e-8, not 1e-8. Why should the momentum degenerate?
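To make the two candidate values of λ concrete, here is a minimal sketch that evaluates the schedule β1,t = 1 − (1 − β1)λ^(t−1) quoted above for the first few steps. The function name `beta1_schedule` and the choice of three steps are illustrative assumptions, not code from the repository.

```python
def beta1_schedule(beta1, lam, steps):
    """Return [beta1_t for t = 1..steps], with beta1_t = 1 - (1 - beta1) * lam**(t - 1)."""
    return [1.0 - (1.0 - beta1) * lam ** (t - 1) for t in range(1, steps + 1)]


# beta1 = 0.1 as in the comment above; both lambda values come from the thread.
small_lambda = beta1_schedule(0.1, 1e-8, 3)
near_one_lambda = beta1_schedule(0.1, 1 - 1e-8, 3)

print("lambda = 1e-8     :", small_lambda)
print("lambda = 1 - 1e-8 :", near_one_lambda)
```

With λ = 1e-8 the value jumps from β1 to almost exactly 1 at the second step, whereas with λ = 1 − 1e-8 it stays very close to β1 over these steps.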
Hi, I have just updated the repository to merge everything into gran.py. Also, make sure the full path to the data is given correctly and ends with dataset = ',,/,,/,,/preprocessed_100/'. Could you also let me know at which epoch it fails? (If it fails from epoch 0 onwards, it is likely a path problem, I think.) Did you print out the samples at every epoch to see whether they make sense? Could you check whether it works on CIFAR-10? It might be the preprocessing part that is causing the problem. We also tried the LSUN "living room and kitchen" dataset and it works fine; we will upload the samples shortly. :) Chris On 20 May 2016 at 11:00, houruibing notifications@github.com wrote:
It works fine on CIFAR-10; on LSUN it fails at epoch 0.
Hi :) Can you show me what you get? On 20 May 2016 at 21:30, houruibing notifications@github.com wrote:
It is epoch 0. I also found that on CIFAR-10 it works when I set b1=0, but it fails when I use this update algorithm: `class Adam(Update):`
I don't think it is the optimizer's problem, because ours works fine. I suspect the cause is hyper-parameter tuning. Our pre-processed versions of LSUN churches and living room + kitchen work fine. As you said, GRAN works well on CIFAR-10, so it is probably not the optimization method. If you strongly believe that the problem comes from the optimizer, you could also try different optimization methods.
In optimize_gan.py, the ADAM function has the parameter l=1e-8. I wonder whether this is wrong, because b1_t will become close to 0.
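The concern can be illustrated with a small sketch. It assumes, purely for illustration, that the code decays the coefficient as b1_t = b1 * l**(t − 1); the thread does not show the actual update, so this form, the helper name `decayed_b1`, and the value b1 = 0.9 are all hypothetical.

```python
def decayed_b1(b1, l, t):
    """Assumed decay form (not confirmed by the thread): b1_t = b1 * l**(t - 1)."""
    return b1 * l ** (t - 1)


b1 = 0.9  # hypothetical starting value, chosen only for illustration

# With l = 1e-8 the coefficient is essentially zero from the second step on.
collapsed = decayed_b1(b1, 1e-8, 2)

# With l = 1 - 1e-8 it is still close to b1 after 1000 steps.
preserved = decayed_b1(b1, 1 - 1e-8, 1000)

print("l = 1e-8,     t = 2    :", collapsed)
print("l = 1 - 1e-8, t = 1000 :", preserved)
```

Under that assumption, l = 1e-8 removes the momentum term almost immediately, while l = 1 − 1e-8 leaves it nearly unchanged over many updates.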