Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can't reproduce the same results as the paper on LSUN churches dataset. #19

Open
ljh0v0 opened this issue Jul 23, 2023 · 0 comments
Open

Comments

@ljh0v0
Copy link

ljh0v0 commented Jul 23, 2023

Thank you so much for this work. Currently, I'm trying to reproduce the results in the paper. I kept all the hyperparameters the same as the source code of this repository.

I trained the 1st stage vq-gan model for 2.2M steps, the same as the pre-train model you provided. Then I did the reconstruction on the whole LSUN churches dataset and calculated the FID, using calc_VQGAN_FID.py. As training went by, the FID score went down at first but increased after 0.8M steps, as shown in Figure 1 below.
Fig1_1-stage_vqgan

Then I trained the 2nd stage diffusion model for 2.0M steps. I sampled 50K images with a temperature of 0.9 and calculated the FID score. The FID score of my local reproducing experiments is 5.53, which has a gap with the FID of 4.07 reported in the paper.
Fig2_2-stage_vqgan

I did all my local experiments on a single NVIDIA V100 Volta GPU.

So I just wonder if the hyperparameter setting in the hparams/defaults/ folder is what you used to get the final results? Or are there any tricks that I might neglect? And do you have any clue about the weird increase when training the 1st-stage vq-gan models?

Thank you so much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant