You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In 'run_func_OCD.py', I noticed that the backbone model(loaded at lines 88~90) is not set to eval mode. Could you explain the reason for this?
Additionally, at lines 134~135, the trained diffusion model and scale model are loaded but are not set to eval mode. I am wondering why this is the case.
I attempted to run the code for reproduction purposes, but unfortunately, the results were not satisfactory. Specifically, the diffusion loss(test phase, line 173) for MNIST(LeNet5), exceeded 30.
However, after setting the backbone model to eval mode, I was able to get correct results.
This makes me think that not setting the backbone model to eval mode might have some influence on the training process of the diffusion model and the baseline accuracy during the test phase. The baseline accuracy(loss) varies with each test.
If there's something I'm misunderstanding about the code, I'd appreciate a brief explanation.
The text was updated successfully, but these errors were encountered:
What torch version you have?
This is interesting phenomena, I will check whether it should be in eval mode for newer versions of torch. Thank you for your input.
In 'run_func_OCD.py', I noticed that the backbone model(loaded at lines 88~90) is not set to eval mode. Could you explain the reason for this?
Additionally, at lines 134~135, the trained diffusion model and scale model are loaded but are not set to eval mode. I am wondering why this is the case.
I attempted to run the code for reproduction purposes, but unfortunately, the results were not satisfactory. Specifically, the diffusion loss(test phase, line 173) for MNIST(LeNet5), exceeded 30.
However, after setting the backbone model to eval mode, I was able to get correct results.
This makes me think that not setting the backbone model to eval mode might have some influence on the training process of the diffusion model and the baseline accuracy during the test phase. The baseline accuracy(loss) varies with each test.
If there's something I'm misunderstanding about the code, I'd appreciate a brief explanation.
The text was updated successfully, but these errors were encountered: