Nan loss for ResNext backbone trained on cifar 100 #27

devavratTomar · 2024-01-31T09:43:13Z

Thank you for your work. While trying your code for the Resnext backbone on cifar100, I get nan values for the training loss. As mentioned in the published paper, I use the initial learning rate of 0.1 for SGD with cosine scheduling.

SaraGhazanfari · 2024-02-09T21:28:19Z

Yes, same here.
Could you please help with this?

Thanks,
Sara

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nan loss for ResNext backbone trained on cifar 100 #27

Nan loss for ResNext backbone trained on cifar 100 #27

devavratTomar commented Jan 31, 2024

SaraGhazanfari commented Feb 9, 2024

Nan loss for ResNext backbone trained on cifar 100 #27

Nan loss for ResNext backbone trained on cifar 100 #27

Comments

devavratTomar commented Jan 31, 2024

SaraGhazanfari commented Feb 9, 2024