Environment: MXNet 1.8, ResNet-50 in fp16, BytePS 0.2.5.post15.
Training produces NaN, as shown below. (Lowering lr from 0.1 to 0.001 makes the NaN disappear, but the loss then does not seem to decrease.)
With fp32, however, the NaN does not occur. Is there a problem with fp16 in BytePS?
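
For context, one general way fp16 can produce NaN where fp32 does not is overflow: fp16 has a maximum finite value of 65504, so a sum that fits in fp32 can become inf in fp16, and inf - inf then yields NaN. The sketch below only illustrates that numeric effect with NumPy. The gradient values and the `sum_in_dtype` helper are hypothetical, and whether anything like this happens in the MXNet or BytePS code path here is an assumption, not something confirmed.

```python
import numpy as np

# Largest finite fp16 value (65504).
FP16_MAX = float(np.finfo(np.float16).max)

def sum_in_dtype(values, dtype):
    """Accumulate values one by one in the given dtype (hypothetical helper)."""
    total = dtype(0)
    with np.errstate(over="ignore", invalid="ignore"):
        for v in values:
            total = dtype(total + dtype(v))
    return total

# Hypothetical gradient values from 4 workers; not taken from the actual run.
grads = [30000.0, 30000.0, 30000.0, 30000.0]

total_fp16 = sum_in_dtype(grads, np.float16)  # exceeds FP16_MAX, becomes inf
total_fp32 = sum_in_dtype(grads, np.float32)  # stays finite in fp32

with np.errstate(invalid="ignore"):
    nan_from_inf = total_fp16 - total_fp16    # inf - inf gives nan

print(total_fp16, total_fp32, nan_from_inf)
```

If overflow of this kind were the cause, it would be consistent with the NaN disappearing at a smaller lr, but that is only a guess from the symptoms above.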