
Spikes in training loss #189

Open
slala2121 opened this issue Feb 6, 2022 · 1 comment

Comments

@slala2121

I'm wondering if you have encountered this recurring spike in the training loss and, if so, why it might occur.

The spike does appear to occur when the last batch of each epoch is processed, but I don't think it has to do with uneven batch size, because the last batch is dropped. The data is shuffled, so it can't be any particular examples causing it. Still, it's not clear why processing the last batch would produce these spikes.

Thanks.

[Screenshot: training loss curve showing recurring spikes, captured Feb 6, 2022]
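
For reference, a minimal sketch of the setup described above, assuming a PyTorch `DataLoader` (the thread does not say which framework is used): the data is reshuffled each epoch, the last incomplete batch is dropped, and the per-step loss is logged so a spike can be matched against the epoch boundary. The dataset, model, and loss function below are placeholders.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Placeholder data: 1000 examples, 16 features, scalar targets.
dataset = TensorDataset(torch.randn(1000, 16), torch.randn(1000, 1))

# Shuffle each epoch and drop the last (incomplete) batch, as described above.
loader = DataLoader(dataset, batch_size=64, shuffle=True, drop_last=True)

model = torch.nn.Linear(16, 1)  # placeholder model
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = torch.nn.MSELoss()

for epoch in range(3):
    for step, (x, y) in enumerate(loader):
        optimizer.zero_grad()
        loss = loss_fn(model(x), y)
        loss.backward()
        optimizer.step()
        # Log epoch, step, and loss so spikes can be lined up with epoch boundaries.
        print(f"epoch={epoch} step={step} loss={loss.item():.4f}")
```

If the spikes in such a log consistently fall on the first or last step of each epoch, the cause is more likely something that happens at the epoch boundary (re-shuffling, logging/metric resets, or other per-epoch state changes) than the contents of any particular batch.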

@thecooltechguy

@slala2121 I'm actually experiencing the exact same behavior, and I'm also shuffling + dropping the last batch. Did you happen to figure out the reason or resolve this?
