
Loss raise to abnormal and batchsize #124

Open
Hong-yu-Zhang opened this issue Nov 9, 2022 · 5 comments

Comments

@Hong-yu-Zhang

The loss rises to several million after 50 epochs (it is normal before epoch 50). Also, why can I only use a batch size of 2 on an RTX 3090 during training? Anything larger runs out of memory.

@HLImg

HLImg commented Nov 13, 2022

I have the same problem. The device I used is an RTX 3090 Ti. After 200 epochs, both the char loss and the edge loss grow gradually.

@jidongkuang

I'm in the same situation as you. How can I solve it?

@HLImg

HLImg commented Dec 3, 2022

> I'm in the same situation as you. How can I solve it?

Try clipping the gradient:

torch.nn.utils.clip_grad_norm_(self.net.parameters(), 0.01)

@jidongkuang

Could you tell me where to put this code?

@HLImg

HLImg commented Dec 4, 2022

> Could you tell me where to put this code?

loss.backward()
# clip after backward() and before step(), so the clipped gradients are applied
torch.nn.utils.clip_grad_norm_(model_restoration.parameters(), 0.01)
optimizer.step()
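For context, here is a minimal, self-contained sketch of a training step with gradient clipping in that position. The model, data, and the `max_norm=0.01` threshold are illustrative, not taken from the repository; `clip_grad_norm_` rescales all gradients in place so their combined norm does not exceed the threshold.

```python
import torch
import torch.nn as nn

# Toy stand-ins for the real model and data (illustrative only)
model = nn.Linear(8, 1)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = nn.MSELoss()
x, y = torch.randn(4, 8), torch.randn(4, 1)

optimizer.zero_grad()
loss = criterion(model(x), y)
loss.backward()
# Clip AFTER backward() and BEFORE step():
# rescales gradients so their total L2 norm is at most 0.01
torch.nn.utils.clip_grad_norm_(model.parameters(), 0.01)
optimizer.step()

# The total gradient norm is now bounded by the clipping threshold
grad_norm = torch.sqrt(sum(p.grad.pow(2).sum() for p in model.parameters()))
print(float(grad_norm) <= 0.01 + 1e-6)
```

Note that clipping bounds the update size but does not address the root cause of the divergence; lowering the learning rate after the loss becomes unstable is another common remedy.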
