You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In version 0.9 the guards to calculate the gradient norms and then log the metrics can't be satisfied in the same batch unless the row_log_interval is 1. In most places the guard seems to be (batch_idx + 1) % self.row_log_interval == 0 such as here:
🐛 Bug
In version 0.9 the guards to calculate the gradient norms and then log the metrics can't be satisfied in the same batch unless the row_log_interval is 1. In most places the guard seems to be
(batch_idx + 1) % self.row_log_interval == 0
such as here:https://github.com/PyTorchLightning/pytorch-lightning/blob/b40de5464a953ff5866a255f4670d318bd8fd65a/pytorch_lightning/trainer/training_loop.py#L749-L757
However in
run_batch_backward_pass
it isbatch_idx % self.row_log_interval == 0
https://github.com/PyTorchLightning/pytorch-lightning/blob/b40de5464a953ff5866a255f4670d318bd8fd65a/pytorch_lightning/trainer/training_loop.py#L929-L939
To Reproduce
Steps to reproduce the behavior:
4 Confirm that gradients are now being logged.
Code sample
Expected behavior
Gradients should be logged if track_grad_norm is True
Environment
- GPU:
- GeForce GTX 1080 Ti
- available: True
- version: 10.2
- numpy: 1.19.1
- pyTorch_debug: False
- pyTorch_version: 1.6.0
- pytorch-lightning: 0.9.0
- tensorboard: 2.2.1
- tqdm: 4.48.2
- OS: Windows
- architecture:
- 64bit
- WindowsPE
- processor: AMD64 Family 23 Model 113 Stepping 0, AuthenticAMD
- python: 3.7.9
- version: 10.0.19041
Additional context
The text was updated successfully, but these errors were encountered: