-
Notifications
You must be signed in to change notification settings - Fork 654
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Patches for Lightning have not kept up with backwards-incompatible changes #1249
Labels
bug
Something isn't working
Comments
Thanks for letting us know @a-gardner1. We'll take a look and update on fix availability. |
a-gardner1
added a commit
to a-gardner1/clearml
that referenced
this issue
Apr 22, 2024
I can open a PR with a proposed fix if you like. I've already implemented one |
Contributions are most welcome @a-gardner1 🙂 |
@ainoam In case you missed it, I did open a PR. Let me know if anything looks off |
Thanks for the friendly nudge @a-gardner1. We'll try to address your PR soon. |
jkhenning
pushed a commit
that referenced
this issue
May 10, 2024
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Describe the bug
While there has clearly been some effort to keep pace with changes to Lightning (see #1033), it has fallen behind since the initial patches were created (64e10b2) and new versions of Lightning were released. Unfortunately, it silently fails to apply patches to model saving and restoration, which can hide the fact that model logging doesn't fully work as expected. One of the two related (and nearly duplicate) patch methods is shown below (linked here)
Three
AttributeErrors
exist in_patch_pytorch_lightning_io
with newer versions ofpytorch-lightning
:pytorch-lightning-0.10.0
,Trainer.restore
was removed whenCheckpointConnector
was introduced and therestore
method was no longer inherited fromTrainerIOMixin
(Lightning-AI/pytorch-lightning@4724cdf)pytorch-lightning-2.0.0
,CheckpointConnector
was renamed to_CheckpointConnector
(Mark the connectors as protected Lightning-AI/pytorch-lightning#17008)pytorch-lightning-2.1.0
,_CheckpointConnector.save_checkpoint
was removed and inlined intoTrainer
([TPU] Fix PjRT tests Lightning-AI/pytorch-lightning#17408 (comment))To reproduce
No reproduction is necessary. There are multiple clear
AttributeError
s that get caught by theException
handler depending on thepytorch-lightning
version.Expected behaviour
The checkpointing mechanism of
pytorch-lightning
should have been patched to enable automatic logging of models with ClearML.Environment
The text was updated successfully, but these errors were encountered: