-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
signal handling and teardown #3632
Conversation
Codecov Report
@@ Coverage Diff @@
## master #3632 +/- ##
=======================================
+ Coverage 83% 87% +4%
=======================================
Files 117 118 +1
Lines 9106 9125 +19
=======================================
+ Hits 7575 7983 +408
+ Misses 1531 1142 -389 |
This pull request is now in conflict... :( |
This pull request is now in conflict... :( |
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
3c0c678
to
d0af75f
Compare
@awaelchli why do we want this again? this is also going to break DDP... DDP .fit .test() <-------- fail because process group was killed... the way i currently have it works fine and keeps the progress groups around. This was that major bug we both spent a long time on. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i believe this will break DDP.
This is something taht we should revisit after 1.0
I'm sorry william, but this PR was before your refactor and fixes. |
no worries. i was going to let it run to see if tests did indeed fail. Then we can decide what to do |
retrigger Drone to see how it runs on multi-gpu... |
This pull request is now in conflict... :( |
What does this PR do?
Continuation of #2165
Fixes #1999
Before submitting
PR review
Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.
Did you have fun?
Make sure you had fun coding 🙃