-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove deadlock detection / process reconciliation logic #16204
Remove deadlock detection / process reconciliation logic #16204
Conversation
cafbc0c
to
f7a4b28
Compare
a37c43d
to
a043ba8
Compare
I rebased this |
⚡ Required checks status: All passing 🟢Groups summary🟢 pytorch_lightning: Tests workflowThese checks are required after the changes to 🟢 pytorch_lightning: Azure GPU
These checks are required after the changes to 🟢 pytorch_lightning: Azure HPU
These checks are required after the changes to 🟢 pytorch_lightning: Azure IPU
These checks are required after the changes to 🟢 pytorch_lightning: Docs
These checks are required after the changes to 🟢 mypy
These checks are required after the changes to 🟢 installThese checks are required after the changes to Thank you for your contribution! 💜
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pushed a commit with some leftover removals to the fsdp strategy
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
What does this PR do?
Deadlock detection / process reconciliation is a feature of the DDP strategy family. It is not documented, but it is turned on by default when ddp strategies are used, unless PL_RECONCILIATE_PROCESS=0 is set.
Part of #16410
Closes #12797
Does your PR introduce any breaking changes? If yes, please list them.
Yes, for everyone using the ddp strategy, and anyone else that enables
PL_RECONCILIATE_PROCESS=1
, the feature will no longer work. However, errorsBefore submitting
PR review
Anyone in the community is free to review the PR once the tests have passed.
Before you start reviewing make sure you have read Review guidelines. In short, see the following bullet-list:
Did you have fun?
I made sure I had fun coding 🙃
cc @Borda @justusschock @awaelchli