-
Notifications
You must be signed in to change notification settings - Fork 3.7k
DeepSpeed ZeRO Update #6546
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
DeepSpeed ZeRO Update #6546
Changes from all commits
Commits
Show all changes
70 commits
Select commit
Hold shift + click to select a range
2d3f617
Add context to call hook to handle all modules defined within the hook
99495e8
Expose some additional parameters
c3aac67
Added docs, exposed parameters
340f817
Make sure we only configure if necessary
ae02102
Merge branch 'master' into feat/ds_update
f192afc
Setup activation checkpointing regardless, saves the user having to d…
a2784a4
Add some tests that fail currently
b0dab3d
update
tchaton 0c44f05
update
tchaton 26655d7
update
tchaton ac19f36
add tests
tchaton d273393
change docstring
tchaton c91d128
resolve accumulate_grad_batches
tchaton 959d7b7
resolve flake8
tchaton f0cb6e7
Update DeepSpeed to use latest version, add some comments
914de86
add metrics
tchaton 5d16c74
Merge branch 'feat/ds_update' of https://github.com/PyTorchLightning/…
tchaton 712814c
update
tchaton a1644c6
Small formatting fixes, clean up some code
64f624f
Few cleanups
89fbbcb
No need for default state
701d417
Fix tests, add some boilerplate that should move eventually
270d6ed
Add hook removal
2b71ed8
Merge branch 'master' into feat/ds_update
a236ff0
Add a context manager to handle hook
e1f865e
Small naming cleanup
80fb792
wip
tchaton d621b1f
Merge branch 'master' into feat/ds_update
tchaton 1de2bcd
move save_checkpoint responsability to accelerator
tchaton 90d6e03
resolve flake8
tchaton b6361b8
add BC
tchaton 924d9e2
Merge branch 'feat/ds_update' of https://github.com/PyTorchLightning/…
tchaton 6acaccb
Change recommended scale to 16
f7a373e
Merge branch 'master' into feat/ds_update
tchaton 68b8a43
resolve flake8
tchaton a7dcb7b
update test
tchaton 08df0b5
Merge branch 'feat/ds_update' of https://github.com/PyTorchLightning/…
tchaton 6b08478
update install
tchaton 45a49c5
update
tchaton a8da299
update test
tchaton 99f1d96
update
tchaton 89601d8
update
tchaton eb1495e
Merge branch 'feat/ds_update' of https://github.com/PyTorchLightning/…
tchaton 389c60b
update test
tchaton de5f358
resolve flake8
tchaton 301b1aa
update
tchaton b9542ae
update
tchaton 48c0950
update on comments
tchaton c230407
Push
783265f
pull
c8f79f9
Update pytorch_lightning/plugins/training_type/deepspeed.py
tchaton 61378de
Update pytorch_lightning/plugins/training_type/deepspeed.py
tchaton 45c9569
update
tchaton deb2ea2
Apply suggestions from code review
SeanNaren 122e911
Swap to using world size defined by plugin
dfb403b
update
tchaton 9bd2821
Merge branch 'feat/ds_update' of https://github.com/PyTorchLightning/…
tchaton 066e0f0
update todo
tchaton d41284e
Remove deepspeed from extra, keep it in the base cuda docker install
0c9836c
Push
d1c511e
pull
67d31fa
update
tchaton e65aaf3
Merge branch 'feat/ds_update' of https://github.com/PyTorchLightning/…
tchaton 1740eed
update
tchaton 300f3aa
update
tchaton 40b1cc6
update
tchaton 603caf1
Minor changes
carmocca 62f67e8
duplicate
Borda 5786c4b
format
Borda 83e1343
format2
Borda File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.