Merged
31 commits
c6fed8b
Fix docstring
tjruwase Jan 5, 2021
8f1b191
Make screenshots clickable for easier viewing
tjruwase Feb 10, 2021
efeea1b
Merge branch 'master' of github.com:microsoft/DeepSpeed into olruwase…
tjruwase Feb 10, 2021
cc57f64
Navigation menu in alphabetical order; More clicable screenshots
tjruwase Feb 16, 2021
ad0c999
Merge with master
tjruwase Feb 16, 2021
e0a6d66
Rename 1Cycle doc
tjruwase Feb 16, 2021
17647ad
Tweak naming
tjruwase Feb 16, 2021
8b09a4d
Merge branch 'master' into olruwase/docs
tjruwase Feb 16, 2021
5685d49
Merge branch 'master' of github.com:microsoft/DeepSpeed into olruwase…
tjruwase Mar 5, 2021
9e71fe2
Remove no longer used flag
tjruwase Mar 5, 2021
c739493
Merge branch 'olruwase/docs' of github.com:microsoft/DeepSpeed into o…
tjruwase Mar 5, 2021
cde2c9a
ZeRO3 Offload release
tjruwase Mar 8, 2021
4999829
Single GPU results
tjruwase Mar 8, 2021
6ee38ec
Rearrange figures
tjruwase Mar 8, 2021
8da1dc2
Single GPU text
tjruwase Mar 8, 2021
7fde596
tweak intro
tjruwase Mar 8, 2021
23f6779
zero3-offload section
tjruwase Mar 8, 2021
289c5dd
Merge branch 'master' of github.com:microsoft/DeepSpeed into olruwase…
tjruwase Mar 8, 2021
1aa0806
Merge branch 'master' of github.com:microsoft/DeepSpeed into olruwase…
tjruwase Apr 23, 2021
8db23f0
Add asynchronous i/o docs
tjruwase Apr 23, 2021
edcad26
ignore_unused_parameters
tjruwase May 10, 2021
734b67b
Merge branch 'master' into olruwase/docs
tjruwase May 13, 2021
139997a
Merge branch 'master' of github.com:microsoft/DeepSpeed into olruwase…
tjruwase Jun 16, 2021
8a9b137
Fix print_per_steps doc
tjruwase Jun 16, 2021
e5a2950
Merge branch 'olruwase/docs' of github.com:microsoft/DeepSpeed into o…
tjruwase Jun 16, 2021
e9ba3b7
Merge branch 'master' into olruwase/docs
tjruwase Jun 16, 2021
d66e791
Merge branch 'master' of github.com:microsoft/DeepSpeed into olruwase…
tjruwase Jul 28, 2021
756af96
Merge branch 'master' of github.com:microsoft/DeepSpeed into olruwase…
tjruwase Jul 29, 2021
63d5a2c
Document round_robin_gradients
tjruwase Jul 29, 2021
517e0a9
Tweak description
tjruwase Jul 29, 2021
e68c75c
Trigger CI
tjruwase Jul 30, 2021
7 changes: 7 additions & 0 deletions docs/_pages/config-json.md
@@ -301,6 +301,7 @@ Enabling and configuring ZeRO memory optimizations
"elastic_checkpoint" : [true|false],
"stage3_gather_fp16_weights_on_model_save": [true|false],
"ignore_unused_parameters": [true|false],
"round_robin_gradients": [true|false]
}
```

@@ -358,6 +359,12 @@ Enabling and configuring ZeRO memory optimizations
| ------------------------------------------------------------------------------------------------------------------------------------------ | ------- |
| For use with ZeRO stage 1, enable backward hooks to reduce gradients during the backward pass or wait until the end of the backward pass. | `True` |

***round_robin_gradients***: [boolean]

| Description | Default |
| ------------------------------------------------------------------------------------------------------------------------------------------ | ------- |
| Stage 2 optimization for CPU offloading that parallelizes gradient copying to CPU memory among ranks by fine-grained gradient partitioning. Performance benefit grows with gradient accumulation steps (more copying between optimizer steps) or GPU count (increased parallelism). | `False` |
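
As a minimal sketch of how this flag might be enabled, the snippet below builds a ZeRO stage 2 configuration with CPU offloading and `round_robin_gradients` turned on, then serializes it to JSON. The helper name `build_zero2_offload_config`, the batch size, and the accumulation step count are illustrative assumptions, not values taken from this change. The use of `offload_optimizer` to request CPU offloading is likewise an assumption about the surrounding configuration.

```python
import json


def build_zero2_offload_config(gradient_accumulation_steps: int = 4) -> dict:
    """Hypothetical helper: returns a config dict with round_robin_gradients enabled."""
    return {
        # Illustrative values only; tune for your model and hardware.
        "train_micro_batch_size_per_gpu": 8,
        "gradient_accumulation_steps": gradient_accumulation_steps,
        "zero_optimization": {
            "stage": 2,  # round_robin_gradients is described as a stage 2 optimization
            "offload_optimizer": {"device": "cpu"},  # assumed way to request CPU offloading
            "round_robin_gradients": True,  # default is False
        },
    }


config = build_zero2_offload_config()
config_json = json.dumps(config, indent=2)
print(config_json)
```

Per the description above, the benefit should be most visible with more gradient accumulation steps or more GPUs, so a single-GPU run without accumulation may show little difference.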

***offload_param***: [dictionary]

| Description | Default |