
Add optimizers and schedules to RTD and updated the corresponding part in the website #799

Merged
cli99 merged 10 commits into deepspeedai:master from cli99:cheng/rtd on Mar 11, 2021

Conversation

@cli99 (Contributor) commented on Feb 26, 2021

This PR fixes #625.

@ShadenSmith (Contributor)

Thanks a ton @cli99! I hastily put together a CPU Adam page for the ZeRO-3 release, but your PR is much better. I'm not able to open a PR against your branch from here; can you incorporate this commit to revert my changes? We just need to remove cpu-adam.rst and its reference in index.rst.

ShadenSmith@eb349f8
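
For readers landing here from the docs: the pages this PR adds to RTD and the website describe the optimizers and LR schedules that DeepSpeed builds from its JSON config. Below is a minimal sketch of that flow, assuming a recent DeepSpeed where `deepspeed.initialize` accepts a config dict via the `config` argument; the model and hyperparameter values are illustrative and not taken from this PR.

```python
import torch
import deepspeed

# Illustrative two-layer model; any torch.nn.Module works here.
model = torch.nn.Sequential(torch.nn.Linear(32, 32), torch.nn.ReLU(), torch.nn.Linear(32, 2))

# The optimizer and LR schedule are selected in the DeepSpeed config,
# which is what the new RTD/website pages document.
ds_config = {
    "train_batch_size": 8,
    "optimizer": {
        "type": "Adam",  # e.g. Adam, AdamW, OneBitAdam, Lamb
        "params": {"lr": 1e-3, "weight_decay": 0.01},
    },
    "scheduler": {
        "type": "WarmupLR",  # e.g. WarmupLR, WarmupDecayLR, OneCycle
        "params": {"warmup_min_lr": 0.0, "warmup_max_lr": 1e-3, "warmup_num_steps": 1000},
    },
}

# deepspeed.initialize builds the optimizer and scheduler from the config
# and returns (engine, optimizer, dataloader, lr_scheduler).
model_engine, optimizer, _, lr_scheduler = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)
```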

cli99 merged commit e0f36ed into deepspeedai:master on Mar 11, 2021
cli99 deleted the cheng/rtd branch on March 25, 2021, 21:18
jeffra added a commit to jeffra/DeepSpeed that referenced this pull request Aug 25, 2021
* set adamw_mode default true (follows FusedAdam and < 0.3.11 logic) (deepspeedai#844)

* less scary overflow notice (deepspeedai#833)

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>

* Add optimizers and schedules to RTD and updated the corresponding part in the website (deepspeedai#799)

* add optimizers and schedules to rtd

* update ds website and fix links

* add optimizers and schedules to rtd

* update ds website and fix links

* add flops profiler to rtd

* fix

Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>

* small tweaks (deepspeedai#839)

* Control ZeRO wall clock timers (deepspeedai#849)

* Control ZeRO wall clock timers

* Disable more ZeRO3 debug prints

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>

* [WarmupDecayLR] fix log(0) & 1/log(1) bugs (deepspeedai#772)

* fix log(0) & 1/log(1) bugs

* simplify

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Reza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
Co-authored-by: Cheng Li <pistasable@gmail.com>

* bump to v0.3.12

* Bug fix: Remove client optimizer param_group list item that does not have 'params' (deepspeedai#827)

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>

* [doc] pipeline doc typos/improvements (deepspeedai#659)

Admin merging for pure-doc PR that does not trigger build.

* Samyamr/inference hook fix (deepspeedai#851)

* Fix mis-aligned-grad

When a parameter is not divisible by the world size, the partitioned gradients are mis-aligned due to incorrect padding handling. This PR should fix that.

* Formatting fix

* Adding the static_scale test back for Z3, and also changing the hidden size so it is not divisible by world_size

* also removing alignment from flat fp16 buffers

* Testing for hidden dim alignment

* inference hook fix

* Update stage3.py

* formatting

* [bug-fix] move params to gpu if offload params is turned off

Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>

* ZeRO Stage 2: Clear reduced gradients (deepspeedai#856)

* Ensure gradients of other partitions are cleared after reduction

* Remove redundant code

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>

* Squash stage3 v1 (deepspeedai#146)

Co-authored-by: Samyam <samyamr@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: eltonzheng <eltonz@microsoft.com>

* formatting fix (deepspeedai#150)

* stage3 bugfix (API) update and simplified FP16 Z3 tests (deepspeedai#151)

* fp16 Z3 API update and bugfix

* revert debug change

* docs

* filling in allocation docs

* better assumption docs

* doc progress

* config json

* major docs edits

* auto registration works for accessed cases

* working on small models.

* debugging large-model discovery?

* fix discovery to first forward pass?

* return obj ext param

* support None parameters in auto-discovery

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Cheng Li <pistasable@gmail.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Reza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: eltonzheng <eltonz@microsoft.com>
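
On the [WarmupDecayLR] log(0) & 1/log(1) item in the commit message above: a log-scaled warmup that divides by log(warmup_num_steps) blows up for degenerate inputs. The sketch below is a hedged illustration of the failure mode and the kind of guard the fix applies; it is not the library's code, and `log_warmup_factor` is a made-up name.

```python
import math

def log_warmup_factor(step: int, warmup_num_steps: int) -> float:
    """Illustrative log-scaled warmup factor in [0, 1].

    Naively computing log(step + 1) / log(warmup_num_steps) fails when
    warmup_num_steps == 1 (division by log(1) == 0) and when the log
    argument reaches 0 (math domain error). Clamping the inputs, which is
    the spirit of the fix above, avoids both.
    """
    warmup_num_steps = max(2, warmup_num_steps)  # keep log(warmup_num_steps) > 0
    step = max(0, step)                          # keep log(step + 1) defined
    return min(1.0, math.log(step + 1) / math.log(warmup_num_steps))
```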
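On the param_group bug fix (deepspeedai#827) above: a client optimizer may carry param_groups entries with no 'params' key, which DeepSpeed cannot wrap. A minimal sketch of the filtering idea follows, with a hypothetical helper name rather than the merged code.

```python
def drop_empty_param_groups(param_groups):
    """Keep only optimizer param groups that actually carry parameters.

    Groups added purely for bookkeeping (names, lr multipliers, ...) have no
    'params' entry to partition, so they are dropped before DeepSpeed wraps
    the client optimizer.
    """
    return [g for g in param_groups if "params" in g and len(g["params"]) > 0]
```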
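On the mis-aligned-grad item above: when a parameter's element count is not divisible by the data-parallel world size, an unpadded flat buffer no longer splits into equal per-rank partitions. Here is an illustrative padding sketch in a torch-only toy setting; `pad_to_world_size` is a made-up helper, not the ZeRO-3 implementation.

```python
import torch

def pad_to_world_size(flat: torch.Tensor, world_size: int) -> torch.Tensor:
    """Zero-pad a flat tensor so it splits into world_size equal partitions.

    Without this padding, a tensor whose element count is not divisible by
    world_size produces per-rank slices whose boundaries drift away from the
    owning rank -- the mis-alignment described above.
    """
    remainder = flat.numel() % world_size
    if remainder == 0:
        return flat
    pad = torch.zeros(world_size - remainder, dtype=flat.dtype, device=flat.device)
    return torch.cat([flat, pad])

# Example: 10 elements across 4 ranks -> padded to 12, each rank owns 3.
padded = pad_to_world_size(torch.arange(10, dtype=torch.float32), world_size=4)
partition_size = padded.numel() // 4
rank0 = padded.narrow(0, 0 * partition_size, partition_size)
```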
B06901052 pushed a commit to B06901052/DeepSpeed that referenced this pull request Apr 14, 2022
Add optimizers and schedules to RTD and updated the corresponding part in the website (deepspeedai#799)

* add optimizers and schedules to rtd

* update ds website and fix links

* add optimizers and schedules to rtd

* update ds website and fix links

* add flops profiler to rtd

* fix

Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
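
Since the commits above also add the flops profiler to RTD, a short usage sketch may help. It assumes the flops-profiler API roughly as currently documented (`get_model_profile` taking a model and an `input_shape`), so treat the exact signature and return values as an assumption rather than a guarantee.

```python
import torch
from deepspeed.profiling.flops_profiler import get_model_profile

# Illustrative model to profile.
model = torch.nn.Sequential(torch.nn.Linear(128, 256), torch.nn.ReLU(), torch.nn.Linear(256, 10))

# Assumption: get_model_profile accepts a model plus an input_shape and, with
# as_string=True, returns human-readable (flops, macs, params) strings.
flops, macs, params = get_model_profile(
    model=model,
    input_shape=(4, 128),  # a batch of 4 illustrative 128-dim inputs
    print_profile=True,    # print the per-module breakdown
    as_string=True,
)
print(flops, macs, params)
```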


Successfully merging this pull request may close these issues.

[website] link to schedulers is broken
