Skip to content

Comments

Big science#1382

Closed
jeffra wants to merge 24 commits intomasterfrom
big-science
Closed

Big science#1382
jeffra wants to merge 24 commits intomasterfrom
big-science

Conversation

@jeffra
Copy link
Collaborator

@jeffra jeffra commented Sep 21, 2021

No description provided.

Shaden Smith and others added 24 commits June 6, 2021 11:27
* removes repeated overflow log

* pipe_replicated

* _pipe_replicated -> ds_pipe_replicated

* Adds send/recv fallback to bcast when torch version <= 1.8
…er (#1263)

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
* Use mpu in DeepSpeedConfig() call

* Improve argument naming
* FP16 fused and unfused grad norm query.

* API for obtaining global unclipped gradient norm across parameter groups

* Use global norm not group norms

Co-authored-by: Shaden Smith <shaden.smith@microsoft.com>
* restore fp16 params if no zero ckpts available

* formatting
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
@jeffra
Copy link
Collaborator Author

jeffra commented Sep 28, 2021

closing in favor of #1407

@jeffra jeffra closed this Sep 28, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants