Use mpu in zero.Init() #1325

Merged
jeffra merged 10 commits into master from olruwase/deepspeed_config_mpu
Sep 1, 2021

@tjruwase
Contributor

Calling DeepSpeedConfig() without passing the available mpu can trigger avoidable assertion failures concerning batch size and gradient accumulation steps. These failures occur because the mpu determines the world size used in those checks.

Replay of BigScience PR #1271.
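
To illustrate the failure mode described above, here is a minimal, self-contained sketch that does not import DeepSpeed. It assumes the batch-size consistency check has the form train batch size = micro batch size per GPU × gradient accumulation steps × world size, and that the world size is taken from `mpu.get_data_parallel_world_size()` when an mpu is supplied and from the global process count otherwise. `FakeMpu`, `effective_world_size`, and `check_batch_config` are hypothetical names for illustration only, not the code changed in this PR.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FakeMpu:
    """Hypothetical stand-in for a Megatron-style model parallel unit (mpu)."""
    data_parallel_size: int

    def get_data_parallel_world_size(self) -> int:
        return self.data_parallel_size


def effective_world_size(global_world_size: int, mpu: Optional[FakeMpu] = None) -> int:
    # Assumption: with an mpu, only data-parallel ranks count toward the
    # batch-size arithmetic; without one, every process is counted.
    if mpu is not None:
        return mpu.get_data_parallel_world_size()
    return global_world_size


def check_batch_config(train_batch_size: int,
                       micro_batch_per_gpu: int,
                       grad_accum_steps: int,
                       world_size: int) -> None:
    # Illustrative consistency check on the three batch-related settings.
    expected = micro_batch_per_gpu * grad_accum_steps * world_size
    assert train_batch_size == expected, (
        f"train_batch_size {train_batch_size} != "
        f"{micro_batch_per_gpu} * {grad_accum_steps} * {world_size}"
    )


# 8 processes with model parallel degree 2 give 4 data-parallel ranks.
mpu = FakeMpu(data_parallel_size=4)

# With the mpu, the check uses world size 4 and passes.
check_batch_config(64, 4, 4, effective_world_size(8, mpu))

# Without the mpu, the check uses world size 8 and fails avoidably.
try:
    check_batch_config(64, 4, 4, effective_world_size(8))
    failed_without_mpu = False
except AssertionError:
    failed_without_mpu = True
```

In this sketch the configuration itself is valid; the assertion fires only because the world size was computed without the mpu, which is the situation the PR description calls avoidable.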

@jeffra jeffra enabled auto-merge (squash) September 1, 2021 21:35
@jeffra jeffra merged commit e08c239 into master Sep 1, 2021
@mrwyattii mrwyattii deleted the olruwase/deepspeed_config_mpu branch July 7, 2023 02:40

2 participants

Comments