DeepSpeed: hardcode torch.arange dtype on float usage to avoid incorrect initialization #28760
Merged
gante merged 5 commits into huggingface:main from gante:deepspeed_init_fix on Jan 31, 2024
+192 -118
Commits
Commits on Jan 30, 2024