Skip to content

DeepSpeed: hardcode torch.arange dtype on float usage to avoid incorrect initialization#28760

Merged
gante merged 5 commits intohuggingface:mainfrom gante:deepspeed_init_fixJan 31, 2024

Commits

Commits on Jan 29, 2024

Commits on Jan 30, 2024