Fix lion8b error correction with torch 2.1 #656
Merged
Re-enables error correction and instead works around the new FSDP limitations.
This looks like a big diff because it reverts all the indentation changes from the PR that disabled error correction, but lion8b.py and test_lion8b.py differ by only a few lines from their state before that PR. Basically we just had to tell torch that the error tensors are bf16 and actually cast them to bf16 when saving the state dict.
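
For anyone skimming the diff, here is a minimal sketch of the idea, not the actual code from the PR; the class and attribute names (`_ErrorStateSketch`, `errs`) are illustrative:

```python
# Hypothetical sketch of the fix, not the real lion8b.py diff.
import torch


class _ErrorStateSketch:
    """Holds the error-correction tensor for one parameter."""

    def __init__(self, errs: torch.Tensor):
        # FSDP in torch 2.1 wants the declared dtype of sharded
        # optimizer state to match what actually gets saved, so we
        # treat the error tensor as bf16 throughout.
        self.errs = errs

    def state_dict(self) -> dict:
        # Actually cast to bf16 when saving, so the checkpointed
        # tensor matches the dtype we told torch to expect.
        return {'errs': self.errs.to(dtype=torch.bfloat16)}
```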