8-bit optimizers don't work with FSDP #89
I looked at the DeepSpeed implementation before, which had a similar issue with shared weights. The problem was that the algorithm splits all tensors found in the optimizer state, which includes the quantization statistics, and this can lead to incorrect behavior. The workaround in DeepSpeed is to hide the quantization statistics by obscuring their type (putting the tensor into a list/tuple). I am not sure if the error message that you provided is related to that or not. It would be nice if we could get 8-bit Adam working for FSDP. Would you be able to provide a simple example for debugging and replication purposes? Since I will be pretty busy over the next month, I would also be very happy to guide you on how to fix this if you create a PR and provide me with error messages / stack traces. I think it would be pretty useful, since more and more people are using FSDP.
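For illustration, a minimal sketch of that "hide the quantization statistics" trick, assuming bitsandbytes-style state keys (qmap/absmax); the helper names are hypothetical and not part of the bitsandbytes or DeepSpeed API:

import torch

def hide_quant_stats(optimizer):
    # Wrap the quantization statistics in a list so code that walks the
    # optimizer state and partitions every bare tensor skips them.
    for state in optimizer.state.values():
        for key in ("qmap1", "qmap2", "absmax1", "absmax2"):
            value = state.get(key)
            if torch.is_tensor(value):
                state[key] = [value]  # isinstance(state[key], torch.Tensor) is now False

def unhide_quant_stats(optimizer):
    # Restore the bare tensors before the next optimizer step.
    for state in optimizer.state.values():
        for key in ("qmap1", "qmap2", "absmax1", "absmax2"):
            value = state.get(key)
            if isinstance(value, list):
                state[key] = value[0]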
Hey @TimDettmers, I created a gist with an example: https://gist.github.com/philschmid/99410e8bf66d34e52bb0cd5270b07989 I hope that's enough for you to test it.
I tested the example I shared. With AdamWInt8:
{'loss': 2.6643, 'learning_rate': 4.847094801223242e-05, 'epoch': 0.09}
{'loss': 2.752, 'learning_rate': 4.694189602446483e-05, 'epoch': 0.18}
{'loss': 3.1493, 'learning_rate': 4.541284403669725e-05, 'epoch': 0.28}
{'loss': 3.412, 'learning_rate': 4.3883792048929664e-05, 'epoch': 0.37}
{'loss': 3.6722, 'learning_rate': 4.0825688073394495e-05, 'epoch': 0.55}
With Adafactor:
{'loss': 2.8385, 'learning_rate': 4.847094801223242e-05, 'epoch': 0.09}
{'loss': 2.6384, 'learning_rate': 4.694189602446483e-05, 'epoch': 0.18}
{'loss': 2.5725, 'learning_rate': 4.541284403669725e-05, 'epoch': 0.28}
{'loss': 2.5757, 'learning_rate': 4.3883792048929664e-05, 'epoch': 0.37}
{'loss': 2.5297, 'learning_rate': 4.0825688073394495e-05, 'epoch': 0.55}
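For reference, a hedged sketch of how the two optimizers above can be selected through the Hugging Face Trainer; the gist may wire this up differently, and the FSDP flags here are assumptions:

from transformers import TrainingArguments

args_8bit = TrainingArguments(
    output_dir="out-adamw-int8",
    fsdp="full_shard auto_wrap",   # assumption: FSDP enabled via Trainer flags
    optim="adamw_bnb_8bit",        # bitsandbytes 8-bit AdamW
    learning_rate=5e-5,
)

args_adafactor = TrainingArguments(
    output_dir="out-adafactor",
    fsdp="full_shard auto_wrap",
    optim="adafactor",
    learning_rate=5e-5,
)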
Hi @TimDettmers, in my latest test it turns out that saving the model is the source of this issue. Specifically, the error pops up when I run this:
optim_state = FSDP.full_optim_state_dict(model, optimizer)
What this is supposed to do is assemble the entire optimizer state based on the model params. What I think is the problem is that the optimizer is in 8-bit but the model is not in 8-bit. The reason for my assumption is that the error is thrown by:
File "/share03/draj/environments/.conda/envs/yanmtt/lib/python3.9/site-packages/torch/distributed/distributed_c10d.py", line 2136, in _all_gather_base
Indeed, if you look here: https://github.com/pytorch/pytorch/blob/55daa835e97a6e742cba1f0e9d2a5c78b1615e99/torch/csrc/distributed/c10d/ProcessGroupNCCL.cpp#L2779 there is a constraint that the dtypes of the tensors must be the same, and we are not able to guarantee this for a sharded 8-bit optimizer. If we can find some way to bypass this requirement, then we are good to go. How do we overcome this issue?
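A minimal sketch of that failing save path as described above (not a verified repro; assumes torchrun with at least 2 GPUs and a recent bitsandbytes):

import os
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP
import bitsandbytes as bnb

dist.init_process_group("nccl")
torch.cuda.set_device(int(os.environ["LOCAL_RANK"]))

model = FSDP(torch.nn.Linear(4096, 4096).cuda())
optimizer = bnb.optim.Adam8bit(model.parameters(), lr=1e-4)

loss = model(torch.randn(8, 4096, device="cuda")).sum()
loss.backward()
optimizer.step()

# The optimizer state now mixes dtypes: the quantized moments are uint8 while
# their blockwise statistics (absmax) are float32. Gathering this state across
# ranks is where the error surfaces.
optim_state = FSDP.full_optim_state_dict(model, optimizer)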
I have the same issue. #323
There is another issue.
I'm not 100% sure, but this might be taken care of in PyTorch 2.0.
I encountered a similar issue using PEFT LoRA, load_in_8bit, and DeepSpeed ZeRO-3 (optimizer and parameter offload) with the Hugging Face Accelerate library. On a single GPU, training was fine as expected. If anyone has found a workaround to enable parallel training with PEFT LoRA and load_in_8bit, please let me know.
It seems that PyTorch 2 does not support 8-bit.
Is anyone still working on this? On the error @prajdabre mentioned, I find that the problem does not come from a dtype mismatch, but rather a size mismatch. With printf debugging, I noticed that this seemed to first error on the absmax1 value, with:
output_tensor.shape == Size([361496576]), output_tensor.dtype == float32
input_tensor.shape == Size([22064]), input_tensor.dtype == float32
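One hedged way to do this kind of printf debugging (an assumption about the approach, not necessarily what was done here) is to wrap the collective that replaced _all_gather_base in recent PyTorch and log shapes/dtypes before the failing call:

import torch.distributed as dist

_orig_all_gather = dist.all_gather_into_tensor  # public name that replaced _all_gather_base

def logged_all_gather(output_tensor, input_tensor, *args, **kwargs):
    # Print what is about to be gathered so the first mismatching state
    # tensor (e.g. absmax1) can be identified.
    print("output", tuple(output_tensor.shape), output_tensor.dtype,
          "input", tuple(input_tensor.shape), input_tensor.dtype)
    return _orig_all_gather(output_tensor, input_tensor, *args, **kwargs)

dist.all_gather_into_tensor = logged_all_gather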
cc @awgu
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
Noting that this issue, although stale, remains an issue: although optimization can run, a functional state dict cannot be saved with 8-bit Adam. I notice that there is a PR for FSDP functionality in #840, but it generally does not address the state-dict issue in its tests.
@Titus-von-Koeller @TimDettmers sorry to hijack this issue; I'm doing something related but not exactly the same. I'm trying to use FSDP with
The FSDP wrapping will fail at
@152334H when you were trying this, did you load the model in 4/8-bit precision? Or is the model in 32-bit precision, but you want to activate
I do not test via Hugging Face. I was in fact trying to only use an 8-bit optimiser with 32-bit weights, though, so I do not experience the int8 FlatParameter issue you do.
Hey @152334H @fabianlim @HamidShojanazeri @prajdabre @Kyeongpil @hscspring @dotsnangles @philschmid, could some of you please retest this and let us know if the particular problems that you were observing persist in the same form, or, if they differ, please post detailed logs and a description? We just released official FSDP support in the latest BNB version. However, this release was not focused on 8-bit optimizer support yet. Be sure to install with
@Titus-von-Koeller @TimDettmers I think the problem still remains even with BNB 0.43. The reason is that BNB performs optimizer steps with CUDA; the update function first switches to the gradient's device:
prev_device = pre_call(g.device)
def pre_call(device):
    prev_device = torch.cuda.current_device()
    torch.cuda.set_device(device)
    return prev_device
is_on_gpu([g, p, state1, state2, qmap1, qmap2, absmax1, absmax2])
and it checks that all of the above quantities are on the GPU. Thus, while one can move all of these quantities to GPU -> compute -> back to CPU, I'm not sure if this is the most optimal way to do things, as it will involve a lot of IO overhead.
@fabianlim Yes, you're right! Thanks for the detailed analysis, this really helps make things actionable. I'll put it on my list of things to look into, but can't promise a timeline. We have a lot on our plate in the immediate future, as there are a lot of necessary changes that need to be prioritized to make BNB more maintainable and easier to contribute to. In case you're interested in working with us on finding a solution, we would be super happy to collaborate and support you in any way!
@Titus-von-Koeller On one hand, we can work around this by loading all the quantities onto the GPU, but this will be very inefficient. On the other hand, I feel the better approach would be to run the optimizer step alongside the FSDP sharding. As we see here, the optimizer step can be run after the FSDP post-grad hook. There is a comment there saying that for CPU offload the parameters and gradients are run on CPU, but this does not have to be the case: if, during offload, we can run the optimizer step on the GPU before it gets offloaded, then this solves our problem and we do not need to shuffle params around. I have posted a comment on PyTorch asking when FSDP will start to support running
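For completeness, a rough sketch of the first (inefficient) option mentioned above, i.e. temporarily moving the offloaded parameters, gradients, and optimizer state to the GPU for the 8-bit step and back afterwards; the helper is hypothetical and pays the IO cost on every step:

import torch

def step_on_gpu(optimizer, device="cuda"):
    # Move params, grads, and optimizer state to the GPU so bitsandbytes'
    # CUDA kernels can run, then move everything back to the CPU.
    def move(to):
        for group in optimizer.param_groups:
            for p in group["params"]:
                p.data = p.data.to(to)
                if p.grad is not None:
                    p.grad.data = p.grad.data.to(to)
                state = optimizer.state.get(p, {})
                for k, v in state.items():
                    if torch.is_tensor(v):
                        state[k] = v.to(to)
    move(device)
    optimizer.step()
    move("cpu")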
I have a repro using PL DDP. Here is a semi-minimal repro (the smallest I could get it), breaking on 2x A100. Repro gist: https://gist.github.com/isaacbmiller/fc871d732d4d6a6b7ede3190a6979f40
I don't have time to work on this in the next few weeks, due to needing to prioritize other work first. I see the big benefit of enabling this use-case and will prioritize it relatively high in the next months. I'll use this thread to keep you posted. Thanks for the minimal repro, really appreciated @isaacbmiller (those are always really useful!). ❤️
EDIT: ignore the below, it does not seem to work as expected after all. Interesting behavior: initially it seems to work, but after saving and reloading the checkpoint, I get an error about mismatching types. Just noting here that 8-bit AdamW seemed to work for me on FSDP with the following accelerate config:
Does anyone know of an alternative to bitsandbytes that we can use as a drop-in replacement until this gets fixed?
@musabgultekin did you find any replacement? bnb 8-bit optimizers don't work with FSDP1 or FSDP2.
No, unfortunately.
Use torchtune with torch 2.5 for lower memory requirements, if you're fine-tuning LLMs.
@musabgultekin did you explore the torchao low-bit optimizers? Do they work with FSDP2?
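For anyone exploring that route, a hedged sketch of what using a torchao low-bit optimizer looks like; the exact import path is an assumption and depends on the torchao version (older releases expose it under torchao.prototype.low_bit_optim, newer ones under torchao.optim), and FSDP2 compatibility should be verified against the torchao docs:

import torch
from torchao.optim import AdamW8bit  # assumed path; older versions: torchao.prototype.low_bit_optim

model = torch.nn.Linear(1024, 1024).cuda()
optimizer = AdamW8bit(model.parameters(), lr=1e-4)

loss = model(torch.randn(8, 1024, device="cuda")).sum()
loss.backward()
optimizer.step()
optimizer.zero_grad()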
When I use an 8-bit ADAM with FSDP, I get an error as follows:
RuntimeError: output tensor must have the same type as input tensor
If my understanding is correct, there seems to be a casting issue. Is there any workaround for this?
TIA.