Gradient Checkpointing for OpenCLIP should be optional #36

apolinario · 2022-06-02T11:01:44Z

I know hardcoding it came from me. Gradient Checkpointing can make things faster and use less VRAM, so it is very useful for some use cases, but it can break things on A100 and also breaks cutn_batches on most text-to-image implementations. Ideally it should be optional for the user.
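A minimal sketch of what "optional" could look like. `LoaderOptions`, `apply_options`, and `StandInModel` are hypothetical names for illustration, not part of this repo. The sketch assumes the loaded model exposes a `set_grad_checkpointing(enable)` method, and the default of `False` is also an assumption.

```python
from dataclasses import dataclass


@dataclass
class LoaderOptions:
    # Hypothetical option container. The default (off) is an assumption,
    # chosen so A100 and cutn_batches users are unaffected unless they opt in.
    grad_checkpointing: bool = False


def apply_options(model, options):
    """Set gradient checkpointing from the user's options instead of hardcoding it."""
    # Assumption: the model exposes set_grad_checkpointing(enable).
    setter = getattr(model, "set_grad_checkpointing", None)
    if callable(setter):
        setter(options.grad_checkpointing)
    return model


class StandInModel:
    """Stand-in for a loaded OpenCLIP model, used only for this demo."""

    def __init__(self):
        self.grad_checkpointing = False

    def set_grad_checkpointing(self, enable=True):
        self.grad_checkpointing = enable


# Usage: the user opts in explicitly.
model = apply_options(StandInModel(), LoaderOptions(grad_checkpointing=True))
```

Models without the method are returned untouched, so other perceptors would not need to know about the flag.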

More broadly, we should think about how to pass options that pertain to particular loaders/modules/perceptors without breaking the overall mocking logic.
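One possible shape for this, as a rough sketch only: keep per-loader options in a side registry keyed by a loader id, so the mocked interface itself does not change. `LOADER_OPTIONS`, `set_loader_options`, `get_loader_options`, and the `"openclip"` key are all hypothetical names, not existing API.

```python
from typing import Any, Dict, Optional

# Hypothetical registry: loader id -> options set by the user.
LOADER_OPTIONS: Dict[str, Dict[str, Any]] = {}


def set_loader_options(loader_id: str, **options: Any) -> None:
    """Record user options for one loader without touching the others."""
    LOADER_OPTIONS.setdefault(loader_id, {}).update(options)


def get_loader_options(
    loader_id: str, defaults: Optional[Dict[str, Any]] = None
) -> Dict[str, Any]:
    """Return the loader's defaults overridden by any user-set options."""
    merged = dict(defaults or {})
    merged.update(LOADER_OPTIONS.get(loader_id, {}))
    return merged


# Usage: the user turns checkpointing off for one loader only.
set_loader_options("openclip", grad_checkpointing=False)
opts = get_loader_options("openclip", {"grad_checkpointing": True})
```

A loader would read its own entry at load time, and loaders with no entry simply get their defaults back.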
