
use duck-typing to ensure underlying optimizer supports schedulefree hooks #3055

Merged
merged 2 commits into huggingface:main on Sep 2, 2024

Conversation

@tmm1 (Contributor) commented on Aug 27, 2024

The new duck-typing approach in huggingface/transformers#30079 is causing build failures over there because of the code path through accelerate:

/usr/local/lib/python3.10/site-packages/transformers/trainer.py:3448: in training_step
    self.optimizer.train()
/usr/local/lib/python3.10/site-packages/accelerate/optimizer.py:128: in train
    return self.optimizer.train()
E   AttributeError: 'AdamW' object has no attribute 'train'

We need to replicate that duck-typing logic here, ensuring this code path is safe to call for all optimizers.
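
For reference, a minimal sketch of the kind of duck-typed guard being added to accelerate's optimizer wrapper; this is simplified, and the merged code in accelerate/optimizer.py may differ in detail:

    class AcceleratedOptimizer:
        """Simplified stand-in for accelerate's optimizer wrapper (sketch only)."""

        def __init__(self, optimizer):
            self.optimizer = optimizer

        def train(self):
            # Forward the hook only if the wrapped optimizer actually defines it
            # (e.g. schedule-free optimizers). A plain torch.optim.AdamW does not,
            # so this becomes a no-op instead of raising AttributeError.
            if hasattr(self.optimizer, "train") and callable(self.optimizer.train):
                return self.optimizer.train()

        def eval(self):
            if hasattr(self.optimizer, "eval") and callable(self.optimizer.eval):
                return self.optimizer.eval()

With a guard like this, Trainer can call self.optimizer.train() and self.optimizer.eval() unconditionally, and the calls only reach optimizers that implement the schedule-free hooks.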

cc @amyeroberts @muellerzr @winglian #2631

@HuggingFaceDocBuilderDev commented

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@tmm1 (Contributor, Author) commented on Aug 29, 2024

friendly ping cc @amyeroberts @muellerzr

@muellerzr (Collaborator) left a comment

Thanks, solution makes sense to me. cc @BenjaminBossan

@BenjaminBossan (Member) left a comment

Thanks, LGTM.

Just for my understanding: This is necessary because in transformers, we want to do stuff like:

    if hasattr(self.optimizer, "eval") and callable(self.optimizer.eval):
        self.optimizer.eval()

In accelerate, it was assumed that optimizer.train() and optimizer.eval() would only be called if the underlying optimizer supports them, but with the proposed change to transformers they are called whenever the method exists on the wrapper, which breaks that assumption.

IMO this could be confusing to debug and it would be better if there were a dedicated method to check this, like: if optimizer.supports_train_eval_mode() or so. But overall, the proposed solution is also okay.
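
For illustration, a rough sketch of the dedicated check being suggested; supports_train_eval_mode() is hypothetical and does not exist in accelerate:

    # Hypothetical sketch only; the method name follows the suggestion above.
    class AcceleratedOptimizer:
        def __init__(self, optimizer):
            self.optimizer = optimizer

        def supports_train_eval_mode(self) -> bool:
            # True only for wrapped optimizers (e.g. schedule-free) that expose
            # both hooks; callers can then branch explicitly instead of relying
            # on hasattr checks against the wrapper.
            return callable(getattr(self.optimizer, "train", None)) and callable(
                getattr(self.optimizer, "eval", None)
            )

Trainer could then write "if self.optimizer.supports_train_eval_mode(): self.optimizer.eval()", which makes the intent explicit.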

@amyeroberts commented

IMO this could be confusing to debug and it would be better if there were a dedicated method to check this, like: if optimizer.supports_train_eval_mode() or so. But overall, the proposed solution is also okay.

Agreed - this would be a nicer way to handle it!

@muellerzr (Collaborator) commented

@BenjaminBossan while that's a good idea, there's the problem of minimum accelerate versions. We can do this: in a follow-up I'll add supports_train_eval(), and then on the Trainer side we can do some version-dependent if/else until the minimum Trainer requirement is accelerate 1.0+ (since that's probably the release it would land in).
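
A hypothetical sketch of the version-gated path described here; supports_train_eval() and the 1.0.0 cutoff are illustrative only and not part of any released API:

    from packaging import version

    import accelerate


    def optimizer_eval(optimizer):
        # Once the minimum accelerate requirement reaches the release that ships
        # the dedicated check, use it; until then, fall back to duck-typing.
        if version.parse(accelerate.__version__) >= version.parse("1.0.0"):
            if optimizer.supports_train_eval():  # hypothetical follow-up API
                optimizer.eval()
        elif hasattr(optimizer, "eval") and callable(optimizer.eval):
            optimizer.eval()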

@muellerzr merged commit 1d09a20 into huggingface:main on Sep 2, 2024
25 checks passed