Add warnings if training args differ from checkpoint trainer state #29255

jonflynng · 2024-02-23T16:50:49Z

What does this PR do?

Resolves #28867

Before submitting

- This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
  - Training arguments are not applied when resuming from a checkpoint #28867
- Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
- Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

jonflynng · 2024-02-25T23:26:38Z

src/transformers/trainer.py

@@ -1370,6 +1370,29 @@ def ipex_optimize_model(self, model, training=False, dtype=torch.float32):

        return model

+    def compare_trainer_and_checkpoint_args(self, training_args, trainer_state):
+        attributes_map = {


These are some parameters I expected to be overridden via the training_args when resuming from a checkpoint.
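
To illustrate the idea discussed here, below is a minimal, self-contained sketch of comparing training arguments against values stored in a checkpoint's `trainer_state.json` and warning on any mismatch. It is not the code from this PR: the `attributes_map` in the diff above is truncated, so the mapping entries (`logging_steps`, `eval_steps`, `save_steps`), the `ToyTrainingArgs` class, and the function names `find_arg_mismatches` and `warn_on_arg_mismatch` are all assumptions made for illustration.

```python
import logging
from dataclasses import dataclass
from typing import List

logger = logging.getLogger(__name__)

# Hypothetical mapping: training-argument name -> key in trainer_state.json.
# The real `attributes_map` is truncated in the diff, so these entries are
# assumptions for illustration only.
ATTRIBUTES_MAP = {
    "logging_steps": "logging_steps",
    "eval_steps": "eval_steps",
    "save_steps": "save_steps",
}


@dataclass
class ToyTrainingArgs:
    """Hypothetical stand-in for the real training arguments object."""

    logging_steps: int = 500
    eval_steps: int = 500
    save_steps: int = 500


def find_arg_mismatches(training_args, trainer_state: dict) -> List[str]:
    """Return one message per argument whose value differs from the checkpoint."""
    mismatches = []
    for arg_name, state_key in ATTRIBUTES_MAP.items():
        arg_value = getattr(training_args, arg_name, None)
        state_value = trainer_state.get(state_key)
        # Skip values missing on either side; only report real differences.
        if arg_value is None or state_value is None:
            continue
        if arg_value != state_value:
            mismatches.append(
                f"{arg_name}: {arg_value} (training args) != "
                f"{state_value} (trainer_state.json)"
            )
    return mismatches


def warn_on_arg_mismatch(training_args, trainer_state: dict) -> List[str]:
    """Log a single warning listing every mismatch, then return the list."""
    mismatches = find_arg_mismatches(training_args, trainer_state)
    if mismatches:
        logger.warning(
            "Arguments differ from the checkpoint trainer state:\n%s",
            "\n".join(mismatches),
        )
    return mismatches


# Example: the checkpoint was saved with logging_steps=10, but the new run asks for 50.
checkpoint_state = {"logging_steps": 10, "eval_steps": 500, "save_steps": 500}
found = warn_on_arg_mismatch(ToyTrainingArgs(logging_steps=50), checkpoint_state)
```

Collecting all mismatches before logging keeps the output to one warning per resume rather than one per argument, which is a design choice of this sketch and not necessarily what the merged code does.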

ArthurZucker · 2024-02-28T02:11:37Z

cc @muellerzr !

github-actions · 2024-03-25T08:03:45Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

ArthurZucker

This LGTM, @muellerzr fine with you?

HuggingFaceDocBuilderDev · 2024-03-25T12:02:53Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

muellerzr

Thanks! LG2M as well 🤗

…uggingface#29255)
* add warnings if training args differ from checkpoint args stored in trainer_state.json
* run formatting and styling
* add a test
* format and styling
---------
Co-authored-by: Jonathan Flynn <jonl.flynn@guardian.co.uk>

Jonathan Flynn added 4 commits February 23, 2024 16:41

a225a52 add warnings if training args differ from checkpoint args stored in trainer_state.json
79d8ede run formatting and styling
1c91572 add a test
1fbb3be format and styling

jonflynng marked this pull request as ready for review February 25, 2024 23:14

jonflynng commented Feb 25, 2024

jonflynng mentioned this pull request Feb 25, 2024

Training arguments are not applied when resuming from a checkpoint #28867

Closed

4 tasks

jonflynng changed the title from "Add warnings if training args differ" to "Add warnings if training args differ from checkpoint trainer state" Feb 26, 2024

ArthurZucker requested a review from muellerzr February 28, 2024 02:11

ArthurZucker approved these changes Mar 25, 2024

muellerzr approved these changes Mar 25, 2024

ArthurZucker merged commit b5a6d6e into huggingface:main Mar 26, 2024
21 checks passed

muellerzr mentioned this pull request Mar 26, 2024

Rework tests to compare trainer checkpoint args #29883

Merged

5 tasks
