Make training args fully immutable #25435

muellerzr · 2023-08-10T12:44:40Z

What does this PR do?

This PR ensures that the TrainingArguments are a fully immutable dataclass after the __post_init__ has been ran. We'll find that the tests suddenly fail now 😉 Should be merged after #25390

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@amyeroberts @sgugger

muellerzr · 2023-08-10T12:45:32Z

src/transformers/training_args.py

+
+    def __setattr__(self, name, value):
+        # Once fully through the `__post_init__`, `TrainingArguments` are immutable
+        if getattr(self, "_frozen", False):


Since adding _frozen to the dataclass args earlier would make it show up as an available option to pass through, we don't set it until the very end of __post_init__, hence the getattr instead of setting it earlier.

HuggingFaceDocBuilderDev · 2023-08-10T13:04:12Z

The documentation is not available anymore as the PR was closed or merged.

sgugger

Nice! Now to see what breaks with this 😅

amyeroberts

Nice ❄️ !

src/transformers/training_args.py

sgugger · 2023-08-11T06:59:59Z

tests/trainer/test_trainer_distributed.py

@@ -205,7 +205,7 @@ def compute_metrics(p: EvalPrediction) -> Dict:
            logger.error(p.metrics)
            exit(1)

-        trainer.args.eval_accumulation_steps = 2
+        trainer.args._set_value("eval_accumulation_steps", 2)


It's better to create a new set of training args here. People sometimes look for inspiration in our tests and we definitely don't want to advertise that method.

Done, there's also a method in dataclasses that lets us change frozen params the proper way with a new set of args overriding it

tests/trainer/test_trainer_distributed.py

muellerzr · 2023-08-15T14:28:59Z

@sgugger can you give it one final look please 😄

sgugger

Thanks, looking great!

* Make training args fully immutable * Working tests, PyTorch * In test_trainer * during testing * Use proper dataclass way * Fix test * Another one * Fix tf * Lingering slow * Exception * Clean

muellerzr added the trainer label Aug 10, 2023

muellerzr requested review from amyeroberts and sgugger August 10, 2023 12:44

muellerzr commented Aug 10, 2023

View reviewed changes

muellerzr mentioned this pull request Aug 10, 2023

Fix issue with ratio evaluation steps and auto find batch size #25436

Merged

5 tasks

sgugger approved these changes Aug 10, 2023

View reviewed changes

amyeroberts approved these changes Aug 10, 2023

View reviewed changes

muellerzr commented Aug 10, 2023

View reviewed changes

src/transformers/training_args.py Outdated Show resolved Hide resolved

muellerzr requested a review from sgugger August 10, 2023 18:18

sgugger reviewed Aug 11, 2023

View reviewed changes

muellerzr added 7 commits August 15, 2023 13:12

Make training args fully immutable

23c1f7a

Working tests, PyTorch

2312b75

In test_trainer

9e0170a

during testing

4270dab

Use proper dataclass way

3d622c0

Fix test

a943e08

Another one

02c2b62

muellerzr force-pushed the muellerzr-immutable branch from 3e3e1ee to 02c2b62 Compare August 15, 2023 13:13

muellerzr added 4 commits August 15, 2023 13:29

Fix tf

578f2e0

Lingering slow

5904048

Exception

10db575

Clean

245c7cd

muellerzr requested a review from sgugger August 15, 2023 14:28

sgugger approved these changes Aug 15, 2023

View reviewed changes

muellerzr merged commit ca51499 into main Aug 15, 2023
3 checks passed

muellerzr deleted the muellerzr-immutable branch August 15, 2023 15:47

younesbelkada mentioned this pull request Aug 23, 2023

[CI] Fix unmutable TrainingArguments issue huggingface/trl#676

Merged

echarlaix mentioned this pull request Aug 23, 2023

create trainer training args which are now immutable huggingface/optimum-intel#412

Merged

stefan-it mentioned this pull request Aug 23, 2023

Getting error dataclasses.FrozenInstanceError: cannot assign to field generation_config when executing any of the scripts in the scripts folder with default parameters. artidoro/qlora#253

Open

tomaarsen mentioned this pull request Aug 24, 2023

Use dataclasses.replace due to immutability of TrainingArguments tomaarsen/SpanMarkerNER#27

Merged

canberk17 mentioned this pull request Sep 3, 2023

TrainingArguments are now Immutable nlp-with-transformers/notebooks#118

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make training args fully immutable #25435

Make training args fully immutable #25435

muellerzr commented Aug 10, 2023

muellerzr Aug 10, 2023

HuggingFaceDocBuilderDev commented Aug 10, 2023 •

edited

Loading

sgugger left a comment

amyeroberts left a comment

sgugger Aug 11, 2023

muellerzr Aug 15, 2023

muellerzr commented Aug 15, 2023

sgugger left a comment

Make training args fully immutable #25435

Make training args fully immutable #25435

Conversation

muellerzr commented Aug 10, 2023

What does this PR do?

Before submitting

Who can review?

muellerzr Aug 10, 2023

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Aug 10, 2023 • edited Loading

sgugger left a comment

Choose a reason for hiding this comment

amyeroberts left a comment

Choose a reason for hiding this comment

sgugger Aug 11, 2023

Choose a reason for hiding this comment

muellerzr Aug 15, 2023

Choose a reason for hiding this comment

muellerzr commented Aug 15, 2023

sgugger left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Aug 10, 2023 •

edited

Loading