
Ensure RewardConfig is backwards compatible #748

Merged: 1 commit merged into main on Sep 18, 2023
Conversation

@lewtun (Member) commented on Sep 8, 2023

In #726 we replaced RewardTrainer.args with a dedicated RewardConfig that collects all reward-model hyperparameters (in particular max_length) in a single class.

Unfortunately, that implementation wasn't backwards compatible: RewardTrainer.args was previously of type transformers.TrainingArguments, and that object doesn't have the max_length attribute. As a result, the part of the logic that checks for the presence of args.max_length would throw an error for any user who hadn't updated their code to use RewardConfig.

This PR fixes that by first checking the type of RewardTrainer.args and raising a warning if it's transformers.TrainingArguments.

Happy to add a unit test for this, but I felt it was a bit of overkill.
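The compatibility check described above can be sketched as follows. This is a simplified illustration of the pattern, not the PR's actual code; stand-in classes replace transformers.TrainingArguments and trl's RewardConfig so the snippet is self-contained, and the warning text is illustrative:

```python
import warnings


class TrainingArguments:
    """Stand-in for transformers.TrainingArguments (has no `max_length`)."""


class RewardConfig(TrainingArguments):
    """Stand-in subclass that adds the reward-model hyperparameters."""

    def __init__(self, max_length=None):
        self.max_length = max_length


def resolve_max_length(args, max_length=None):
    # A plain TrainingArguments has no `max_length` attribute, so check
    # for the exact type first and fall back to a warned default instead
    # of raising an AttributeError.
    if type(args) is TrainingArguments:
        if max_length is None:
            warnings.warn(
                "No `max_length` given; defaulting to 512. Pass a "
                "`RewardConfig` to set it explicitly.",
                UserWarning,
            )
            max_length = 512
        return max_length
    if max_length is not None and args.max_length is not None:
        raise ValueError(
            "You cannot specify both `max_length` and `args.max_length`."
        )
    if max_length is None:
        max_length = args.max_length if args.max_length is not None else 512
    return max_length
```

Old code that passes a TrainingArguments keeps working (with a deprecation-style warning), while new code configures max_length through RewardConfig.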


HuggingFaceDocBuilderDev commented Sep 8, 2023

The documentation is not available anymore as the PR was closed or merged.

" It will be set to `512` by default, but you should do it yourself in the future.",
UserWarning,
)
max_length = 512
lewtun (Member, Author):

Side remark for a later PR, but I think this should ideally be set to be the model's max context size as the default
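For the side remark above, a best-effort lookup of the model's context size might look like the sketch below. The attribute name varies by architecture; the candidate names are common in Hugging Face model configs, but the list is illustrative, not exhaustive, and `SimpleNamespace` stands in for a real `model.config`:

```python
from types import SimpleNamespace


def model_context_size(config, default=512):
    """Best-effort lookup of a model's maximum context length.

    Tries common config attribute names in order and falls back to
    `default` if none is present.
    """
    for name in ("max_position_embeddings", "n_positions", "seq_length"):
        value = getattr(config, name, None)
        if value is not None:
            return value
    return default


# Stand-in for a model config (a real one would come from model.config):
cfg = SimpleNamespace(max_position_embeddings=2048)
```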

@younesbelkada (Contributor) left a comment:

Thanks a lot! I left two comments, otherwise it's looking great. Thanks for taking care of the backward compatibility, Lewis!

"You cannot specify both `max_length` and `args.max_length`. Please use the `RewardConfig` to set `max_length` once."
)
if max_length is not None and args.max_length is None:
if type(args) == TrainingArguments:
younesbelkada (Contributor):
Suggested change
- if type(args) == TrainingArguments:
+ if isinstance(args, TrainingArguments):

lewtun (Member, Author):

I tried this initially, but realised it won't work: args here is an instance of a subclass of TrainingArguments, so isinstance(args, TrainingArguments) is always true and the if-statement would always fire. See e.g. this: https://stackoverflow.com/questions/1549801/what-are-the-differences-between-type-and-isinstance

Happy to refactor to a try / except clause if you prefer :)
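The distinction lewtun describes is easy to demonstrate with minimal stand-in classes (the real ones live in transformers and trl):

```python
class TrainingArguments:
    """Stand-in for transformers.TrainingArguments."""


class RewardConfig(TrainingArguments):
    """Stand-in for trl's RewardConfig subclass."""


args = RewardConfig()

# `isinstance` is True for subclasses, so it cannot tell a RewardConfig
# apart from a plain TrainingArguments:
print(isinstance(args, TrainingArguments))  # True

# An exact-type check is only True for the base class itself:
print(type(args) is TrainingArguments)  # False
print(type(TrainingArguments()) is TrainingArguments)  # True
```

This is why the PR deliberately uses the exact-type check rather than isinstance.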

younesbelkada (Contributor):

I see, perfect. Thanks a lot, Lewis, for explaining!

max_length = 512
if max_length is None and args.max_length is not None:
max_length = args.max_length
if type(args) == TrainingArguments:
younesbelkada (Contributor):
Suggested change
- if type(args) == TrainingArguments:
+ if isinstance(args, TrainingArguments):

@younesbelkada merged commit 9a8d52c into main on Sep 18, 2023
@younesbelkada deleted the fix-rm-args branch on September 18, 2023 at 11:54
@younesbelkada (Contributor) commented after the merge:

Thanks again for your work on this, @lewtun!

kushal-tri pushed a commit to kushalarora/trl that referenced this pull request Sep 19, 2023
lapp0 pushed a commit to lapp0/trl that referenced this pull request May 10, 2024
3 participants