Skip to content

Migrate train_on_inputs to sft-specific params #297

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

connermanuel
Copy link
Contributor

this PR adjusts the behavior of train_on_inputs.
if train type is SFT, we include this parameter in the TrainingMethod and default it to auto.
if train type is DPO, we default to None and raise if parameter is supplied.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

formatting only

@connermanuel connermanuel force-pushed the cmanuel/eng-24978-move-train_on_inputs-to-the-parameters-of-sft-training branch from 4b502d1 to 8ef8769 Compare May 8, 2025 21:25
@connermanuel connermanuel force-pushed the cmanuel/eng-24978-move-train_on_inputs-to-the-parameters-of-sft-training branch from 8ef8769 to 1ce13d6 Compare May 8, 2025 22:03
)
train_on_inputs = "auto"

if dpo_beta is not None and training_method != "dpo":

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Other option might be just a warning. What do you think?

Copy link
Contributor

@artek0chumak artek0chumak left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.


if train_on_inputs is None and training_method == "sft":
log_warn_once(
"train_on_inputs is not set for SFT training, it will be set to 'auto' automatically"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
"train_on_inputs is not set for SFT training, it will be set to 'auto' automatically"
"train_on_inputs is not set for SFT training, it will be set to 'auto'"

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants