-
Notifications
You must be signed in to change notification settings - Fork 27.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support logging_ratio
, save_ratio
, and eval_ratio
(like for warmup_ratio
)
#23171
Comments
We already have 96 training arguments though, and that would make three more of all users to learn :-/ |
Another option would be to allow But yes there is a tradeoff with too much complexity |
I would prefer that solution actually, even if the naming is not perfect. |
Going over the code a bit, it seems like we would have to wait until after this if statement in transformers/src/transformers/trainer.py Lines 1683 to 1711 in 04ab560
At first glance, it looks like the setting of when things get logged/saved/evaluated in DefaultFlowCallback should work out-of-the-box with this change.I'm willing to contribute the changes once I find some time, does the general plan sound reasonable? |
Yes it does. I'll be looking forward to your PR! |
Can someone clarify if this ratio means the % of training steps per epoch or the total training steps? If the latter, how do we know preemptively the total number of epochs (or total number of training steps) that the model is going to train for? |
Feature request
I would love if
TrainingArguments
and the HuggingfaceTrainer
would supportlogging_ratio
,save_ratio
, andeval_ratio
arguments (complementinglogging_steps
,save_steps
, andeval_steps
). If the*_ratio
argument is set to e.g.0.1
, logging/saving/eval would be done every0.1 * total_training_steps
. This is already done forwarmup_ratio
andwarmup_steps
.Motivation
When dealing with many different tasks and datasets, it can be frustrating to have to calculate different appropriate
logging_steps
etc. for each individual dataset. This proposal would enable a unified, simple and concise way to solve this problem.Your contribution
I realize this might not be trivial to fully integrate, but hopefully, we can take
warmup_steps
andwarmup_ratio
as a reference. Depending on how deep the required changes are, I can also submit a PR (with some pointers on what to look out for).The text was updated successfully, but these errors were encountered: