Questions on fine-tuning process #8
My two cents on the fine-tuning process.
Happy to answer!
Thanks for the comment! Since the release, we have retrained the model and optimized the training pipeline. So far, we have reduced the resource requirement by a factor of 2, and we are working on further reducing the training cost.
Thank you for your explanation. I am a little confused about the second answer: if a training example is padded, and the predicted response is longer than the training example, should we ignore the padded part when calculating the loss (by setting the pad tokens in the labels to -100)?
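For anyone following along, here is a minimal sketch of the masking described in the question above. It assumes the labels start as a copy of the input ids and that the loss skips positions labelled -100, which is the default `ignore_index` of PyTorch's `CrossEntropyLoss`. The function name `mask_padding` and the sample token ids are hypothetical and not taken from this repository's training code.

```python
IGNORE_INDEX = -100  # default ignore_index of torch.nn.CrossEntropyLoss


def mask_padding(input_ids, attention_mask):
    """Build labels from input_ids, replacing padded positions with IGNORE_INDEX.

    Uses the attention mask rather than the pad token id, so a real token
    that happens to share the pad id is not masked by accident.
    """
    return [
        [tok if keep == 1 else IGNORE_INDEX for tok, keep in zip(row, mask)]
        for row, mask in zip(input_ids, attention_mask)
    ]


# Hypothetical batch: token id 0 stands in for padding, purely for illustration.
input_ids = [[5, 6, 7, 0, 0], [8, 9, 10, 11, 12]]
attention_mask = [[1, 1, 1, 0, 0], [1, 1, 1, 1, 1]]

labels = mask_padding(input_ids, attention_mask)
print(labels)
```

With labels built this way, padded positions contribute nothing to the loss regardless of what the model predicts there. Whether the project's own pipeline does this is a question for the maintainers.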
I have three questions regarding the fine-tuning process.
Thank you in advance.