-
Notifications
You must be signed in to change notification settings - Fork 100
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[P0] Can't reproduce commonsense reasoning numbers #141
Comments
Hey @dyahadila Thanks for reporting this issue. Could you share your running command? and your loss curve? I did observe sometimes DiReFT might not converge due to numeric instability when overfitting to short-generation task like commensense reasoning. Could you also try another random seed, or reduce your learning rate a little and epoch to 3 to see if it solve the issue? |
BTW, we shared commonsense reasoning logs (loss curve, final eval, stats) through wandb (link). You might find it helpful. But since there are changes in this repo as well as in the huggingface transformer repo, this could be a new issue. If you could, also try to lower your transformers version to something like 4.44.0 which is closer to the version we used before. |
I was using LoREFT, and this is my command:
identical to https://github.com/stanfordnlp/pyreft/tree/main/examples/loreft#commonsense-reasoning-tasks just with smaller batch size.. I am trying ep=6 lr = 1e-4 |
And did you install pyreft by using |
i used |
Hello!
I am trying to reproduce commonsense reasoning numbers on LLaMA with this run script: https://github.com/stanfordnlp/pyreft/tree/main/examples/loreft#commonsense-reasoning-tasks
But my eval_results.json after training is showing all zeros:
The text was updated successfully, but these errors were encountered: