-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
KeyError: 'raw_prompt' #13
Comments
And there is another error.
non_pad_index = torch.nonzero(prompt_token_ids != pad_token_id, as_tuple=False)[0][0] It seems that the first paramter should not be prompt_token_ids != pad_token_id, but a Tensor. |
Hi @YixinSong-e , for the first issue, it's because you have set the input_tokenizer in reward model. This field in config will be set only when the reward model utilize different tokenizer from the actor model. So, you only need to set the For the second issue, it seems that the |
Very nice work!
I'm runing PPO using the hhrlhf datasets in verl repo. And the error is here.
The text was updated successfully, but these errors were encountered: