You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
First of all, fantastic work! Thank you for your valuable contribution to the open source community.
I'm interested in the GRPO training process but currently lack sufficient GPU resources to conduct the training myself. In this regard, I have a few questions:
Are there any GRPO-trained models available for public use,I didn't find it on huggingface。
I'm particularly curious about the model's inference process after GRPO training, specifically:
Whether reflection capabilities can emerge in a 7B-sized model
The minimum VRAM requirements for evaluating models on a single GPU (I currently have access to 48GB GPU memory)
Thank you again for your dedication to the open source community. Your work is greatly appreciated!
Best regards
The text was updated successfully, but these errors were encountered:
Ethereal-sakura
changed the title
Could you provide the model trained by GPRO?
Could you provide the model trained by GRPO?
Jan 27, 2025
First of all, fantastic work! Thank you for your valuable contribution to the open source community.
I'm interested in the GRPO training process but currently lack sufficient GPU resources to conduct the training myself. In this regard, I have a few questions:
Are there any GRPO-trained models available for public use,I didn't find it on huggingface。
I'm particularly curious about the model's inference process after GRPO training, specifically:
Thank you again for your dedication to the open source community. Your work is greatly appreciated!
Best regards
The text was updated successfully, but these errors were encountered: