Could you provide the model trained by GRPO? #64

Ethereal-sakura · 2025-01-27T01:54:47Z

First of all, fantastic work! Thank you for your valuable contribution to the open-source community.

I'm interested in the GRPO training process but currently lack sufficient GPU resources to conduct the training myself. In this regard, I have a few questions:

Are there any GRPO-trained models available for public use? I couldn't find any on Hugging Face.
I'm particularly curious about the model's inference process after GRPO training, specifically:
- Whether reflection capabilities can emerge in a 7B-sized model
- The minimum VRAM requirements for evaluating models on a single GPU (I currently have access to 48GB GPU memory)

Thank you again for your dedication to the open-source community. Your work is greatly appreciated!

Best regards

Ethereal-sakura changed the title ~~Could you provide the model trained by GPRO?~~ Could you provide the model trained by GRPO? Jan 27, 2025
