Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Could you provide the model trained by GRPO? #64

Open
Ethereal-sakura opened this issue Jan 27, 2025 · 0 comments
Open

Could you provide the model trained by GRPO? #64

Ethereal-sakura opened this issue Jan 27, 2025 · 0 comments

Comments

@Ethereal-sakura
Copy link

Ethereal-sakura commented Jan 27, 2025

First of all, fantastic work! Thank you for your valuable contribution to the open source community.

I'm interested in the GRPO training process but currently lack sufficient GPU resources to conduct the training myself. In this regard, I have a few questions:

  1. Are there any GRPO-trained models available for public use,I didn't find it on huggingface。

  2. I'm particularly curious about the model's inference process after GRPO training, specifically:

    • Whether reflection capabilities can emerge in a 7B-sized model
    • The minimum VRAM requirements for evaluating models on a single GPU (I currently have access to 48GB GPU memory)

Thank you again for your dedication to the open source community. Your work is greatly appreciated!

Best regards

@Ethereal-sakura Ethereal-sakura changed the title Could you provide the model trained by GPRO? Could you provide the model trained by GRPO? Jan 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant