Skip to content
@OpenRLHF

OpenRLHF

Open-sourced Reinforcment Learning from Human Feedback

Pinned Loading

  1. OpenRLHF Public

    An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

    Python 7.2k 694

  2. OpenRLHF-M Public

    An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.

    Python 130 6

  3. OpenRLHF-Docs Public

    3 4

Repositories

Showing 3 of 3 repositories