Issues · huggingface/trl

[Tracking issue] General dataset support

#2071 opened Sep 15, 2024 by qgallouedec

Open

[Tracking issue] Integrate native liger-kernel losses

#2495 opened Dec 17, 2024 by qgallouedec

Open 4

[Tracking issue] Wrong loss scaling when accumulating gradient

#2617 opened Jan 23, 2025 by qgallouedec

Open

171 Open 1,231 Closed

🐛 bug

#2719 opened Jan 31, 2025 by JohnConnor123

5 tasks done

🏋 GRPO 🏋 Reward

#2715 opened Jan 31, 2025 by korbinian-hoermann

🏋 GRPO 🏋 Reward

#2712 opened Jan 31, 2025 by accupham

3 tasks

🏋 DPO ✨ enhancement

#2710 opened Jan 31, 2025 by lucasjinreal

🐛 bug 🏋 GRPO ⚡ PEFT

#2709 opened Jan 31, 2025 by willccbb

⏳ needs more info ⚡ PEFT 🏋 PPO

#2707 opened Jan 30, 2025 by kooryan

✨ enhancement 🏋 GRPO

#2706 opened Jan 30, 2025 by nch0w

🏋 GRPO ❓ question

#2703 opened Jan 30, 2025 by arnavgarg1

5 tasks done

✨ enhancement 🏋 GRPO

#2702 opened Jan 30, 2025 by Superskyyy

✨ enhancement 🏋 GRPO

#2701 opened Jan 30, 2025 by Superskyyy

🏋 GRPO ⚡ PEFT

#2698 opened Jan 30, 2025 by gagan3012

5 tasks done

⚡accelerate ⚡ PEFT 🏋 PPO

#2696 opened Jan 30, 2025 by daehuikim

5 tasks done

🐛 bug 🚀 deepspeed 🏋 GRPO

#2688 opened Jan 30, 2025 by abacaj

5 tasks done

🐛 bug 🏋 GRPO

#2686 opened Jan 29, 2025 by shirinyamani

🏋 GRPO ❓ question 🏋 Reward

#2685 opened Jan 29, 2025 by shirinyamani

✨ enhancement 🏋 GRPO ⚡ PEFT

#2684 opened Jan 29, 2025 by howardzhou

🏋 GRPO ❓ question

#2681 opened Jan 29, 2025 by macheng6

✨ enhancement 🏋 GRPO

#2680 opened Jan 29, 2025 by Palmik

🐛 bug ⏳ needs more info 🏋 Reward

#2674 opened Jan 28, 2025 by Tarak200

🐛 bug 🏋 GRPO 🏋 Online DPO

#2671 opened Jan 28, 2025 by benjamin-marie

5 tasks done
