generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
PPOTrainer: fix progress bar for num_mini_batches > 1
#2531
opened Dec 29, 2024 by
dawidm
Loading…
4 tasks done
Include stop token in policy model's generation_config
#2528
opened Dec 28, 2024 by
dawidm
Loading…
2 of 5 tasks
RLOO trainer: fix calculations of steps, episodes and epochs
#2516
opened Dec 23, 2024 by
dawidm
Loading…
dpo_trainer gather metrics across ranks before logging
#2474
opened Dec 13, 2024 by
zhc7
Loading…
2 of 5 tasks
🧪 [Experimental] Train LeRobot policy with TRL
#2359
opened Nov 15, 2024 by
qgallouedec
•
Draft
5 tasks
👩🏫 Add SFT notebook for chatbot development
#2321
opened Nov 4, 2024 by
qgallouedec
•
Draft
5 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.