Skip to content

Latest commit

 

History

History
13 lines (11 loc) · 303 Bytes

File metadata and controls

13 lines (11 loc) · 303 Bytes

DPO Training

Follow set up in setup environment Data setup:

bash setup/setup_train_data.sh

Data includes 300k frames used for SFT and DPO, plus 17k preference data.

DPO script

bash dpo_scripts/train_dpo.sh