You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
regarding point 2 @akk-123, the reference model is used in evaluation mode and the main model can certainly be prepared for training via Peft and that should work as per the other trainers using peft
@kashif thanks. I use lora to train main model, and set save_steps to save weight, I found that there saved too many things
how can I only save adapter_model.bin and adapter_config.json?
self.log_metrics
to log metric now, please support more friendly wandb log like PPOTrainerThe text was updated successfully, but these errors were encountered: