Wrong invocation of PeftModelForCausalLM.generate in DPOTrainer? #877

jploski · 2023-10-16T00:03:32Z

Reporting against trl 0.7.3.dev0 and peft 0.5.0:

Apparently PeftModelForCausalLM.generate does not like positional passing of inputs, causing the following backtrace:

    dpo_trainer.train()
  File "/mnt/seagate/miniconda3/lib/python3.10/site-packages/transformers/trainer.py", line 1591, in train
    return inner_training_loop(
  File "/mnt/seagate/miniconda3/lib/python3.10/site-packages/transformers/trainer.py", line 1984, in _inner_training_loop
    self._maybe_log_save_evaluate(tr_loss, model, trial, epoch, ignore_keys_for_eval)
  File "/mnt/seagate/miniconda3/lib/python3.10/site-packages/transformers/trainer.py", line 2328, in _maybe_log_save_evaluate
    metrics = self.evaluate(ignore_keys=ignore_keys_for_eval)
  File "/mnt/seagate/miniconda3/lib/python3.10/site-packages/transformers/trainer.py", line 3066, in evaluate
    output = eval_loop(
  File "/mnt/seagate/miniconda3/lib/python3.10/site-packages/trl/trainer/dpo_trainer.py", line 617, in evaluation_loop
    policy_output_decoded, ref_output_decoded = self.get_batch_samples(self.model, random_batch)
  File "/mnt/seagate/miniconda3/lib/python3.10/site-packages/trl/trainer/dpo_trainer.py", line 514, in get_batch_samples
    policy_output = model.generate(
TypeError: PeftModelForCausalLM.generate() takes 1 positional argument but 2 were given

If I change the generate calls in dpo_trainer.py to instead pass a named parameter (i.e. inputs=batch["prompt_input_ids"]), the crash does not happen.

The text was updated successfully, but these errors were encountered:

NitCoh · 2023-10-18T07:30:51Z

Yep, I'm also having the same issues.
I believe it happens only with PeftModelForCausalLM

ZixuanLiu4869 · 2023-10-20T17:14:44Z

I previously used the generate and have the problem, it should explicitly pass the input_ids like this model.generate(input_ids=[your input_ids], ....) and it will solve the problem.

younesbelkada · 2023-11-01T21:30:18Z

Hi there! Makes sense, just made #941 to solve the issue

younesbelkada mentioned this issue Nov 1, 2023

Fix DPOTrainer + PEFT #941

Merged

younesbelkada closed this as completed in #941 Nov 2, 2023

rdk31 mentioned this issue Dec 1, 2023

Fix DPOTrainer + PEFT 2 #1049

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wrong invocation of PeftModelForCausalLM.generate in DPOTrainer? #877

Wrong invocation of PeftModelForCausalLM.generate in DPOTrainer? #877

jploski commented Oct 16, 2023

NitCoh commented Oct 18, 2023

ZixuanLiu4869 commented Oct 20, 2023

younesbelkada commented Nov 1, 2023

Wrong invocation of PeftModelForCausalLM.generate in DPOTrainer? #877

Wrong invocation of PeftModelForCausalLM.generate in DPOTrainer? #877

Comments

jploski commented Oct 16, 2023

NitCoh commented Oct 18, 2023

ZixuanLiu4869 commented Oct 20, 2023

younesbelkada commented Nov 1, 2023