
Allow passing inputs_embeds instead of input_ids #757

Merged

Conversation

BenjaminBossan
Member

Resolves #727

Description

Right now, there is an issue with a few PeftModelForXxx classes when users pass only inputs_embeds but not input_ids. First, the batch size used to be derived from input_ids; now it is derived from inputs_embeds when input_ids is None. Furthermore, in PeftModelForCausalLM, the forward call to the base model was not passing inputs_embeds along, which resulted in errors down the line. These issues are now fixed.
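
As a rough illustration of the batch-size change, the helper below is a minimal sketch reconstructed from this description and the `_get_batch_size` signature shown in the review diff further down; the exact body and error message in the PR may differ:

```python
from typing import Optional

import torch


def _get_batch_size(input_ids: Optional[torch.Tensor], inputs_embeds: Optional[torch.Tensor]) -> int:
    # Prefer input_ids when available, otherwise fall back to inputs_embeds.
    if (input_ids is None) and (inputs_embeds is None):
        raise ValueError("You have to provide either input_ids or inputs_embeds")

    if input_ids is not None:
        return input_ids.shape[0]
    return inputs_embeds.shape[0]
```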

Open issues

During testing, I ran into some problems.

  1. For decoder models, the test would fail for GPTBigCodeForCausalLM. I'm not sure why, but that model is already excluded in other tests, so I just excluded it for this test too.

  2. For feature extraction tests, the test would fail for prefix tuning. Not sure exactly why it only fails here. I thought I only needed to pass the inputs_embeds to the self.base_model call for prefix tuning, but then I got the error below, which is why I just excluded those tests for the time being.

     > TypeError: DebertaV2Model.forward() got an unexpected keyword argument 'past_key_values'

  3. Similarly, I wanted to add the tests for encoder-decoder models, but there I got the error:

     > ValueError: If no decoder_input_ids or decoder_inputs_embeds are passed, input_ids cannot be None. Please pass either input_ids or decoder_input_ids or decoder_inputs_embeds.

So for now, the tests are not run for encoder-decoder models.

It would be great if someone else with more expertise could ensure that those cases work correctly.
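
For context, here is a minimal sketch of the usage this PR is meant to enable, i.e. calling a PEFT causal LM with only inputs_embeds and no input_ids. The base model, prompt text, and config values are placeholders chosen for illustration, not taken from the PR or its tests:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PromptTuningConfig, get_peft_model

base_model = AutoModelForCausalLM.from_pretrained("gpt2")  # placeholder model
tokenizer = AutoTokenizer.from_pretrained("gpt2")

config = PromptTuningConfig(task_type="CAUSAL_LM", num_virtual_tokens=10)
peft_model = get_peft_model(base_model, config)

input_ids = tokenizer("Hello world", return_tensors="pt").input_ids
# Compute the embeddings up front and pass them instead of input_ids.
inputs_embeds = peft_model.get_input_embeddings()(input_ids)

with torch.no_grad():
    outputs = peft_model(inputs_embeds=inputs_embeds)  # input_ids stays None
```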

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Jul 27, 2023

The documentation is not available anymore as the PR was closed or merged.

Contributor

@younesbelkada younesbelkada left a comment


Looking great, thanks a lot!
Regarding gptbigcode + prompt learning: I think that because of the current attention mechanism used by that model (MQA), it is not possible to support prompt learning methods, as it involves concatenating the past key values in a specific way that is currently not supported in PEFT. Not sure if the fix should go in PEFT or in transformers, but for now I would advocate skipping it unless there are a lot of requests for that model + prompt learning.

@BenjaminBossan
Member Author

> because of the current attention mechanism used by that model (MQA), it is not possible to support prompt learning methods

I see, thanks for explaining. This also means that I can re-use the skip_non_pt_mqa function instead of implementing a new one; I made that change.

Contributor

@pacman100 pacman100 left a comment


Thank you @BenjaminBossan for fixing the issues, LGTM 🚀. Left a comment.

@@ -75,6 +75,22 @@
}


def _get_batch_size(input_ids: Optional[torch.Tensor], inputs_embeds: Optional[torch.Tensor]) -> int:
Contributor


can we move it to utils?

Member Author


Done.

I also found an error in the new test where I forgot to move the model to the torch device, which is also fixed now.

Contributor

@pacman100 pacman100 left a comment


Thank you @BenjaminBossan! 🚀

@BenjaminBossan BenjaminBossan merged commit ec267c6 into huggingface:main Aug 2, 2023
@BenjaminBossan BenjaminBossan deleted the fix-727-passing-input-embeds branch August 2, 2023 14:59