Generate: detect special architectures when loaded from PEFT #24198
Conversation
Thanks for fixing!
Just a question about robustness with the listed architectures.
src/transformers/generation/utils.py
Outdated
@@ -4512,7 +4517,8 @@ def _crop_past_key_values(model, past_key_values, maximum_length):
                )
            )
        past_key_values = tuple(new_past)
-    elif "bloom" in model.__class__.__name__.lower():  # bloom is special
+    # bloom is special
+    elif "bloom" in model.__class__.__name__.lower() or "bloom" in model.config.architectures[0].lower():
Are we guaranteed it's always the 0th indexed architecture?
I have never seen more than 1 item in architectures
-- in fact, the only PT reference writing into .config.architectures
is this one
I am assuming it works in the vast majority of cases -- even if the user decides to append more info into this config member, it should still work.
Not a super satisfactory answer, I know 😅
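For what it's worth, a more defensive variant could scan every entry in `architectures` instead of only the first. A minimal sketch of that idea, with a stand-in for the config object (not the real transformers `PretrainedConfig`) and a hypothetical helper name:

```python
from types import SimpleNamespace  # stand-in for a real PretrainedConfig

def is_special_arch(model_class_name, config, keyword):
    """Hypothetical helper: True if `keyword` appears in the model class name
    or in ANY listed architecture (case-insensitive), not just index 0."""
    if keyword in model_class_name.lower():
        return True
    # `architectures` may be missing or None on some configs; default to empty.
    architectures = getattr(config, "architectures", None) or []
    return any(keyword in arch.lower() for arch in architectures)

# A PEFT-wrapped model: the class name no longer mentions "bloom",
# but the base architecture is still recorded in config.architectures.
config = SimpleNamespace(architectures=["PeftModel", "BloomForCausalLM"])
print(is_special_arch("PeftModelForCausalLM", config, "bloom"))  # True
```

This would also degrade gracefully if a user appends extra entries to `architectures`, since the match no longer depends on ordering.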
src/transformers/generation/utils.py
Outdated
@@ -4521,7 +4527,8 @@ def _crop_past_key_values(model, past_key_values, maximum_length):
                )
            )
        past_key_values = tuple(new_past)
-    elif "gptbigcode" in model.__class__.__name__.lower():  # gptbigcode is too
+    # gptbigcode is too
+    elif "gptbigcode" in model.__class__.__name__.lower() or "gptbigcode" in model.config.architectures[0].lower():
Same here re 0 indexing
The documentation is not available anymore as the PR was closed or merged.
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.
What does this PR do?
Fixes #23686
As identified in this comment, a PEFT-loaded BLOOM can't be used as an assistant with assisted generation.
BLOOM (and GPTBigCode) need special handling due to their different cache API, and the architecture detection code was incompatible with PEFT models. This PR adds the logic to detect these special architectures when loaded with PEFT.
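The failure mode can be illustrated: a PEFT wrapper changes the model's `__class__.__name__`, so a name-only check misses the underlying architecture, while `config.architectures` still records it. A minimal sketch with stand-in classes (not the real transformers/peft objects):

```python
from types import SimpleNamespace

class BloomForCausalLM:
    """Stand-in for the base model; records its architecture in config."""
    def __init__(self):
        self.config = SimpleNamespace(architectures=["BloomForCausalLM"])

class PeftModelForCausalLM:
    """Stand-in for a PEFT wrapper that forwards the base model's config."""
    def __init__(self, base_model):
        self.config = base_model.config

def is_bloom(model):
    # Combined check, mirroring the logic this PR adds: fall back to
    # config.architectures when the class name gives no hint.
    return (
        "bloom" in model.__class__.__name__.lower()
        or "bloom" in model.config.architectures[0].lower()
    )

base = BloomForCausalLM()
wrapped = PeftModelForCausalLM(base)
print(is_bloom(base))     # True: matched by class name
print(is_bloom(wrapped))  # True: class name misses, architectures catches it
```

With only the class-name check, `wrapped` would be misclassified and the BLOOM-specific cache cropping would be skipped, which is exactly the assisted-generation bug this PR fixes.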