fix cpu bnb path #34647
Conversation
Looks good, but do you have an example of code that failed before and is fixed by this PR?
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
I ran this case on a CPU-only device:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "Felladrin/Llama-68M-Chat-v1"
text = ["I am happy because", "This is"]

tokenizer = AutoTokenizer.from_pretrained(model_id)
tokenizer.pad_token = tokenizer.eos_token
input_ids = tokenizer(text, return_tensors="pt", padding=True)

quantization_config = BitsAndBytesConfig(load_in_8bit=True)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", quantization_config=quantization_config)
model.generation_config.cache_implementation = "static"
model.generate(**input_ids)
```

Error:
Hi @Rocketknight1. The script reproduces this error easily on both AWQ and BNB. Besides, it's not safe to index into the device list without checking its length, as sketched below.
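(A minimal sketch of the failure mode and the kind of guard described here; this is not the literal diff in `src/transformers/generation/utils.py`, and the variable names are assumptions for illustration.)

```python
# What a CPU-only bnb/AWQ load can produce as a device map (illustrative value).
hf_device_map = {"": "cpu"}

# Unsafe pattern: indexing the first non-CPU entry raises IndexError when the
# filtered list is empty, i.e. on a CPU-only machine.
# main_device = [d for d in hf_device_map.values() if d not in ("cpu", "disk")][0]

# Guarded pattern: check the length first and fall back to "cpu".
accelerator_devices = [d for d in hf_device_map.values() if d not in ("cpu", "disk")]
main_device = accelerator_devices[0] if len(accelerator_devices) > 0 else "cpu"
print(main_device)  # -> "cpu" on a CPU-only machine
```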
Hi @Rocketknight1 @ArthurZucker @SunMarc @gante @zucchini-nlp, do you mind reviewing this change? Thanks!
Hi @Titus-von-Koeller. This change is needed for the bitsandbytes CPU path; can you help review it? It's also needed for the AWQ CPU path, which is already enabled here: #33460
The following script can reproduce the AWQ error:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig, AwqConfig

model_id = "PrunaAI/JackFram-llama-68m-AWQ-4bit-smashed"
text = ["I am happy because", "This is"]

tokenizer = AutoTokenizer.from_pretrained(model_id)
tokenizer.pad_token = tokenizer.eos_token
input_ids = tokenizer(text, return_tensors="pt", padding=True)

# quantization_config = BitsAndBytesConfig(load_in_8bit=True)
quantization_config = AwqConfig(version="ipex")
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", quantization_config=quantization_config)
model.generation_config.cache_implementation = "static"
model.generate(**input_ids)
```
Hi @aymeric-roucher @LysandreJik, do you have time to review this bug fix? Thanks!
@jiqing-feng sorry for the late reply, the
Thanks for the fix! I've suggested a similar fix that follows more closely what we have in accelerate. LMK if this works for you!
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Hi @SunMarc, I have applied your changes, thanks!
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Nice, thanks!
LGTM 🤗
* fix cpu bnb path

* Update src/transformers/generation/utils.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* fix awq quantizer env check

* fix awq quantizer device check

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

---------

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Hi @Rocketknight1 @ArthurZucker @SunMarc @gante,
When a bnb model is loaded on CPU, hf_device_map contains "cpu". A bnb model on the CPU device should be accepted by transformers because the CPU backend has been enabled in BNB. Please take a look, thanks!