
Pipeline image-to-text task and Bitsandbytes error #24834

Closed
3 of 4 tasks
mediocreatmybest opened this issue Jul 15, 2023 · 8 comments · Fixed by #24947

@mediocreatmybest

System Info

Python 3.10.6
Transformers 4.30.0
Bitsandbytes 0.39.1

Windows / Linux

Who can help?

@NAR

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

Using a 4-bit or 8-bit quantised model, such as:

https://huggingface.co/Mediocreatmybest/blip2-opt-2.7b_8bit

Expected behavior

The pipeline's image processor should detect that the model is loaded as a 4-bit or 8-bit bitsandbytes model and cast its inputs to the matching dtype.

I apologise if this should be a feature request rather than a bug report; I couldn't find any examples of what I was trying to do.

When running through the pipeline examples from the Hugging Face website, if I try using an 8-bit model, the model seems to be detected correctly and cast to 8-bit, but the processor doesn't follow suit and runs at its default precision, throwing an error that the input and the weights should be the same floating-point type.

I've uploaded a few models set at 8-bit to save on size and memory, as BLIP-2 is pretty heavy and using it on consumer devices is obviously challenging.

The models I’ve uploaded to HuggingFace are:

Mediocreatmybest/blip2-opt-2.7b_8bit
Mediocreatmybest/blip2-opt-6.7b_8bit
Mediocreatmybest/blip2-flan-t5-xxl_8bit

I can get them working with the regular, non-pipeline methods (roughly as in the sketch below), but as I'm a beginner it's obviously challenging. Thanks again for all the great work!
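
For reference, a minimal sketch of the non-pipeline path, assuming the standard Salesforce/blip2-opt-2.7b checkpoint as an example (the explicit cast of the processor output to torch.float16 is the manual step the pipeline appears to be missing):

# pip install accelerate bitsandbytes
import requests
import torch
from PIL import Image
from transformers import Blip2ForConditionalGeneration, Blip2Processor

model_id = "Salesforce/blip2-opt-2.7b"
processor = Blip2Processor.from_pretrained(model_id)
model = Blip2ForConditionalGeneration.from_pretrained(
    model_id, device_map="auto", load_in_8bit=True, torch_dtype=torch.float16
)

url = "https://huggingface.co/datasets/Narsil/image_dummy/raw/main/parrots.png"
image = Image.open(requests.get(url, stream=True).raw)

# The processor returns float32 pixel values by default; cast them to the
# model's half-precision compute dtype to avoid the dtype-mismatch error.
inputs = processor(images=image, return_tensors="pt").to(model.device, torch.float16)
generated_ids = model.generate(**inputs, max_new_tokens=30)
print(processor.batch_decode(generated_ids, skip_special_tokens=True)[0].strip())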

@mediocreatmybest
Author

Based on this document, it should be possible, but maybe this is just an issue with multimodal or image processors with pipeline?

https://huggingface.co/docs/transformers/main/pipeline_tutorial

# pip install accelerate bitsandbytes
import torch
from transformers import pipeline

pipe = pipeline(model="facebook/opt-1.3b", device_map="auto", model_kwargs={"load_in_8bit": True})
output = pipe("This is a cool example!", do_sample=True, top_p=0.95)

@mediocreatmybest
Author

Also, I created a huggingface.co Space using pipeline with the option to try loading in 8-bit (which, as expected, errors):

https://huggingface.co/spaces/Mediocreatmybest/PipelineImageCaption

Thanks.

@mediocreatmybest
Author

Adding the stack trace from Google Colab.


RuntimeError Traceback (most recent call last)
in <cell line: 19>()
17 captioner
18 # caption
---> 19 caption = captioner(image)[0]['generated_text']
20 print(caption)

16 frames
/usr/local/lib/python3.10/dist-packages/torch/nn/modules/conv.py in _conv_forward(self, input, weight, bias)
457 weight, bias, self.stride,
458 _pair(0), self.dilation, self.groups)
--> 459 return F.conv2d(input, weight, bias, self.stride,
460 self.padding, self.dilation, self.groups)
461

RuntimeError: Input type (float) and bias type (c10::Half) should be the same
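
For what it's worth, the failure is a plain float32-input-into-float16-layer mismatch. A minimal, hypothetical illustration (not taken from the issue; the exact message can vary by device and PyTorch version):

import torch

# A float16 conv layer, standing in for the model's half-precision vision tower.
conv = torch.nn.Conv2d(3, 8, kernel_size=3).half()
# A float32 input, standing in for the image processor's default output.
x = torch.randn(1, 3, 32, 32)

try:
    conv(x)
except RuntimeError as e:
    print(e)  # e.g. "Input type (float) and bias type (c10::Half) should be the same"

# Casting the input to the layer's dtype, conv(x.half()), is what resolves it.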

@mediocreatmybest mediocreatmybest changed the title Pipeline + Bitsandbytes Pipeline task image-to-text task and Bitsandbytes error Jul 18, 2023
@mediocreatmybest mediocreatmybest changed the title Pipeline task image-to-text task and Bitsandbytes error Pipeline image-to-text task and Bitsandbytes error Jul 18, 2023
@sgugger
Collaborator

sgugger commented Jul 18, 2023

cc @younesbelkada

@younesbelkada
Contributor

younesbelkada commented Jul 18, 2023

Hi @mediocreatmybest
Thanks for the issue. It seems the input image needs to be converted to half precision (torch.float16). Can you share a small, handy, reproducible snippet that leads to your bug?

@mediocreatmybest
Author

mediocreatmybest commented Jul 18, 2023

Thanks for the fast response!

The snippet I was using to test on Google Colab and on my personal device was:

from transformers import pipeline
import torch

image = "https://huggingface.co/datasets/Narsil/image_dummy/raw/main/parrots.png"
model = "Salesforce/blip-image-captioning-base"

# load the model in 8-bit via bitsandbytes
model_kwargs = {"load_in_8bit": True, "torch_dtype": torch.float16}
captioner = pipeline(
    task="image-to-text",
    model=model,
    max_new_tokens=30,
    model_kwargs=model_kwargs,
    use_fast=True,
)

# caption
caption = captioner(image)[0]['generated_text']
print(caption)

(Copied and pasted from my mobile device, so hopefully this is formatted correctly.)

Thanks 🙏
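
As a possible interim workaround, something like the following hypothetical, untested sketch might get past the error by wrapping the pipeline's preprocess step so the float32 pixel values are cast to half precision before they reach the model (the "pixel_values" key is assumed from the image-to-text preprocessing):

import torch
from transformers import pipeline

captioner = pipeline(
    task="image-to-text",
    model="Salesforce/blip-image-captioning-base",
    model_kwargs={"load_in_8bit": True, "torch_dtype": torch.float16},
)

# Wrap preprocess so the float32 pixel values it produces are cast to the
# model's half-precision dtype before the forward pass.
_orig_preprocess = captioner.preprocess

def _patched_preprocess(inputs, **kwargs):
    model_inputs = _orig_preprocess(inputs, **kwargs)
    model_inputs["pixel_values"] = model_inputs["pixel_values"].to(torch.float16)
    return model_inputs

captioner.preprocess = _patched_preprocess

image = "https://huggingface.co/datasets/Narsil/image_dummy/raw/main/parrots.png"
print(captioner(image)[0]['generated_text'])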

@JimAllanson
Contributor

I encountered similar errors while using Blip/Blip2/Git models in an image_to_text pipeline. In my case, I was working with float16 instead of 8bit precision, as under my setup I was encountering additional issues with 8bit. I think there's a very good chance that the fix I've made in #24947 might also fix your issue (for the three models I've implemented the fix for). If you're able to give it a try I'd be interested in hearing if it fixes your issue too.

@mediocreatmybest
Author

> I encountered similar errors while using Blip/Blip2/Git models in an image_to_text pipeline. In my case, I was working with float16 instead of 8bit precision, as under my setup I was encountering additional issues with 8bit. I think there's a very good chance that the fix I've made in #24947 might also fix your issue (for the three models I've implemented the fix for). If you're able to give it a try I'd be interested in hearing if it fixes your issue too.

Thanks @JimAllanson, happy to help test, but I'm pretty new to Python. What is the best way to test this for you? Editing the site-packages files with the change?
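
(If it helps, one common way to test an open PR without hand-editing site-packages is to install straight from the PR ref in a fresh environment, e.g. `pip install git+https://github.com/huggingface/transformers.git@refs/pull/24947/head`, assuming your pip version can fetch GitHub PR refs.)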
