Fix image segmentation tool bug #23897

amyeroberts · 2023-05-31T10:39:56Z

What does this PR do?

Currently tools using the ImageSegmentation tool fail because the parameters for the image processor are overridden with the input image dimensions. This results in incompatible input dimensions being passed to the model.

This PR removes this logic in the encode method and removes the resizing in the tests which only happened for the segmentation tool and hid the issue.

Fixes #23328

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

amyeroberts · 2023-05-31T10:54:48Z

@sgugger @LysandreJik The fix is pretty simple, but in the spirit of Chesterton's fence, I wasn't sure/couldn't remember the reason for the self.pre_processor.image_processor.size = ... logic so wanted to ask you both in case this is breaking assumptions elsewhere.

HuggingFaceDocBuilderDev · 2023-05-31T10:55:48Z

The documentation is not available anymore as the PR was closed or merged.

sgugger

LGTM but let's wait for @LysandreJik as he wrote this tool :-)

LysandreJik · 2023-05-31T19:21:01Z

This seems reasonable! I had run into issues where I couldn't segment images with different sizes when implementing it.

I have tried with different flavors of the following

and it seems to work well even without specifying the size.

Your change looks good to me @amyeroberts; are you aware of sizes that may not work with this model?

amyeroberts · 2023-06-01T13:47:33Z

@LysandreJik It should all be OK if the image processor matches the model: there shouldn't be any sizes that don't work because the image processor will resize as needed.

In terms of sizes that won't work:

We can't input images with either height or width smaller that the model's patch size - 16 by default
Images which (image_height // patch_size) * (image_width // patch_size) > max_sequence_length
Non-square images where (image_height // patch_size) != (image_width // patch_size). Just looking high-level at the model, I think it could be reworked to accept non-square images but haven't dug into it deeply.

LysandreJik

Thank you, @amyeroberts!

amyeroberts added 2 commits May 30, 2023 18:02

Image segmentation tool bug

64dd82d

Remove resizing in the tests

531ce95

amyeroberts requested review from LysandreJik and sgugger May 31, 2023 10:52

sgugger approved these changes May 31, 2023

View reviewed changes

LysandreJik approved these changes Jun 15, 2023

View reviewed changes

LysandreJik merged commit e6122c3 into huggingface:main Jun 15, 2023

amyeroberts deleted the fix-image-segmentation-tool-bug branch June 15, 2023 12:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix image segmentation tool bug #23897

Fix image segmentation tool bug #23897

amyeroberts commented May 31, 2023

amyeroberts commented May 31, 2023

HuggingFaceDocBuilderDev commented May 31, 2023 •

edited

Loading

sgugger left a comment

LysandreJik commented May 31, 2023

amyeroberts commented Jun 1, 2023

LysandreJik left a comment

Fix image segmentation tool bug #23897

Fix image segmentation tool bug #23897

Conversation

amyeroberts commented May 31, 2023

What does this PR do?

Before submitting

amyeroberts commented May 31, 2023

HuggingFaceDocBuilderDev commented May 31, 2023 • edited Loading

sgugger left a comment

Choose a reason for hiding this comment

LysandreJik commented May 31, 2023

amyeroberts commented Jun 1, 2023

LysandreJik left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented May 31, 2023 •

edited

Loading