BaseImageProcessor #26

amyeroberts · 2022-08-08T11:09:24Z

What does this PR do?

Introduces the BaseImageProcessor class for all other model image processors to inherit from.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2022-08-08T11:21:04Z

The documentation is not available anymore as the PR was closed or merged.

Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

alaradirik

Nice work, excited to see this moving forward :)

alaradirik · 2022-08-11T16:26:32Z

src/transformers/image_processing_utils.py

+        data (`dict`):
+            Dictionary of lists/arrays/tensors returned by the __call__/pad methods ('pixel_values', 'attention_mask',
+            etc.).
+        tensor_type (`Union[None, str, TensorType]`, *optional*):


A quick question - are we going to default to "np" ? If so, maybe we can remove None from the accepted argument types or make it a non-optional argument

It depends partly on whether we merge in: huggingface#18499

Defaulting to "np" isn't necessary to be able to use different combinations of e.g. do_resize and do_normalize. As we're aliasing the previous feature extractors with the new image processors, change the default would still be a breaking change.

If we decided to default to "np", then we'd have to include additional checks on the processed images. At the moment, because resize resizes the images to multiples of size_divisor, they are not guaranteed to all be the same size. This means calls the BatchFeature will fail if any of "np", "tf", "pt"or"jax"` are passed in as the images can't be batched together.

My preference would be to keep return_tensors=None as this more closely matches the behaviour of our tokenizers. However, our tokenizer provides arguments such that batches can be created e.g. padding=True. Not sure if an equivalent makes sense here.

What do you think? If we want to set "np" as default we should discuss how to handle introducing the image processors versus introducing that change.
cc @NielsRogge

Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com>

Image processor glpn

amyeroberts added 30 commits July 27, 2022 16:12

Base processor skeleton

54aed8b

BatchFeature for packaging image processor outputs

ba55c89

Initial image processor for GLPN

4b430d4

REmove accidental import

b1c8b59

Import BatchFeature from feature_extraction_utils

b9ce4a0

Fixup and docs

6b678fb

Fixup and docs

db93437

Fixup and docs

bd890d5

Fixup and docs

4b27a34

BatchFeature for packaging image processor outputs

ff0d49e

Import BatchFeature from feature_extraction_utils

2c2fa9a

Merge branch 'image-processor-mixin' into base-image-processor-class

b9f7837

Resolve conflicts

346270d

Import BatchFeature from feature_extraction_utils

7faf2e6

Fixup and docs

ccc15fb

Fixup and docs

c8f8eb6

BatchFeature for packaging image processor outputs

90093f4

Import BatchFeature from feature_extraction_utils

d89c051

Fixup and docs

9bc9157

Mixin for saving the image processor

6ec382a

Fixup and docs

56ee6ad

Merge branch 'image-batch-feature' into image-processor-glpn

38ebb50

Add rescale back and remove ImageType

6b88d5f

fix import mistake

67077f1

Merge branch 'image-processor-mixin' into base-image-processor-class

fb6438c

Merge branch 'base-image-processor-class' into image-batch-feature

ffe71b6

Merge branch 'image-batch-feature' into image-processor-glpn

cc480e8

Merge branch 'image-processor-mixin' into base-image-processor-class

4264d1a

Merge in branch and remove conflicts

fb5dcd6

Add in rescaling

43f561d

amyeroberts added 17 commits August 2, 2022 19:17

Merge branch 'base-image-processor-class' into image-batch-feature

8f63b76

Merge branch 'image-batch-feature' into image-processor-glpn

46a9c74

Remove default to numpy batching

082e4ff

Fix up

bf73358

Add docstring and model_input_types

34b6b2f

Merge branch 'image-processor-mixin' into base-image-processor-class

8678c13

Merge branch 'base-image-processor-class' into image-batch-feature

937884c

Merge branch 'image-batch-feature' into image-processor-glpn

a1b681a

Resolve merge conflicts

952c2a0

Resolve merge conflicts

2f0fa0b

Resolve merge conflicts

e6233cc

Merge branch 'image-processor-mixin' into base-image-processor-class

bd0afd6

Merge branch 'base-image-processor-class' into image-batch-feature

a6f69bc

Merge and resolve conflicts

a7af81f

Merge branch 'image-processor-mixin' into base-image-processor-class

b66d0f6

Merge branch 'base-image-processor-class' into image-batch-feature

8b73f89

Merge branch 'image-batch-feature' into image-processor-glpn

ae6030c

amyeroberts changed the base branch from base-image-processor-class to image-processor-mixin August 8, 2022 11:23

amyeroberts and others added 2 commits August 8, 2022 12:31

Fix up

7a4d22a

Apply suggestions from code review

790c2c6

Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

amyeroberts requested review from alaradirik and NielsRogge August 11, 2022 13:00

alaradirik approved these changes Aug 11, 2022

View reviewed changes

Update src/transformers/image_transforms.py

2e929cf

Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com>

Base automatically changed from image-processor-mixin to image-transforms-library August 16, 2022 15:40

amyeroberts added 2 commits August 17, 2022 13:08

Add in docstrings

ae35873

Merge pull request #23 from amyeroberts/image-processor-glpn

4fff267

Image processor glpn

amyeroberts merged commit 62c6e55 into image-transforms-library Aug 17, 2022

amyeroberts deleted the image-batch-feature branch August 17, 2022 12:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BaseImageProcessor #26

BaseImageProcessor #26

amyeroberts commented Aug 8, 2022

HuggingFaceDocBuilderDev commented Aug 8, 2022 •

edited

Loading

alaradirik left a comment

alaradirik Aug 11, 2022

amyeroberts Aug 12, 2022

BaseImageProcessor #26

BaseImageProcessor #26

Conversation

amyeroberts commented Aug 8, 2022

What does this PR do?

Before submitting

HuggingFaceDocBuilderDev commented Aug 8, 2022 • edited Loading

alaradirik left a comment

Choose a reason for hiding this comment

alaradirik Aug 11, 2022

Choose a reason for hiding this comment

amyeroberts Aug 12, 2022

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Aug 8, 2022 •

edited

Loading