add uniform processors for altclip + chinese_clip #31198

molbap · 2024-06-03T07:55:43Z

What does this PR do?

Adds two models for #30511; see the parent PR for more details.

molbap · 2024-06-03T07:58:11Z

Tests will succeed once #31197 is merged. See the tests in #30511 to confirm.

HuggingFaceDocBuilderDev · 2024-06-03T13:43:16Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

amyeroberts

Thanks for working on this!

Overall looks good to me. Main thought is that we probably want to move a lot of the kwarg preparation logic outside of the main __call__ method, and it can probably be abstracted into something more general for most processors.

Regarding tests, it would be good to check defaults and updates are correctly passed for all of the possible kwargs, particularly defaulting to the tokenizer.init_kwargs and ensuring these can be overridden.
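A minimal sketch of the kind of check described above, written against a stand-in function rather than the real processor classes. `merge_text_kwargs`, `DEFAULT_TEXT_KWARGS`, and the priority order (class defaults, then tokenizer `init_kwargs`, then call-time kwargs) are assumptions for illustration, not the library's API.

```python
# Hypothetical stand-in for the merging logic; not the transformers implementation.
DEFAULT_TEXT_KWARGS = {"padding": False, "max_length": 77}


def merge_text_kwargs(tokenizer_init_kwargs, call_kwargs):
    """Merge text kwargs; later sources override earlier ones.

    Assumed priority (lowest to highest):
    class defaults < tokenizer init_kwargs < kwargs passed at call time.
    """
    merged = dict(DEFAULT_TEXT_KWARGS)  # copy so the defaults are never mutated
    # Only keys the processor already knows about are taken from init_kwargs.
    merged.update({k: v for k, v in tokenizer_init_kwargs.items() if k in merged})
    merged.update(call_kwargs)
    return merged


init_kwargs = {"max_length": 117, "name_or_path": "stand-in"}

# With nothing passed at call time, the tokenizer's init_kwargs win over the defaults.
from_init = merge_text_kwargs(init_kwargs, {})
assert from_init == {"padding": False, "max_length": 117}

# A call-time value overrides the tokenizer default.
overridden = merge_text_kwargs(init_kwargs, {"max_length": 112, "padding": "max_length"})
assert overridden == {"padding": "max_length", "max_length": 112}
```

A real test would run the same two assertions against the processor output, for example by comparing the length of the returned `input_ids` with the expected `max_length`.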

Would be good to get a second opinion from @qubvel here!

tests/models/altclip/test_processor_altclip.py

molbap · 2024-06-04T11:12:15Z

Thanks! I'm currently moving the kwargs merging into a common method, and explaining the order of operations with respect to kwarg priorities. It will be in #31197 in a minute.

qubvel

Great work 🤗 Very structured and clean code!
I have the same concern as @amyeroberts: the merging logic is worth splitting out into separate methods. It is a bit hard to follow the merging steps for the different kwargs.

What do you think about grouping the merging strategy per modality instead of per step?
Not sure if it's possible, because some kwargs probably have to be popped in advance.

Something like this:

images_kwargs = {} if images_kwargs is None else images_kwargs
default_images_kwargs = ChineseClipProcessorKwargs._defaults.get("images_kwargs", {}).copy()

# merging vision kwargs
images_kwargs = {**default_images_kwargs, **images_kwargs}   # with default
images_kwargs = {**images_kwargs, **kwargs.pop("images_kwargs", {})}  # with passed dict
images_kwargs = merge_with_kwargs_by_key(images_kwargs, ...)
images_kwargs = merge_with_common(images_kwargs, common_kwargs)
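The per-modality grouping suggested above could be fleshed out roughly as follows. This is only a sketch: the bodies of `merge_with_kwargs_by_key` and `merge_with_common`, the key sets `IMAGES_KEYS` and `COMMON_KEYS`, the `DEFAULTS` dict, and the wrapper `build_images_kwargs` are all assumptions made for illustration, since the comment leaves them unspecified.

```python
# Hypothetical sketch; helper bodies and key sets are assumptions, not library code.
IMAGES_KEYS = {"size", "crop_size", "do_resize"}  # assumed image-only kwargs
COMMON_KEYS = {"return_tensors"}                  # assumed kwargs shared by all modalities
DEFAULTS = {"images_kwargs": {"do_resize": True}}


def merge_with_kwargs_by_key(modality_kwargs, flat_kwargs, valid_keys):
    """Overlay flat call-time kwargs that belong to this modality."""
    picked = {k: v for k, v in flat_kwargs.items() if k in valid_keys}
    return {**modality_kwargs, **picked}


def merge_with_common(modality_kwargs, common_kwargs):
    """Overlay kwargs shared by every modality."""
    return {**modality_kwargs, **common_kwargs}


def build_images_kwargs(**kwargs):
    """Run every merging step for the image modality in one place."""
    merged = dict(DEFAULTS.get("images_kwargs", {}))                # class defaults
    merged = {**merged, **kwargs.pop("images_kwargs", {})}          # nested dict
    merged = merge_with_kwargs_by_key(merged, kwargs, IMAGES_KEYS)  # flat kwargs
    common = {k: v for k, v in kwargs.items() if k in COMMON_KEYS}
    return merge_with_common(merged, common)


result = build_images_kwargs(
    images_kwargs={"size": {"height": 224, "width": 224}},
    do_resize=False,      # flat image kwarg, overrides the default
    return_tensors="pt",  # common kwarg
    padding=True,         # text kwarg, ignored for this modality
)
```

Here `result` holds `do_resize=False`, the nested `size`, and `return_tensors="pt"`, while `padding` is left out. A text or audio modality would get its own `build_*` function with the same four steps.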

src/transformers/processing_utils.py

amyeroberts

Nice! Thanks for all the work on this 🤗

src/transformers/models/altclip/processing_altclip.py

molbap · 2024-08-09T14:03:14Z

Re-pinging @amyeroberts on that one and @qubvel and @zucchini-nlp optionally if you want to take another look - not much changed, just was between sprints and didn't merge that one 😬 should be good to go now (I know you approved, Amy, but that was a while ago!)

molbap · 2024-08-14T18:30:03Z

Tests that were working are now broken, I believe since the common tests switched from crop_size to size. I will look into this soon.

tests/models/altclip/test_processor_altclip.py

amyeroberts

Thanks for adding!

* add initial design for uniform processors + align model
* add uniform processors for altclip + chinese_clip
* fix mutable default 👀
* add configuration test
* handle structured kwargs w defaults + add test
* protect torch-specific test
* fix style
* fix
* rebase
* update processor to generic kwargs + test
* fix style
* add sensible kwargs merge
* update test
* fix assertEqual
* move kwargs merging to processing common
* rework kwargs for type hinting
* just get Unpack from extensions
* run-slow[align]
* handle kwargs passed as nested dict
* add from_pretrained test for nested kwargs handling
* [run-slow]align
* update documentation + imports
* update audio inputs
* protect audio types, silly
* try removing imports
* make things simpler
* simplerer
* move out kwargs test to common mixin
* [run-slow]align
* skip tests for old processors
* [run-slow]align, clip
* !$#@!! protect imports, darn it
* [run-slow]align, clip
* [run-slow]align, clip
* update common processor testing
* add altclip
* add chinese_clip
* add pad_size
* [run-slow]align, clip, chinese_clip, altclip
* remove duplicated tests
* fix
* update doc
* improve documentation for default values
* add model_max_length testing
  This parameter depends on tokenizers received.
* Raise if kwargs are specified in two places
* fix
* match defaults
* force padding
* fix tokenizer test
* clean defaults
* move tests to common
* remove try/catch block
* deprecate kwarg
* format
* add copyright + remove unused method
* [run-slow]altclip, chinese_clip
* clean imports
* fix version
* clean up deprecation
* fix style
* add corner case test on kwarg overlap
* resume processing - add Unpack as importable
* add tmpdirname
* fix altclip
* fix up
* add back crop_size to specific tests
* generalize tests to possible video_processor
* add back crop_size arg
* fixup overlapping kwargs test for qformer_tokenizer
* remove copied from
* fixup chinese_clip tests values
* fixup tests - qformer tokenizers
* [run-slow] altclip, chinese_clip
* remove prepare_image_inputs
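The commit "Raise if kwargs are specified in two places" presumably covers the case where the same key arrives both as a flat keyword argument and inside a nested modality dict. A rough sketch of such a check follows; `check_no_overlap` and its error message are hypothetical and not taken from the PR.

```python
def check_no_overlap(flat_kwargs, nested_kwargs):
    """Raise ValueError if a key is given both flat and inside a modality dict."""
    for modality, modality_kwargs in nested_kwargs.items():
        duplicated = sorted(set(flat_kwargs) & set(modality_kwargs))
        if duplicated:
            raise ValueError(
                f"Keyword argument(s) {duplicated} passed both directly and in '{modality}'."
            )


# No overlap: passes silently.
check_no_overlap({"padding": True}, {"images_kwargs": {"size": 224}})

# "size" is given twice, so the check raises.
try:
    check_no_overlap({"size": 224}, {"images_kwargs": {"size": 256}})
    raised = False
except ValueError:
    raised = True
```

Raising here, rather than silently picking one value, avoids having to define which of the two sources wins.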

molbap added 2 commits June 3, 2024 09:38

add initial design for uniform processors + align model

b85036f

add uniform processors for altclip + chinese_clip

1336931

molbap mentioned this pull request Jun 3, 2024

Image + text + audio uniform processors #30511

Open

12 tasks

molbap marked this pull request as draft June 3, 2024 08:08

molbap added 11 commits June 3, 2024 10:58

fix mutable default 👀

bb8ac70

add configuration test

cd8c601

handle structured kwargs w defaults + add test

f00c852

protect torch-specific test

693036f

fix style

766da3a

fix

844394d

rebase

7d860a0

update processor to generic kwargs + test

7cb9925

fix style

ad4cbf7

add sensible kwargs merge

def56cd

update test

2e6b7e1

molbap marked this pull request as ready for review June 3, 2024 13:21

amyeroberts reviewed Jun 4, 2024

tests/models/altclip/test_processor_altclip.py (resolved)

molbap added 2 commits June 4, 2024 13:26

fix assertEqual

c19bbc6

move kwargs merging to processing common

3c38119

qubvel reviewed Jun 4, 2024

src/transformers/processing_utils.py (resolved)

molbap added 7 commits June 5, 2024 18:12

rework kwargs for type hinting

81ae819

just get Unpack from extensions

ce4abcd

run-slow[align]

3acdf28

handle kwargs passed as nested dict

404239f

add from_pretrained test for nested kwargs handling

603be40

[run-slow]align

71c9d6c

update documentation + imports

26383c5

molbap mentioned this pull request Jul 15, 2024

Adding mplugdocowl #31792

Open

5 tasks

amyeroberts approved these changes Jul 17, 2024

src/transformers/models/altclip/processing_altclip.py (outdated, resolved)

molbap added 2 commits August 9, 2024 15:53

resume processing - add Unpack as importable

9978621

Merge branch 'main' into uniform_processors_2

182a9ec

molbap requested a review from amyeroberts August 13, 2024 13:29

molbap added 4 commits August 14, 2024 20:04

Merge branch 'main' into uniform_processors_2

514aae9

add tmpdirname

f5e2326

fix altclip

357e8ff

fix up

402445b

molbap added 10 commits September 18, 2024 17:12

Merge branch 'main' into uniform_processors_2

8212d8e

add back crop_size to specific tests

eb6a933

Merge branch 'main' into uniform_processors_2

468541c

generalize tests to possible video_processor

1e950e3

add back crop_size arg

ae2e605

fixup overlapping kwargs test for qformer_tokenizer

4b3c4f3

remove copied from

5f50aeb

fixup chinese_clip tests values

de5980a

fixup tests - qformer tokenizers

61e2664

[run-slow] altclip, chinese_clip

58211f5

molbap mentioned this pull request Sep 19, 2024

Uniformize kwargs for Paligemma processor and update docs #33571

Merged

5 tasks

amyeroberts reviewed Sep 19, 2024

tests/models/altclip/test_processor_altclip.py (outdated, resolved)

amyeroberts approved these changes Sep 19, 2024

remove prepare_image_inputs

416e10a

molbap merged commit 413008c into huggingface:main Sep 19, 2024
21 checks passed
