Add YOLOS #16848

NielsRogge · 2022-04-20T09:48:29Z

What does this PR do?

This PR adds YOLOS, an awesome and simple object detector.

YOLOS is just a single Transformer encoder (ViT), trained using DETR's objective.

For now, I've used "vit" as base_model_prefix, in order to easily load weights from ViT and ViTMAE checkpoints on the hub.

src/transformers/models/yolos/configuration_yolos.py

src/transformers/models/yolos/convert_yolos_to_pytorch.py

tests/yolos/test_modeling_yolos.py

sgugger

Thanks for adding this new model! There are a few badly named variables left and some docstrings to fix, but overall it's in great shape!

docs/source/en/model_doc/yolos.mdx

src/transformers/models/yolos/configuration_yolos.py

src/transformers/models/yolos/modeling_yolos.py

tests/yolos/test_modeling_yolos.py

NielsRogge · 2022-04-26T10:12:22Z

Addressed most comments. The remaining comments are about badly formatted docstrings, however these are all copied from DETR (so I can't change them due to #Copied from statements). Is it ok if I address these docstrings in a separate PR for both models?

Also pinging @Narsil as the pipeline test for YOLOS is failing. This is because YOLOS doesn't take pixel_mask as input, whereas DETR does. This makes YOLOS fail for the object detection pipeline.

sgugger · 2022-04-26T11:36:46Z

I'd advocate to make the changes in docstrings in DETR to be propagated to YOLOS in this PR, just to make sure we don't forget.

Narsil · 2022-04-26T12:14:04Z

Also pinging @Narsil as the pipeline test for YOLOS is failing. This is because YOLOS doesn't take pixel_mask as input, whereas DETR does. This makes YOLOS fail for the object detection pipeline.

Then the feature_extractor should not output them. The image pipeline are pretty simple and roughly simply do

model(**feature_extractor(image)) so if the feature extractor only outputs what's needed then it should work.
That or pixel_mask should be handled (doesn't seem to be making sense for this model reading your comment).

NielsRogge · 2022-04-26T12:40:10Z

Then the feature_extractor should not output them.

Yeah the problem is, YOLOS uses the same feature extractor as DETR, which outputs both pixel_values and pixel_mask. Hence, I've just added ("yolos", "DetrFeatureExtractor") to the Auto Feature Extractor API.

I think the easiest here is to add pixel_mask=None to the forward of YOLOS, in order to make model(**feature_extractor(image)) work.

sgugger · 2022-04-26T13:09:48Z

I think the easiest here is to add pixel_mask=None to the forward of YOLOS, in order to make model(**feature_extractor(image)) work.

We're not adding an argument that will be ignored all the time, that's just confusing to users. Especially if they end up passing one and don't get why it's not used.

If the feature extractor should not return pixel_mask then either use a class attribute on DetrFeatureExtractor to make it not return that in certain cases, or create a YolosFeatureExtractor that removes that field from the output of the feature extractor.

NielsRogge · 2022-04-26T16:09:25Z

Ok so I created a new YolosFeatureExtractor, however the pipeline test is still failing:

def _call_impl(self, *input, **kwargs):
        forward_call = (self._slow_forward if torch._C._get_tracing_state() else self.forward)
        # If we don't have any hooks, we want to skip the rest of the logic in
        # this function, and just call forward.
        if not (self._backward_hooks or self._forward_hooks or self._forward_pre_hooks or _global_backward_hooks
                or _global_forward_hooks or _global_forward_pre_hooks):
>           return forward_call(*input, **kwargs)
E           TypeError: forward() got an unexpected keyword argument 'pixel_mask'

@Narsil could you help me debug this? It's weird cause YolosFeatureExtractor doesn't create a pixel mask. Also, I added doc tests which are passing.

Narsil · 2022-04-28T09:23:49Z

@Narsil could you help me debug this? It's weird cause YolosFeatureExtractor doesn't create a pixel mask. Also, I added doc tests which are passing.

I have checked and the reason is that the tested Feature extractor is actually a detr one, not a Yolo one:

https://github.com/huggingface/transformers/pull/16848/files#diff-fcbe32a3a065f97b00f1c242ecd45858b8d5680a2437b65af183eb0c439e2be9R347

The pipeline tests rely on ModelTester to create the base objects

HuggingFaceDocBuilderDev · 2022-04-28T12:44:55Z

The documentation is not available anymore as the PR was closed or merged.

README.md

NielsRogge · 2022-05-02T16:30:51Z

Failing test is unrelated, merging.

* First draft * Add YolosForObjectDetection * Make forward pass work * Add mid position embeddings * Add interpolation of position encodings * Add expected values * Add YOLOS to tests * Add integration test * Support tiny model as well * Support all models in conversion script * Remove mid_pe_size attribute * Make more tests pass * Add model to README and fix config * Add copied from statements * Rename base_model_prefix to vit * Add missing YOLOS_PRETRAINED_CONFIG_ARCHIVE_MAP * Apply suggestions from code review * Apply more suggestions from code review * Convert remaining checkpoints * Improve docstrings * Add YolosFeatureExtractor * Add feature extractor to docs * Add corresponding tests * Fix style * Fix docs * Apply suggestion from code review * Fix bad rebase * Fix some more bad rebase * Fix missing character * Improve docs and variable names Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

NielsRogge commented Apr 20, 2022

View reviewed changes

src/transformers/models/yolos/configuration_yolos.py Outdated Show resolved Hide resolved

NielsRogge commented Apr 20, 2022

View reviewed changes

src/transformers/models/yolos/convert_yolos_to_pytorch.py Outdated Show resolved Hide resolved

NielsRogge commented Apr 20, 2022

View reviewed changes

tests/yolos/test_modeling_yolos.py Outdated Show resolved Hide resolved

NielsRogge commented Apr 20, 2022

View reviewed changes

tests/yolos/test_modeling_yolos.py Outdated Show resolved Hide resolved

NielsRogge requested review from sgugger and LysandreJik April 20, 2022 09:52

NielsRogge mentioned this pull request Apr 20, 2022

Adding YOLOS to HuggingFace Transformers hustvl/YOLOS#24

Open

sgugger approved these changes Apr 20, 2022

View reviewed changes

NielsRogge force-pushed the add_yolos branch from f414e10 to 1ffbaff Compare April 26, 2022 09:51

NielsRogge force-pushed the add_yolos branch from e8d2fdc to a48cd46 Compare April 27, 2022 06:55

Niels Rogge added 12 commits May 2, 2022 14:08

First draft

7a17bf6

Add YolosForObjectDetection

6b2e22e

Make forward pass work

4d77008

Add mid position embeddings

563be7b

Add interpolation of position encodings

c9aa75b

Add expected values

f3aed34

Add YOLOS to tests

ec67166

Add integration test

c8c80d9

Support tiny model as well

d51c09a

Support all models in conversion script

bf7ad85

Remove mid_pe_size attribute

a463318

Make more tests pass

39b478c

Niels Rogge added 13 commits May 2, 2022 14:09

Add model to README and fix config

bf38802

Add copied from statements

992de41

Rename base_model_prefix to vit

f1460c3

Add missing YOLOS_PRETRAINED_CONFIG_ARCHIVE_MAP

86d6e34

Apply suggestions from code review

74ca0c4

Apply more suggestions from code review

c03b734

Convert remaining checkpoints

a7875e8

Improve docstrings

7c2dc6e

Add YolosFeatureExtractor

2663957

Add feature extractor to docs

a61626d

Add corresponding tests

c31edd6

Fix style

450798d

Fix docs

a72165f

NielsRogge force-pushed the add_yolos branch from 4b6ba22 to a72165f Compare May 2, 2022 12:10

sgugger approved these changes May 2, 2022

View reviewed changes

README.md Outdated Show resolved Hide resolved

Niels Rogge added 5 commits May 2, 2022 14:29

Apply suggestion from code review

dcc344a

Fix bad rebase

daa3a52

Fix some more bad rebase

d923bea

Fix missing character

2567f4e

Improve docs and variable names

a242ed1

NielsRogge merged commit 1ac6987 into huggingface:main May 2, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add YOLOS #16848

Add YOLOS #16848

NielsRogge commented Apr 20, 2022 •

edited

Loading

sgugger left a comment

NielsRogge commented Apr 26, 2022 •

edited

Loading

sgugger commented Apr 26, 2022

Narsil commented Apr 26, 2022

NielsRogge commented Apr 26, 2022 •

edited

Loading

sgugger commented Apr 26, 2022

NielsRogge commented Apr 26, 2022 •

edited

Loading

Narsil commented Apr 28, 2022

HuggingFaceDocBuilderDev commented Apr 28, 2022 •

edited

Loading

NielsRogge commented May 2, 2022

Add YOLOS #16848

Add YOLOS #16848

Conversation

NielsRogge commented Apr 20, 2022 • edited Loading

What does this PR do?

sgugger left a comment

Choose a reason for hiding this comment

NielsRogge commented Apr 26, 2022 • edited Loading

sgugger commented Apr 26, 2022

Narsil commented Apr 26, 2022

NielsRogge commented Apr 26, 2022 • edited Loading

sgugger commented Apr 26, 2022

NielsRogge commented Apr 26, 2022 • edited Loading

Narsil commented Apr 28, 2022

HuggingFaceDocBuilderDev commented Apr 28, 2022 • edited Loading

NielsRogge commented May 2, 2022

NielsRogge commented Apr 20, 2022 •

edited

Loading

NielsRogge commented Apr 26, 2022 •

edited

Loading

NielsRogge commented Apr 26, 2022 •

edited

Loading

NielsRogge commented Apr 26, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Apr 28, 2022 •

edited

Loading