Add ViTDet #25524

NielsRogge · 2023-08-15T20:50:39Z

What does this PR do?

This PR adds part 1 of #25051, namely the ViTDet backbone, introduced in Exploring Plain Vision Transformer Backbones for Object Detection.

Note that this PR only adds the backbone, hence there are no compatible checkpoints with the backbone-only. Those can only be added once either VitMatte or Mask R-CNN are added, both of which use VitDet as backbone.

HuggingFaceDocBuilderDev · 2023-08-15T21:18:10Z

The documentation is not available anymore as the PR was closed or merged.

amyeroberts

Thanks for adding this model!

Very nice and clean PR :) Mostly just nits. The diff on the toctree will need to be resolved before merging.

docs/source/en/_toctree.yml

src/transformers/models/vitdet/configuration_vitdet.py

src/transformers/models/vitdet/modeling_vitdet.py

src/transformers/models/vitdet/configuration_vitdet.py

tests/models/vitdet/test_modeling_vitdet.py

src/transformers/models/vitdet/modeling_vitdet.py

NielsRogge · 2023-08-21T18:21:43Z

@amyeroberts thanks for your review, I've addressed all the comments

amyeroberts

Thanks again for adding and iterating!

There's still an issue with the toctree diff which needs to be addressed

amyeroberts · 2023-08-22T10:24:45Z

docs/source/en/_toctree.yml

Reminder to resolve this. Once that's done we can merge in the PR

cc @ydshieh who knows the fix for this

If you discard the change in this file of this PR, rebase on main, adding back ViTDet entry to this file and run the style, it will be fine.

A fix #25661 is merged into main.

* First draft * Fix READMEs * Update return_dict * Add more tests * Fix docstrings * Address comments * Address more comments * Address more comments * Address more comments, fix test * Fix test

NielsRogge requested a review from amyeroberts August 15, 2023 20:57

amyeroberts approved these changes Aug 16, 2023

View reviewed changes

amyeroberts reviewed Aug 22, 2023

View reviewed changes

NielsRogge added 10 commits August 25, 2023 19:58

First draft

a6b10ff

Fix READMEs

e53f6b4

Update return_dict

8b6fa28

Add more tests

b327704

Fix docstrings

7f52fef

Address comments

2b1135f

Address more comments

15f1196

Address more comments

cc23ac9

Address more comments, fix test

5bc1d41

Fix test

8221c6f

NielsRogge force-pushed the add_vitdet branch from 38f128c to 8221c6f Compare August 25, 2023 17:58

amyeroberts merged commit 4c21da5 into huggingface:main Aug 29, 2023

NielsRogge mentioned this pull request Aug 29, 2023

Add ViTMatte #25843

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ViTDet #25524

Add ViTDet #25524

NielsRogge commented Aug 15, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 15, 2023 •

edited

Loading

amyeroberts left a comment

NielsRogge commented Aug 21, 2023

amyeroberts left a comment

amyeroberts Aug 22, 2023

NielsRogge Aug 24, 2023

ydshieh Aug 24, 2023

Add ViTDet #25524

Add ViTDet #25524

Conversation

NielsRogge commented Aug 15, 2023 • edited Loading

What does this PR do?

HuggingFaceDocBuilderDev commented Aug 15, 2023 • edited Loading

amyeroberts left a comment

Choose a reason for hiding this comment

NielsRogge commented Aug 21, 2023

amyeroberts left a comment

Choose a reason for hiding this comment

amyeroberts Aug 22, 2023

Choose a reason for hiding this comment

NielsRogge Aug 24, 2023

Choose a reason for hiding this comment

ydshieh Aug 24, 2023

Choose a reason for hiding this comment

NielsRogge commented Aug 15, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 15, 2023 •

edited

Loading