[Feature] Support ViTPose #1876

LareinaM · 2022-12-12T03:26:49Z

Motivation

Add implementation of ViTPose on MMPose 1.0

Modification

Add six config files
Add layer decay optimizer
In HeatmapHead, add parameter upsample, default to zero (no effect on previous codes), resize in BaseHead

BC-breaking (Optional)

Use cases (Optional)

Checklist

**Before

I have read and followed the workflow indicated in the CONTRIBUTING.md to create this PR.
Pre-commit or linting tools indicated in CONTRIBUTING.md are used to fix the potential lint issues.
Bug fixes are covered by unit tests, the case that causes the bug should be added in the unit tests.
New functionalities are covered by complete unit tests. If not, please add more unit tests to ensure correctness.
The documentation has been modified accordingly, including docstring or example tutorials.

After PR:

CLA has been signed and all committers have signed the CLA in this PR.

jin-s13 · 2022-12-12T10:08:29Z

Have you tried training a base ViT model to check the accuracy?

LareinaM · 2023-02-07T12:33:02Z

Result of the current implementation

With classic decoder

Arch	Input Size	AP	AR
ViTPose-S	256x192	0.739	0.792
ViTPose-B	256x192	0.757	0.810
ViTPose-L	256x192	0.782	0.834
ViTPose-H	256x192	0.788	0.839

With simple decoder

Arch	Input Size	AP	AR
ViTPose-S	256x192	0.736	0.790
ViTPose-B	256x192	0.756	0.809
ViTPose-L	256x192	0.781	0.833
ViTPose-H	256x192	0.789	0.839

Result of original ViTPose implementation

With classic decoder

Model	Input Size	AP	AR
ViTPose-S	256x192	0.738	0.792
ViTPose-B	256x192	0.758	0.811
ViTPose-L	256x192	0.783	0.835
ViTPose-H	256x192	0.791	0.841

With simple decoder

Model	Input Size	AP	AR
ViTPose-S	256x192	0.735	0.789
ViTPose-B	256x192	0.755	0.809
ViTPose-L	256x192	0.782	0.834
ViTPose-H	256x192	0.789	0.840

codecov · 2023-02-15T13:25:21Z

Codecov Report

Patch coverage: 6.34% and project coverage change: -0.46 ⚠️

Comparison is base (d341f11) 82.22% compared to head (e22d6c8) 81.77%.

❗ Current head e22d6c8 differs from pull request most recent head 23b7838. Consider uploading reports for the commit 23b7838 to get more accurate results

Additional details and impacted files

@@             Coverage Diff             @@
##           dev-1.x    #1876      +/-   ##
===========================================
- Coverage    82.22%   81.77%   -0.46%     
===========================================
  Files          225      227       +2     
  Lines        13375    13438      +63     
  Branches      2269     2285      +16     
===========================================
- Hits         10998    10989       -9     
- Misses        1864     1933      +69     
- Partials       513      516       +3

Flag	Coverage Δ
unittests	`81.77% <6.34%> (-0.46%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
mmpose/engine/optim_wrappers/__init__.py	`0.00% <0.00%> (ø)`
...engine/optim_wrappers/layer_decay_optim_wrapper.py	`0.00% <0.00%> (ø)`
mmpose/models/heads/heatmap_heads/heatmap_head.py	`82.19% <23.07%> (-5.78%)`	⬇️
mmpose/models/heads/base_head.py	`81.81% <33.33%> (-2.31%)`	⬇️

... and 2 files with indirect coverage changes

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

ly015 · 2023-03-09T03:35:04Z

mmpose/models/heads/heatmap_heads/heatmap_head.py

+        extra (dict, optional): Extra configurations.
+            Defaults to ``None``


The argument extra is convenient for extending the head class but may confuse users. We should keep a well-defined interface where every argument has a clear meaning and a detailed usage description in the docstring.

Here I think maybe we can use existing arguments conv_out_channels, conv_kernel_sizes, and has_final_layer to configure the final conv layers.

To keep the code clear and simple, would it be better to split the dictionary into two parameters, e.g. input_upsample (defaults to 0) and final_kernel_size (defaults to 1)?

ly015 · 2023-03-09T03:37:04Z

mmpose/models/heads/heatmap_heads/heatmap_head.py

@@ -101,6 +104,21 @@ def __init__(self,
            self.decoder = KEYPOINT_CODECS.build(decoder)
        else:
            self.decoder = None
+        self.upsample = 0


I suggest adding a new argument, e.g. input_upsample or input_rescale.

- include testing results from original repo - update training results

mm-assistant bot assigned ly015 Dec 12, 2022

LareinaM marked this pull request as ready for review December 12, 2022 03:27

ly015 changed the title ~~Dev 1.x~~ [Feature] Support ViTPose Dec 12, 2022

LareinaM closed this Dec 12, 2022

LareinaM reopened this Dec 12, 2022

jin-s13 mentioned this pull request Jan 4, 2023

Roadmap of MMPose #9

Open

ly015 reviewed Mar 9, 2023

View reviewed changes

LareinaM added 14 commits March 14, 2023 13:39

Add ViTPose implementation on MMPose 1.x

4b4e63c

add pretrained config for backbone

2601cd4

check for existence of attribute before access

ce9b811

fix formats

b424af8

fix formats

d6807fd

fix indentation and import order

4befb89

rename and add algorithm description

73bc29b

follow changes in original repo

8f0e873

Correct structure for simple decoders

4d6102c

Change configs, add val results, rename folder

c46ffc6

Fix formats

9f43c20

Update markdown file

085477e

Update training results

c611933

Update markdown file

23b7838

- include testing results from original repo - update training results

ly015 force-pushed the dev-1.x branch from f53b22a to 23b7838 Compare March 14, 2023 05:39

ly015 approved these changes Mar 14, 2023

View reviewed changes

ly015 merged commit 936fed3 into open-mmlab:dev-1.x Mar 14, 2023

Serdnad mentioned this pull request Apr 13, 2023

May I know the reason why you included the entire mmpose in your repository? ViTAE-Transformer/ViTPose#86

Open

Tau-J mentioned this pull request Apr 20, 2023

Roadmap of MMPose 1.x #2258

Open

11 tasks

shuheilocale pushed a commit to shuheilocale/mmpose that referenced this pull request May 6, 2023

[Feature] Support ViTPose (open-mmlab#1876)

c5531e5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Support ViTPose #1876

[Feature] Support ViTPose #1876

LareinaM commented Dec 12, 2022 •

edited

Loading

jin-s13 commented Dec 12, 2022

LareinaM commented Feb 7, 2023 •

edited

Loading

codecov bot commented Feb 15, 2023 •

edited

Loading

ly015 Mar 9, 2023

LareinaM Mar 9, 2023 •

edited

Loading

ly015 Mar 9, 2023

		extra (dict, optional): Extra configurations.
		Defaults to ``None``

[Feature] Support ViTPose #1876

[Feature] Support ViTPose #1876

Conversation

LareinaM commented Dec 12, 2022 • edited Loading

Motivation

Modification

BC-breaking (Optional)

Use cases (Optional)

Checklist

jin-s13 commented Dec 12, 2022

LareinaM commented Feb 7, 2023 • edited Loading

Result of the current implementation

Result of original ViTPose implementation

codecov bot commented Feb 15, 2023 • edited Loading

Codecov Report

ly015 Mar 9, 2023

Choose a reason for hiding this comment

LareinaM Mar 9, 2023 • edited Loading

Choose a reason for hiding this comment

ly015 Mar 9, 2023

Choose a reason for hiding this comment

LareinaM commented Dec 12, 2022 •

edited

Loading

LareinaM commented Feb 7, 2023 •

edited

Loading

codecov bot commented Feb 15, 2023 •

edited

Loading

LareinaM Mar 9, 2023 •

edited

Loading