[Refactor] Using mmcv transformer bricks to refactor vit. #571
Conversation
Codecov Report

```diff
@@            Coverage Diff             @@
##           master     #571      +/-   ##
==========================================
- Coverage   85.95%   85.45%   -0.50%
==========================================
  Files         101      101
  Lines        5234     5220      -14
  Branches      828      840      +12
==========================================
- Hits         4499     4461      -38
- Misses        561      586      +25
+ Partials      174      173       -1
```

Flags with carried forward coverage won't be shown. Continue to review the full report at Codecov.
```diff
@@ -0,0 +1,53 @@
+import logging
```
Is this file necessary?
mmseg/models/utils/helpers.py
Outdated
```python
import collections.abc
from itertools import repeat


# From PyTorch internals
def _ntuple(n):

    def parse(x):
        if isinstance(x, collections.abc.Iterable):
            return x
        return tuple(repeat(x, n))

    return parse


to_1tuple = _ntuple(1)
to_2tuple = _ntuple(2)
to_3tuple = _ntuple(3)
to_4tuple = _ntuple(4)
to_ntuple = _ntuple
```
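For reference, the `_ntuple` helper above is self-contained, so its behavior is easy to check in isolation: scalars are repeated `n` times, while iterables pass through unchanged. A quick sketch:

```python
import collections.abc
from itertools import repeat


def _ntuple(n):

    def parse(x):
        # iterables pass through unchanged; scalars are repeated n times
        if isinstance(x, collections.abc.Iterable):
            return x
        return tuple(repeat(x, n))

    return parse


to_2tuple = _ntuple(2)

print(to_2tuple(7))       # (7, 7)
print(to_2tuple((3, 5)))  # (3, 5)
```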
This file is not necessary. Use `from torch.nn.modules.utils import _pair as to_2tuple` instead.
mmseg/models/backbones/vit.py
Outdated
```python
# We only implement the 'jax_impl' initialization implemented at
# https://github.com/rwightman/pytorch-image-models/blob/master/timm/models/vision_transformer.py#L353  # noqa: E501
trunc_normal_(self.pos_embed, std=.02)
trunc_normal_(self.cls_token, std=.02)
for n, m in self.named_modules():
    if isinstance(m, Linear):
        trunc_normal_(m.weight, std=.02)
        if m.bias is not None:
            if 'mlp' in n:
                normal_init(m.bias, std=1e-6)
            else:
                constant_init(m.bias, 0)
    elif isinstance(m, Conv2d):
        kaiming_init(m.weight, mode='fan_in')
        if m.bias is not None:
            constant_init(m.bias, 0)
    elif isinstance(m, (_BatchNorm, nn.GroupNorm, nn.LayerNorm)):
        constant_init(m.bias, 0)
        constant_init(m.weight, 1.0)
else:
    raise TypeError('pretrained must be a str or None')
# Modified from ClassyVision
nn.init.normal_(self.pos_embed, std=0.02)
```
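As context for the `trunc_normal_` calls in the snippet above: a truncated normal initializer draws from a Gaussian but discards samples beyond a cutoff. A minimal pure-Python sketch of the idea (my own illustration via rejection sampling, not timm's implementation, which uses an inverse-CDF method with absolute bounds `a` and `b`), truncating at two standard deviations:

```python
import random


def trunc_normal(mean=0.0, std=0.02, n=1000):
    """Rejection-sample n values from N(mean, std), redrawing any
    sample that falls more than two standard deviations from the mean."""
    out = []
    while len(out) < n:
        x = random.gauss(mean, std)
        if abs(x - mean) <= 2 * std:
            out.append(x)
    return out


samples = trunc_normal()
# with std=0.02, every sample lies within [-0.04, 0.04]
```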
Why remove the initialization?
1. Use timm-style init_weights; 2. Remove to_xtuple and trunc_normal_;
mmseg/models/backbones/vit.py
Outdated
```diff
@@ -330,10 +299,17 @@ def init_weights(self, pretrained=None):
             else:
                 state_dict = checkpoint

+            if 'rwightman/pytorch-image-models' in pretrained:
```
If the user has downloaded the weights from timm and wants to initialize the model from a local path, this condition does not hold.
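One way to see the reviewer's point: a checkpoint loaded from a local file contains no repository substring to match, so sniffing the string is fragile. A small sketch (a hypothetical helper, not code from this PR) that distinguishes a URL from a local path with the standard library instead:

```python
from urllib.parse import urlparse


def is_url(path):
    """Return True if the string parses as an http(s) URL."""
    return urlparse(path).scheme in ('http', 'https')


print(is_url('https://example.com/vit_timm.pth'))   # True
print(is_url('/home/user/checkpoints/vit_timm.pth'))  # False
```

Even with such a check, the weight *style* (timm vs. mmcls) still cannot be inferred from the location, which is why an explicit argument is the safer design.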
.github/workflows/build.yml
Outdated
```diff
@@ -37,19 +37,26 @@ jobs:
         include:
           - torch: 1.3.0+cpu
             torchvision: 0.4.1+cpu
+            torch_version: 1.3.0
```
Reset this file.
```python
with_cp (bool): Use checkpoint or not. Using checkpoint will save
    some memory while slowing down the training speed. Default: False.
pretrain_style (str): Choose to use timm or mmcls pretrain weights.
    Default: timm.
```
We should explain which options are supported, and add an assert.
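A sketch of what that could look like (the helper name and message wording are my assumption; only the `pretrain_style` argument and its `timm`/`mmcls` options come from this PR):

```python
def check_pretrain_style(pretrain_style):
    """Validate pretrain_style explicitly so unsupported values
    fail fast with a clear error message."""
    supported = ('timm', 'mmcls')
    assert pretrain_style in supported, (
        f'pretrain_style must be one of {supported}, '
        f'got {pretrain_style!r}')
    return pretrain_style


check_pretrain_style('timm')  # passes silently
```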
…#571)
* [Refactor] Using mmcv bricks to refactor vit
* Follow the vit code structure from mmclassification
* Add MMCV install into CI system.
* Add to 'Install MMCV' CI item
* Add 'Install MMCV_CPU' and 'Install MMCV_GPU' CI items
* Fix & Add: 1. Fix low code coverage of vit.py; 2. Remove HybridEmbed; 3. Fix doc string of VisionTransformer;
* Add helpers unit test.
* Add converter to convert vit pretrain weights from timm style to mmcls style.
* Clean some redundant code and refactor init: 1. Use timm style init_weights; 2. Remove to_xtuple and trunc_normal_;
* Add comments for VisionTransformer.init_weights()
* Add arg: pretrain_style to choose timm or mmcls vit pretrain weights.
The foundation of this PR:
mmcv: open-mmlab/mmcv#978 (merged)
mmclassification: open-mmlab/mmpretrain#295