add configs for vit backbone plus decode_heads #520
Merged
39 commits
1c66609
add config
a439a40
add cityscapes config
b7aa7ee
add default value to docstring
4e8383c
Merge branch 'master' of https://github.com/open-mmlab/mmsegmentation…
e7d6243
fix lint
0ce8958
add deit-s and deit-b
6b40465
add readme
6d0ab21
add eps at norm_cfg
c9c6596
Merge branch 'master' of https://github.com/open-mmlab/mmsegmentation…
f7b8c18
add drop_path_rate experiment
d605af6
add deit case at init_weight
425cac7
add upernet result
dd6856e
update result and add upernet 160k config
bd86b64
update upernet result and fix settings
3f8db05
Update iters number
4d77761
update result and delete some configs
656c167
fix import error
7d13836
fix drop_path_rate
8931d7f
update result and restore config
2219cf7
update benchmark result
7f1866e
remove cityscapes exp
845f8f5
merge upstream master
69cb384
remove neck
cebbf6f
neck exp
4c0e952
Merge branch 'master' of https://github.com/open-mmlab/mmsegmentation…
a90ed4a
Merge branch 'master' of https://github.com/open-mmlab/mmsegmentation…
ebc0531
add more configs
e7a6637
Merge branch 'vit_plus_heads' of https://github.com/xiexinch/mmsegmen…
319f56e
fix init error
c4aa7b6
fix ffn setting
8457e67
update result
b0f9e29
update results
74424aa
update result
8c11f62
update results and fill table
637356d
Merge branch 'master' of https://github.com/open-mmlab/mmsegmentation…
31ba78d
delete or rename configs
53ae35c
fix link delimiter
451ae11
rename configs and fix link
8db5cdd
rename neck to mln
@@ -0,0 +1,58 @@
# model settings
norm_cfg = dict(type='SyncBN', requires_grad=True)
model = dict(
    type='EncoderDecoder',
    pretrained='https://github.com/rwightman/pytorch-image-models/releases/download/v0.1-vitjx/jx_vit_base_p16_224-80ecf9dd.pth',  # noqa
    backbone=dict(
        type='VisionTransformer',
        img_size=(512, 512),
        patch_size=16,
        in_channels=3,
        embed_dims=768,
        num_layers=12,
        num_heads=12,
        mlp_ratio=4,
        out_indices=(2, 5, 8, 11),
        qkv_bias=True,
        drop_rate=0.0,
        attn_drop_rate=0.0,
        drop_path_rate=0.0,
        with_cls_token=True,
        norm_cfg=dict(type='LN', eps=1e-6),
        act_cfg=dict(type='GELU'),
        norm_eval=False,
        out_shape='NCHW',
        interpolate_mode='bicubic'),
    neck=dict(
        type='MultiLevelNeck',
        in_channels=[768, 768, 768, 768],
        out_channels=768,
        scales=[4, 2, 1, 0.5]),
    decode_head=dict(
        type='UPerHead',
        in_channels=[768, 768, 768, 768],
        in_index=[0, 1, 2, 3],
        pool_scales=(1, 2, 3, 6),
        channels=512,
        dropout_ratio=0.1,
        num_classes=19,
        norm_cfg=norm_cfg,
        align_corners=False,
        loss_decode=dict(
            type='CrossEntropyLoss', use_sigmoid=False, loss_weight=1.0)),
    auxiliary_head=dict(
        type='FCNHead',
        in_channels=768,
        in_index=3,
        channels=256,
        num_convs=1,
        concat_input=False,
        dropout_ratio=0.1,
        num_classes=19,
        norm_cfg=norm_cfg,
        align_corners=False,
        loss_decode=dict(
            type='CrossEntropyLoss', use_sigmoid=False, loss_weight=0.4)),
    # model training and testing settings
    train_cfg=dict(),
    test_cfg=dict(mode='whole'))  # yapf: disable
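The `patch_size`, `img_size`, and neck `scales` values in the model settings above determine the spatial sizes that reach `UPerHead`. The following is a minimal arithmetic sketch, assuming the backbone emits one feature map per entry in `out_indices` on the patch grid and that the neck resizes each level by its scale factor. The helper names `token_grid` and `neck_output_sizes` are hypothetical and not part of mmsegmentation.

```python
def token_grid(img_size, patch_size):
    """Patch-grid size (rows, cols) for a ViT with non-overlapping patches."""
    height, width = img_size
    return height // patch_size, width // patch_size


def neck_output_sizes(grid, scales):
    """Spatial size of each level after resizing the grid by its scale factor."""
    return [(int(grid[0] * s), int(grid[1] * s)) for s in scales]


# Values taken from the config above.
grid = token_grid((512, 512), 16)
sizes = neck_output_sizes(grid, [4, 2, 1, 0.5])
strides = [512 // h for h, _ in sizes]

print(grid)     # (32, 32)
print(sizes)    # [(128, 128), (64, 64), (32, 32), (16, 16)]
print(strides)  # [4, 8, 16, 32]
```

Under these assumptions the four levels have strides 4, 8, 16, and 32 relative to the input, which is the kind of feature pyramid a CNN backbone would normally hand to UPerNet.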
@@ -0,0 +1,32 @@
# Vision Transformer

## Introduction

<!-- [ALGORITHM] -->

```latex
@article{dosovitskiy2020,
  title={An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale},
  author={Dosovitskiy, Alexey and Beyer, Lucas and Kolesnikov, Alexander and Weissenborn, Dirk and Zhai, Xiaohua and Unterthiner, Thomas and Dehghani, Mostafa and Minderer, Matthias and Heigold, Georg and Gelly, Sylvain and Uszkoreit, Jakob and Houlsby, Neil},
  journal={arXiv preprint arXiv:2010.11929},
  year={2020}
}
```

## Results and models

### ADE20K

| Method | Backbone | Crop Size | Lr schd | Mem (GB) | Inf time (fps) | mIoU | mIoU(ms+flip) | config | download |
| ------- | -------- | --------- | ------: | -------- | -------------- | ----: | ------------: | ------ | -------- |
| UPerNet | ViT-B + MLN | 512x512 | 80000 | 9.20 | 6.94 | 47.71 | 49.51 | [config](https://github.com/open-mmlab/mmsegmentation/blob/master/configs/vit/upernet_vit-b16_mln_512x512_80k_ade20k.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_vit-b16_mln_512x512_80k_ade20k/upernet_vit-b16_mln_512x512_80k_ade20k-0403cee1.pth) &#124; [log](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_vit-b16_mln_512x512_80k_ade20k/20210624_130547.log.json) |
| UPerNet | ViT-B + MLN | 512x512 | 160000 | 9.20 | 7.58 | 46.75 | 48.46 | [config](https://github.com/open-mmlab/mmsegmentation/blob/master/configs/vit/upernet_vit-b16_mln_512x512_160k_ade20k.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_vit-b16_mln_512x512_160k_ade20k/upernet_vit-b16_mln_512x512_160k_ade20k-852fa768.pth) &#124; [log](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_vit-b16_mln_512x512_160k_ade20k/20210623_192432.log.json) |
| UPerNet | ViT-B + LN + MLN | 512x512 | 160000 | 9.21 | 6.82 | 47.73 | 49.95 | [config](https://github.com/open-mmlab/mmsegmentation/blob/master/configs/vit/upernet_vit-b16_ln_mln_512x512_160k_ade20k.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_vit-b16_ln_mln_512x512_160k_ade20k/upernet_vit-b16_ln_mln_512x512_160k_ade20k-f444c077.pth) &#124; [log](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_vit-b16_ln_mln_512x512_160k_ade20k/20210621_172828.log.json) |
| UPerNet | DeiT-S | 512x512 | 80000 | 4.68 | 29.85 | 42.96 | 43.79 | [config](https://github.com/open-mmlab/mmsegmentation/blob/master/configs/vit/upernet_deit-s16_512x512_80k_ade20k.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-s16_512x512_80k_ade20k/upernet_deit-s16_512x512_80k_ade20k-afc93ec2.pth) &#124; [log](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-s16_512x512_80k_ade20k/20210624_095228.log.json) |
| UPerNet | DeiT-S | 512x512 | 160000 | 4.68 | 29.19 | 42.87 | 43.79 | [config](https://github.com/open-mmlab/mmsegmentation/blob/master/configs/vit/upernet_deit-s16_512x512_160k_ade20k.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-s16_512x512_160k_ade20k/upernet_deit-s16_512x512_160k_ade20k-5110d916.pth) &#124; [log](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-s16_512x512_160k_ade20k/20210621_160903.log.json) |
| UPerNet | DeiT-S + MLN | 512x512 | 160000 | 5.69 | 11.18 | 43.82 | 45.07 | [config](https://github.com/open-mmlab/mmsegmentation/blob/master/configs/vit/upernet_deit-s16_mln_512x512_160k_ade20k.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-s16_mln_512x512_160k_ade20k/upernet_deit-s16_mln_512x512_160k_ade20k-fb9a5dfb.pth) &#124; [log](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-s16_mln_512x512_160k_ade20k/20210621_161021.log.json) |
| UPerNet | DeiT-S + LN + MLN | 512x512 | 160000 | 5.69 | 12.39 | 43.52 | 45.01 | [config](https://github.com/open-mmlab/mmsegmentation/blob/master/configs/vit/upernet_deit-s16_ln_mln_512x512_160k_ade20k.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-s16_ln_mln_512x512_160k_ade20k/upernet_deit-s16_ln_mln_512x512_160k_ade20k-c0cd652f.pth) &#124; [log](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-s16_ln_mln_512x512_160k_ade20k/20210621_161021.log.json) |
| UPerNet | DeiT-B | 512x512 | 80000 | 7.75 | 9.69 | 45.24 | 46.73 | [config](https://github.com/open-mmlab/mmsegmentation/blob/master/configs/vit/upernet_deit-b16_512x512_80k_ade20k.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-b16_512x512_80k_ade20k/upernet_deit-b16_512x512_80k_ade20k-1e090789.pth) &#124; [log](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-b16_512x512_80k_ade20k/20210624_130529.log.json) |
| UPerNet | DeiT-B | 512x512 | 160000 | 7.75 | 10.39 | 45.36 | 47.16 | [config](https://github.com/open-mmlab/mmsegmentation/blob/master/configs/vit/upernet_deit-b16_512x512_160k_ade20k.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-b16_512x512_160k_ade20k/upernet_deit-b16_512x512_160k_ade20k-828705d7.pth) &#124; [log](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-b16_512x512_160k_ade20k/20210621_180100.log.json) |
| UPerNet | DeiT-B + MLN | 512x512 | 160000 | 9.21 | 7.78 | 45.46 | 47.16 | [config](https://github.com/open-mmlab/mmsegmentation/blob/master/configs/vit/upernet_deit-b16_mln_512x512_160k_ade20k.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-b16_mln_512x512_160k_ade20k/upernet_deit-b16_mln_512x512_160k_ade20k-4e1450f3.pth) &#124; [log](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-b16_mln_512x512_160k_ade20k/20210621_191949.log.json) |
| UPerNet | DeiT-B + LN + MLN | 512x512 | 160000 | 9.21 | 7.75 | 45.37 | 47.23 | [config](https://github.com/open-mmlab/mmsegmentation/blob/master/configs/vit/upernet_deit-b16_ln_mln_512x512_160k_ade20k.py) | [model](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-b16_ln_mln_512x512_160k_ade20k/upernet_deit-b16_ln_mln_512x512_160k_ade20k-8a959c14.pth) &#124; [log](https://download.openmmlab.com/mmsegmentation/v0.5/vit/upernet_deit-b16_ln_mln_512x512_160k_ade20k/20210623_153535.log.json) |
@@ -0,0 +1,6 @@
_base_ = './upernet_vit-b16_mln_512x512_160k_ade20k.py'

model = dict(
    pretrained='https://dl.fbaipublicfiles.com/deit/deit_base_patch16_224-b5f2ef4d.pth',  # noqa
    backbone=dict(drop_path_rate=0.1),
    neck=None)  # yapf: disable
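The DeiT configs in this PR rely on `_base_` inheritance: nested dicts in the child are merged into the base, and a non-dict value such as `neck=None` replaces the base entry outright. The sketch below is a simplified, standalone illustration of that merge behaviour as I understand it; it is not MMCV's actual implementation, and `merge_config` plus the shortened example values (`'vit.pth'`, `'deit.pth'`) are hypothetical.

```python
import copy

DELETE_KEY = "_delete_"


def merge_config(child, base):
    """Overlay `child` onto a deep copy of `base`, merging nested dicts."""
    merged = copy.deepcopy(base)
    for key, value in child.items():
        if isinstance(value, dict):
            value = dict(value)  # avoid mutating the caller's dict
            delete = value.pop(DELETE_KEY, False)
            if not delete and isinstance(merged.get(key), dict):
                # Recurse so that untouched base keys survive.
                merged[key] = merge_config(value, merged[key])
            else:
                # `_delete_=True` (or no base dict) replaces the entry.
                merged[key] = copy.deepcopy(value)
        else:
            # Scalars, lists, and None replace the base value.
            merged[key] = copy.deepcopy(value)
    return merged


# Shortened stand-ins for the base and child configs.
base = dict(model=dict(
    pretrained="vit.pth",
    backbone=dict(embed_dims=768, drop_path_rate=0.0),
    neck=dict(type="MultiLevelNeck")))
child = dict(model=dict(
    pretrained="deit.pth",
    backbone=dict(drop_path_rate=0.1),
    neck=None))

merged = merge_config(child, base)
print(merged["model"]["backbone"])  # {'embed_dims': 768, 'drop_path_rate': 0.1}
print(merged["model"]["neck"])      # None
```

In this reading, `neck=None` is what turns a "+ MLN" base config into a plain DeiT variant without repeating the rest of the model settings.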
@@ -0,0 +1,6 @@
_base_ = './upernet_vit-b16_mln_512x512_80k_ade20k.py'

model = dict(
    pretrained='https://dl.fbaipublicfiles.com/deit/deit_base_patch16_224-b5f2ef4d.pth',  # noqa
    backbone=dict(drop_path_rate=0.1),
    neck=None)  # yapf: disable
@@ -0,0 +1,5 @@
_base_ = './upernet_vit-b16_mln_512x512_160k_ade20k.py'

model = dict(
    pretrained='https://dl.fbaipublicfiles.com/deit/deit_base_patch16_224-b5f2ef4d.pth',  # noqa
    backbone=dict(drop_path_rate=0.1, final_norm=True))  # yapf: disable
@@ -0,0 +1,5 @@
_base_ = './upernet_vit-b16_mln_512x512_160k_ade20k.py'

model = dict(
    pretrained='https://dl.fbaipublicfiles.com/deit/deit_base_patch16_224-b5f2ef4d.pth',  # noqa
    backbone=dict(drop_path_rate=0.1),)  # yapf: disable
@@ -0,0 +1,8 @@
_base_ = './upernet_vit-b16_mln_512x512_160k_ade20k.py'

model = dict(
    pretrained='https://dl.fbaipublicfiles.com/deit/deit_small_patch16_224-cd65a155.pth',  # noqa
    backbone=dict(num_heads=6, embed_dims=384, drop_path_rate=0.1),
    decode_head=dict(num_classes=150, in_channels=[384, 384, 384, 384]),
    neck=None,
    auxiliary_head=dict(num_classes=150, in_channels=384))  # yapf: disable
@@ -0,0 +1,8 @@
_base_ = './upernet_vit-b16_mln_512x512_80k_ade20k.py'

model = dict(
    pretrained='https://dl.fbaipublicfiles.com/deit/deit_small_patch16_224-cd65a155.pth',  # noqa
    backbone=dict(num_heads=6, embed_dims=384, drop_path_rate=0.1),
    decode_head=dict(num_classes=150, in_channels=[384, 384, 384, 384]),
    neck=None,
    auxiliary_head=dict(num_classes=150, in_channels=384))  # yapf: disable
configs/vit/upernet_deit-s16_ln_mln_512x512_160k_ade20k.py
@@ -0,0 +1,12 @@
_base_ = './upernet_vit-b16_mln_512x512_160k_ade20k.py'

model = dict(
    pretrained='https://dl.fbaipublicfiles.com/deit/deit_small_patch16_224-cd65a155.pth',  # noqa
    backbone=dict(
        num_heads=6,
        embed_dims=384,
        drop_path_rate=0.1,
        final_norm=True),
    decode_head=dict(num_classes=150, in_channels=[384, 384, 384, 384]),
    neck=dict(in_channels=[384, 384, 384, 384], out_channels=384),
    auxiliary_head=dict(num_classes=150, in_channels=384))  # yapf: disable
@@ -0,0 +1,8 @@
_base_ = './upernet_vit-b16_mln_512x512_160k_ade20k.py'

model = dict(
    pretrained='https://dl.fbaipublicfiles.com/deit/deit_small_patch16_224-cd65a155.pth',  # noqa
    backbone=dict(num_heads=6, embed_dims=384, drop_path_rate=0.1),
    decode_head=dict(num_classes=150, in_channels=[384, 384, 384, 384]),
    neck=dict(in_channels=[384, 384, 384, 384], out_channels=384),
    auxiliary_head=dict(num_classes=150, in_channels=384))  # yapf: disable
@@ -0,0 +1,38 @@
_base_ = [
    '../_base_/models/upernet_vit-b16_ln_mln.py',
    '../_base_/datasets/ade20k.py', '../_base_/default_runtime.py',
    '../_base_/schedules/schedule_160k.py'
]

model = dict(
    backbone=dict(drop_path_rate=0.1, final_norm=True),
    decode_head=dict(num_classes=150),
    auxiliary_head=dict(num_classes=150))

# AdamW optimizer, no weight decay for position embedding & layer norm
# in backbone
optimizer = dict(
    _delete_=True,
    type='AdamW',
    lr=0.00006,
    betas=(0.9, 0.999),
    weight_decay=0.01,
    paramwise_cfg=dict(
        custom_keys={
            'pos_embed': dict(decay_mult=0.),
            'cls_token': dict(decay_mult=0.),
            'norm': dict(decay_mult=0.)
        }))

lr_config = dict(
    _delete_=True,
    policy='poly',
    warmup='linear',
    warmup_iters=1500,
    warmup_ratio=1e-6,
    power=1.0,
    min_lr=0.0,
    by_epoch=False)

# By default, models are trained on 8 GPUs with 2 images per GPU
data = dict(samples_per_gpu=2)
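The `paramwise_cfg` block above switches off weight decay for selected parameters. As a rough sketch of the effect, assuming keys in `custom_keys` are matched as substrings of the full parameter name and that the longest matching key wins, the per-parameter weight decay could be computed as follows. The function `weight_decay_for` and the parameter names in the usage lines are hypothetical, chosen only for illustration.

```python
BASE_WEIGHT_DECAY = 0.01  # weight_decay from the optimizer config above

# decay_mult values from custom_keys in the config above.
CUSTOM_KEYS = {"pos_embed": 0.0, "cls_token": 0.0, "norm": 0.0}


def weight_decay_for(param_name, base_wd=BASE_WEIGHT_DECAY, custom_keys=CUSTOM_KEYS):
    """Return the effective weight decay for one named parameter."""
    # Try longer keys first so the most specific key takes precedence.
    for key in sorted(custom_keys, key=len, reverse=True):
        if key in param_name:
            return base_wd * custom_keys[key]
    return base_wd


# Hypothetical parameter names, for illustration only.
print(weight_decay_for("backbone.pos_embed"))                 # 0.0
print(weight_decay_for("backbone.blocks.0.norm1.weight"))     # 0.0
print(weight_decay_for("backbone.blocks.0.attn.qkv.weight"))  # 0.01
```

Because the match is on substrings, the rule only takes effect if the model's parameter names actually contain `pos_embed`, `cls_token`, or `norm`; it is worth checking the names of the instantiated model when adapting this config.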
@@ -0,0 +1,36 @@
_base_ = [
    '../_base_/models/upernet_vit-b16_ln_mln.py',
    '../_base_/datasets/ade20k.py', '../_base_/default_runtime.py',
    '../_base_/schedules/schedule_160k.py'
]

model = dict(
    decode_head=dict(num_classes=150), auxiliary_head=dict(num_classes=150))

# AdamW optimizer, no weight decay for position embedding & layer norm
# in backbone
optimizer = dict(
    _delete_=True,
    type='AdamW',
    lr=0.00006,
    betas=(0.9, 0.999),
    weight_decay=0.01,
    paramwise_cfg=dict(
        custom_keys={
            'pos_embed': dict(decay_mult=0.),
            'cls_token': dict(decay_mult=0.),
            'norm': dict(decay_mult=0.)
        }))

lr_config = dict(
    _delete_=True,
    policy='poly',
    warmup='linear',
    warmup_iters=1500,
    warmup_ratio=1e-6,
    power=1.0,
    min_lr=0.0,
    by_epoch=False)

# By default, models are trained on 8 GPUs with 2 images per GPU
data = dict(samples_per_gpu=2)
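The `lr_config` above combines a linear warmup with polynomial decay. The sketch below shows one plausible reading of that schedule for the 160k-iteration setting: the poly-decayed rate is computed first and then scaled during the first 1500 iterations. The formulas reflect my understanding of the usual poly and linear-warmup rules and are an assumption, not a copy of the training framework's code; `poly_lr` and `lr_at` are hypothetical names.

```python
def poly_lr(iteration, base_lr=6e-5, max_iters=160_000, power=1.0, min_lr=0.0):
    """Polynomial decay from base_lr to min_lr over max_iters iterations."""
    coeff = (1 - iteration / max_iters) ** power
    return (base_lr - min_lr) * coeff + min_lr


def lr_at(iteration, warmup_iters=1500, warmup_ratio=1e-6, **poly_kwargs):
    """Learning rate at `iteration`, with linear warmup on top of poly decay."""
    regular = poly_lr(iteration, **poly_kwargs)
    if iteration >= warmup_iters:
        return regular
    # k shrinks linearly from (1 - warmup_ratio) to 0 during warmup,
    # so the rate grows from regular * warmup_ratio up to regular.
    k = (1 - iteration / warmup_iters) * (1 - warmup_ratio)
    return regular * (1 - k)


for it in (0, 750, 1500, 80_000, 160_000):
    print(it, lr_at(it))
```

With `power=1.0` the decay is a straight line, so under these assumptions the rate at the halfway point (iteration 80000) is half of the base value of 6e-5.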
@@ -0,0 +1,36 @@
_base_ = [
    '../_base_/models/upernet_vit-b16_ln_mln.py',
    '../_base_/datasets/ade20k.py', '../_base_/default_runtime.py',
    '../_base_/schedules/schedule_80k.py'
]

model = dict(
    decode_head=dict(num_classes=150), auxiliary_head=dict(num_classes=150))

# AdamW optimizer, no weight decay for position embedding & layer norm
# in backbone
optimizer = dict(
    _delete_=True,
    type='AdamW',
    lr=0.00006,
    betas=(0.9, 0.999),
    weight_decay=0.01,
    paramwise_cfg=dict(
        custom_keys={
            'pos_embed': dict(decay_mult=0.),
            'cls_token': dict(decay_mult=0.),
            'norm': dict(decay_mult=0.)
        }))

lr_config = dict(
    _delete_=True,
    policy='poly',
    warmup='linear',
    warmup_iters=1500,
    warmup_ratio=1e-6,
    power=1.0,
    min_lr=0.0,
    by_epoch=False)

# By default, models are trained on 8 GPUs with 2 images per GPU
data = dict(samples_per_gpu=2)
Why use `xavier_init` for `Conv2d`? `kaiming_init` for `ConvModule` is used in MMCV.
Followed FPN
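For context on this exchange, the two initializers differ mainly in how they scale the initial weights. Below is a minimal sketch of the standard textbook formulas (the Glorot/Xavier uniform bound, and the He/Kaiming normal standard deviation with ReLU gain), evaluated for a hypothetical 1x1 convolution with 768 input and 768 output channels. The function names are illustrative and are not MMCV's API, and the choice of fan mode for Kaiming is an assumption.

```python
import math


def conv_fans(in_channels, out_channels, kernel_size):
    """Fan-in and fan-out of a square 2D convolution kernel."""
    receptive_field = kernel_size * kernel_size
    return in_channels * receptive_field, out_channels * receptive_field


def xavier_uniform_bound(fan_in, fan_out, gain=1.0):
    """Weights are drawn from U(-bound, bound)."""
    return gain * math.sqrt(6.0 / (fan_in + fan_out))


def kaiming_normal_std(fan, gain=math.sqrt(2.0)):
    """Weights are drawn from N(0, std^2); gain sqrt(2) corresponds to ReLU."""
    return gain / math.sqrt(fan)


# Hypothetical layer: 1x1 conv, 768 -> 768 channels.
fan_in, fan_out = conv_fans(768, 768, kernel_size=1)
bound = xavier_uniform_bound(fan_in, fan_out)
xavier_std = bound / math.sqrt(3.0)   # std of U(-bound, bound)
kaiming_std = kaiming_normal_std(fan_out)

print(bound)                      # 0.0625
print(xavier_std < kaiming_std)   # True
```

For this layer shape the Kaiming scheme starts from a larger weight spread than Xavier; which one suits the neck better is an empirical question that the thread settles by following the FPN convention.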