Skip to content

Latest commit

 

History

History

topformer

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation

Reference

Zhang, Wenqiang, Zilong Huang, Guozhong Luo, Tao Chen, Xinggang Wang, Wenyu Liu, Gang Yu,and Chunhua Shen. "TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation." In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12083-12093. 2022.

Performance

ADE20k

Model Backbone Resolution Training Iters mIoU mIoU (flip) mIoU (ms+flip) Links
TopFormer-Base topformer 512x512 160000 38.28% 38.59% - model | log | vdl
TopFormer-Small topformer 512x512 160000 35.60% 35.83% - model | log | vdl
TopFormer-Tiny topformer 512x512 160000 32.49% 32.75% - model | log | vdl

Note that, the input resulution of TopFormer should be a multiple of 32.