Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
emanet_r101-d8_4xb2-80k_cityscapes-512x1024.py		emanet_r101-d8_4xb2-80k_cityscapes-512x1024.py
emanet_r101-d8_4xb2-80k_cityscapes-769x769.py		emanet_r101-d8_4xb2-80k_cityscapes-769x769.py
emanet_r50-d8_4xb2-80k_cityscapes-512x1024.py		emanet_r50-d8_4xb2-80k_cityscapes-512x1024.py
emanet_r50-d8_4xb2-80k_cityscapes-769x769.py		emanet_r50-d8_4xb2-80k_cityscapes-769x769.py
metafile.yaml		metafile.yaml

README.md

EMANet

Expectation-Maximization Attention Networks for Semantic Segmentation

Introduction

Official Repo

Code Snippet

Abstract

Self-attention mechanism has been widely used for various tasks. It is designed to compute the representation of each position by a weighted sum of the features at all positions. Thus, it can capture long-range relations for computer vision tasks. However, it is computationally consuming. Since the attention maps are computed w.r.t all other positions. In this paper, we formulate the attention mechanism into an expectation-maximization manner and iteratively estimate a much more compact set of bases upon which the attention maps are computed. By a weighted summation upon these bases, the resulting representation is low-rank and deprecates noisy information from the input. The proposed Expectation-Maximization Attention (EMA) module is robust to the variance of input and is also friendly in memory and computation. Moreover, we set up the bases maintenance and normalization methods to stabilize its training procedure. We conduct extensive experiments on popular semantic segmentation benchmarks including PASCAL VOC, PASCAL Context and COCO Stuff, on which we set new records.

Results and models

Cityscapes

Method	Backbone	Crop Size	Lr schd	Mem (GB)	Inf time (fps)	Device	mIoU	mIoU(ms+flip)	config	download
EMANet	R-50-D8	512x1024	80000	5.4	4.58	V100	77.59	79.44	config	model \| log
EMANet	R-101-D8	512x1024	80000	6.2	2.87	V100	79.10	81.21	config	model \| log
EMANet	R-50-D8	769x769	80000	8.9	1.97	V100	79.33	80.49	config	model \| log
EMANet	R-101-D8	769x769	80000	10.1	1.22	V100	79.62	81.00	config	model \| log

Citation

@inproceedings{li2019expectation,
  title={Expectation-maximization attention networks for semantic segmentation},
  author={Li, Xia and Zhong, Zhisheng and Wu, Jianlong and Yang, Yibo and Lin, Zhouchen and Liu, Hong},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  pages={9167--9176},
  year={2019}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

emanet

emanet

README.md

EMANet

Introduction

Abstract

Results and models

Cityscapes

Citation

Files

emanet

Directory actions

More options

Directory actions

More options

Latest commit

History

emanet

Folders and files

parent directory

README.md

EMANet

Introduction

Abstract

Results and models

Cityscapes

Citation