Skip to content

Files

Latest commit

d8f4b43 · Dec 15, 2021

History

History
52 lines (39 loc) · 9.11 KB

README.md

File metadata and controls

52 lines (39 loc) · 9.11 KB

Side-Aware Boundary Localization for More Precise Object Detection

Abstract

Current object detection frameworks mainly rely on bounding box regression to localize objects. Despite the remarkable progress in recent years, the precision of bounding box regression remains unsatisfactory, hence limiting performance in object detection. We observe that precise localization requires careful placement of each side of the bounding box. However, the mainstream approach, which focuses on predicting centers and sizes, is not the most effective way to accomplish this task, especially when there exists displacements with large variance between the anchors and the targets. In this paper, we propose an alternative approach, named as Side-Aware Boundary Localization (SABL), where each side of the bounding box is respectively localized with a dedicated network branch. To tackle the difficulty of precise localization in the presence of displacements with large variance, we further propose a two-step localization scheme, which first predicts a range of movement through bucket prediction and then pinpoints the precise position within the predicted bucket. We test the proposed method on both two-stage and single-stage detection frameworks. Replacing the standard bounding box regression branch with the proposed design leads to significant improvements on Faster R-CNN, RetinaNet, and Cascade R-CNN, by 3.0%, 1.7%, and 0.9%, respectively.

Citation

We provide config files to reproduce the object detection results in the ECCV 2020 Spotlight paper for Side-Aware Boundary Localization for More Precise Object Detection.

@inproceedings{Wang_2020_ECCV,
    title = {Side-Aware Boundary Localization for More Precise Object Detection},
    author = {Jiaqi Wang and Wenwei Zhang and Yuhang Cao and Kai Chen and Jiangmiao Pang and Tao Gong and Jianping Shi and Chen Change Loy and Dahua Lin},
    booktitle = {ECCV},
    year = {2020}
}

Results and Models

The results on COCO 2017 val is shown in the below table. (results on test-dev are usually slightly higher than val). Single-scale testing (1333x800) is adopted in all results.

Method Backbone Lr schd ms-train box AP Config Download
SABL Faster R-CNN R-50-FPN 1x N 39.9 config model | log
SABL Faster R-CNN R-101-FPN 1x N 41.7 config model | log
SABL Cascade R-CNN R-50-FPN 1x N 41.6 config model | log
SABL Cascade R-CNN R-101-FPN 1x N 43.0 config model | log
Method Backbone GN Lr schd ms-train box AP Config Download
SABL RetinaNet R-50-FPN N 1x N 37.7 config model | log
SABL RetinaNet R-50-FPN Y 1x N 38.8 config model | log
SABL RetinaNet R-101-FPN N 1x N 39.7 config model | log
SABL RetinaNet R-101-FPN Y 1x N 40.5 config model | log
SABL RetinaNet R-101-FPN Y 2x Y (640~800) 42.9 config model | log
SABL RetinaNet R-101-FPN Y 2x Y (480~960) 43.6 config model | log