This is the official pytorch implementation of our AAAI 2023 paper DQ-DETR. Code will be available soon!
Authors: Shilong Liu, Yaoyuan Liang, Feng Li, Shijia Huang, Hao Zhang, Hang Su, Jun Zhu, Lei Zhang
PEG requires a model to extract phrases from text and locate objects from image simultaneously. As phrase extraction can be regarded as a 1D text segmentation problem, we formulate PEG as a dual detection problem.
To evaluate the performance of PEG, we also propose a new metric CMAP (cross-modal average precision), analogous to the AP metric in object detection.
As phrase extraction can be regarded as a 1D text segmentation problem, we formulate PEG as a dual detection problem.