Skip to content

[AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding

Notifications You must be signed in to change notification settings

IDEA-Research/DQ-DETR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

DQ-DETR

This is the official pytorch implementation of our AAAI 2023 paper DQ-DETR. Code will be available soon!

Authors: Shilong Liu, Yaoyuan Liang, Feng Li, Shijia Huang, Hao Zhang, Hang Su, Jun Zhu, Lei Zhang

phrase extraction and grounding(PEG)

PEG requires a model to extract phrases from text and locate objects from image simultaneously. As phrase extraction can be regarded as a 1D text segmentation problem, we formulate PEG as a dual detection problem.

tasks_comparison

To evaluate the performance of PEG, we also propose a new metric CMAP (cross-modal average precision), analogous to the AP metric in object detection.

cmap

DQ-DETR

As phrase extraction can be regarded as a 1D text segmentation problem, we formulate PEG as a dual detection problem.

dq-detr

Experiments

refcoco flickr30k

About

[AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published