scottn@foxmail.com
- Lorenzo Bianchi, Fabio Carrara, Nicola Messina, Claudio Gennaro, Fabrizio Falchi. The Devil is in the Fine-Grained Details: Evaluating Open-Vocabulary Object Detectors for Fine-Grained Understanding. arxiv 2023. [paper]
- MIC: Zhao Wang, Aoxue Li, Fengwei Zhou, Zhenguo Li, Qi Dou. Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization. BMVC 2023. [paper]
- CoDet: Chuofan Ma, Yi Jiang, Xin Wen, Zehuan Yuan, Xiaojuan Qi. CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection. NeurIPS 2023. [paper] [code]
- DE-ViT: Xinyu Zhang, Yuting Wang, Abdeslam Boularias. Detect Every Thing with Few Examples. GCPR 2023. [paper] [code]
- DITO: Dahun Kim, Anelia Angelova, Weicheng Kuo. Detection-Oriented Image-Text Pretraining for Open-Vocabulary Detection. arxiv 2023. [paper] [code]
- CFM-ViT: Dahun Kim, Anelia Angelova, Weicheng Kuo. Contrastive Feature Masking Open-Vocabulary Vision Transformer. ICCV 2023. [paper]
- EdaDet: Cheng Shi, Sibei Yang. EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment. ICCV 2023. [paper]
- Jianzong Wu, Xiangtai Li, Henghui Ding, Xia Li, Guangliang Cheng, Yunhai Tong, Chen Change Loy. Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation. ICCV 2023. [paper] [code]
- Jincheng Li, Chunyu Xie, Xiaoyu Wu, Bin Wang, Dawei Leng. What Makes Good Open-Vocabulary Detector: A Disassembling Perspective. KDD workshop 2023. [paper]
- MMC-Det: Yifan Xu, Mengdan Zhang, Xiaoshan Yang, Changsheng Xu. Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection. arxiv 2023. [paper]
- OVDEval: Yiyang Yao, Peng Liu, Tiancheng Zhao, Qianqian Zhang, Jiajia Liao, Chunxin Fang, Kyusong Lee, Qing Wang. How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection. arxiv 2023. [paper] [code]
- SAS-Det: Shiyu Zhao, Samuel Schulter, Long Zhao, Zhixing Zhang, Vijay Kumar B. G, Yumin Suh, Manmohan Chandraker, Dimitris N. Metaxas. Improving Pseudo Labels for Open-Vocabulary Object Detection. arxiv 2023. [paper]
- Chaoyang Zhu, Long Chen. A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future. arxiv 2023. [paper]
- UOVN: Hengcan Shi, Munawar Hayat, Jianfei Cai. Unified Open-Vocabulary Dense Visual Prediction. arxiv 2023. [paper]
- SGDN: Hengcan Shi, Munawar Hayat, Jianfei Cai. Open-Vocabulary Object Detection via Scene Graph Discovery. arxiv 2023. [paper]
- OWL-ST: Matthias Minderer, Alexey Gritsenko, Neil Houlsby. Scaling Open-Vocabulary Object Detection. arxiv 2023. [paper]
- Prannay Kaul, Weidi Xie, Andrew Zisserman. Multi-Modal Classifiers for Open-Vocabulary Object Detection. ICML 2023. [paper][code]
- OpenSeeD: Hao Zhang, Feng Li, Xueyan Zou, Shilong Liu, Chunyuan Li, Jianfeng Gao, Jianwei Yang, Lei Zhang. A Simple Framework for Open-Vocabulary Segmentation and Detection. arXiv 2023. [paper] [code]
- Relja Arandjelović, Alex Andonian, Arthur Mensch, Olivier J. Hénaff, Jean-Baptiste Alayrac, Andrew Zisserman. Three Ways to Improve Feature Alignment for Open Vocabulary Eetection. arXiv 2023. [paper]
- Prompt-OVD: Hwanjun Song, Jihwan Bang. Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection. arXiv 2023. [paper]
- PCL: Han-Cheol Cho, Won Young Jhoo, Wooyoung Kang, Byungseok Roh. Open-Vocabulary Object Detection using Pseudo Caption Labels. arXiv 2023. [paper]
- CORA: Xiaoshi Wu, Feng Zhu, Rui Zhao, Hongsheng Li. CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching. CVPR 2023. [paper] [code]
- Luting Wang, Yi Liu, Penghui Du, Zihan Ding, Yue Liao, Qiaosong Qi, Biaolong Chen, Si Liu. Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection. CVPR 2023. [paper] [code]
- BARON: Size Wu, Wenwei Zhang, Sheng Jin, Wentao Liu, Chen Change Loy. Aligning Bag of Regions for Open-Vocabulary Object Detection. CVPR 2023. [paper] [code]
- RO-ViT: Dahun Kim, Anelia Angelova, Weicheng Kuo. Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers. CVPR 2023. [paper] [code]
- DetCLIPv2: Lewei Yao, Jianhua Han, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Hang Xu. DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment. CVPR 2023. [paper]
- CondHead: Tao Wang. Learning to Detect and Segment for Open Vocabulary Object Detection. CVPR 2023. [paper]
- F-VLM: Weicheng Kuo, Yin Cui, Xiuye Gu, AJ Piergiovanni, Anelia Angelova. F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models. ICLR 2023. [paper] [code]
- VLDet: Chuang Lin, Peize Sun, Yi Jiang, Ping Luo, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan, Jianfei Cai. Learning Object-Language Alignments for Open-Vocabulary Object Detection. ICLR 2023. [paper] [code]
- VTP-OVD: Yanxin Long, Jianhua Han, Runhui Huang, Xu Hang, Yi Zhu, Chunjing Xu, Xiaodan Liang. P3OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection. arXiv 2022. [paper]
- MEDet: Peixian Chen, Kekai Sheng, Mengdan Zhang, Yunhang Shen, Ke Li, Chunhua Shen. Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization. arXiv 2022. [paper] [code]
- LocOV: Maria A. Bravo, Sudhanshu Mittal, Thomas Brox. Localized Vision-Language Matching for Open-vocabulary Object Detection. DAGM German Conference on Pattern Recognition (GCPR) 2022. [paper] [code]
- Object-Centric-OVD: Hanoona Rasheed, Muhammad Maaz, Muhammad Uzair Khattak, Salman Khan, Fahad Shahbaz Khan. Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection. NeurIPS 2022. [paper] [code]
- VL-PLM: Shiyu Zhao, Zhixing Zhang, Samuel Schulter, Long Zhao, Vijay Kumar B.G, Anastasis Stathopoulos, Manmohan Chandraker, Dimitris Metaxas. Exploiting Unlabeled Data with Vision and Language Models for Object Detection. ECCV 2022. [paper] [code]
- PromptDet: Chengjian Feng, Yujie Zhong, Zequn Jie, Xiangxiang Chu, Haibing Ren, Xiaolin Wei, Weidi Xie, Lin Ma. PromptDet: Towards Open-vocabulary Detection using Uncurated Images. ECCV 2022. [paper] [code]
- OpenSeg: Golnaz Ghiasi, Xiuye Gu, Yin Cui, Tsung-Yi Lin. Scaling Open-Vocabulary Image Segmentation with Image-Level Labels. ECCV 2022. [paper] [code]
- OV-DETR: Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy. Open-Vocabulary DETR with Conditional Matching. ECCV 2022. [paper] [code]
- PB-OVD: Mingfei Gao, Chen Xing, Juan Carlos Niebles, Junnan Li, Ran Xu, Wenhao Liu, Caiming Xiong. Open Vocabulary Object Detection with Pseudo Bounding-Box Labels. ECCV 2022. [paper] [code]
- OWL-ViT: Matthias Minderer, Alexey Gritsenko, Austin Stone, Maxim Neumann, Dirk Weissenborn, Alexey Dosovitskiy, Aravindh Mahendran, Anurag Arnab, Mostafa Dehghani, Zhuoran Shen, Xiao Wang, Xiaohua Zhai, Thomas Kipf, Neil Houlsby. Simple Open-Vocabulary Object Detection with Vision Transformers. ECCV 2022. [paper] [code]
- RegionCLIP: Yiwu Zhong, Jianwei Yang, Pengchuan Zhang, Chunyuan Li, Noel Codella, Liunian Harold Li, Luowei Zhou, Xiyang Dai, Lu Yuan, Yin Li, Jianfeng Gao. RegionCLIP: Region-Based Language-Image Pretraining. CVPR 2022. [paper] [code]
- XPM: Dat Huynh, Jason Kuen, Zhe Lin, Jiuxiang Gu, Ehsan Elhamifar. Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling. CVPR 2022. [paper] [code]
- HierKD: Zongyang Ma, Guan Luo, Jin Gao, Liang Li, Yuxin Chen, Shaoru Wang, Congxuan Zhang, Weiming Hu. Open-Vocabulary One-Stage Detection With Hierarchical Visual-Language Knowledge Distillation. CVPR 2022. [paper] [code]
- DetPro: Yu Du, Fangyun Wei, Zihe Zhang, Miaojing Shi, Yue Gao, Guoqi Li. Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model. CVPR 2022. [paper] [code]
- ViLD: Xiuye Gu, Tsung-Yi Lin, Weicheng Kuo, Yin Cui. Open-vocabulary Object Detection via Vision and Language Knowledge Distillation. ICLR 2022. [paper] [code]