Change the repository type filter
All
Repositories list
15 repositories
open-agent-leaderboard
PublicReproducible Language Agent Research- ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
- Real-time and accurate open-vocabulary end-to-end object detection
RS5M
PublicRS5M: a large-scale vision language dataset for remote sensing [TGRS]- Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]
awesome-RSVLM
PublicOVDEval
PublicA Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)GroundVLP
PublicGroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)habitat-lab
Public