Change the repository type filter
All
Repositories list
17 repositories
VLM-R1
PublicSolve Visual Understanding with Reinforced VLMsom-ai-lab.github.io
PublicOmAgent
PublicRS5M
PublicRS5M: a large-scale vision language dataset for remote sensing [TGRS]open-agent-leaderboard
PublicReproducible Language Agent ResearchOmChat
PublicOmAgentDocs
Public- ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
OmDet
PublicReal-time and accurate open-vocabulary end-to-end object detectionVL-CheckList
PublicEvaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]OmModel
Publicawesome-RSVLM
PublicOVDEval
PublicA Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)GroundVLP
PublicGroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)habitat-lab
Public