-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GLIP #12
Comments
Hi, Thanks for your interest in our work. Best, |
@Charles-Xie Hi Chi, thanks for the reply. I'm wondering what do you think about the similarity and differences between D3 and Omnilabel dataset(OmniLabel: A Challenging Benchmark for Language-Based Object Detection)? |
@twangnh Omnilabel is a great work and I'm happy to see two works with similar motivations appear in a short time, which may show the direction of these works is promising and possibly acknowledged by some researchers in the community. I will try to answer this below as a discussion, and the following only standards for my personal opinion. The difference is also significant: This is only my personal opinion. Thanks for asking. We hope to see more methods and datasets towards this direction. |
Thanks for sharing the wonderful work, the paper differentiate GLIP with GroundingDINO, FIBER, the former is classified into open vocabulary object detection, while the latter is named bi-functional model(detect and reference comprehention), since GLIP can also be used for DOD (e.g., in omnilabel paper), could you please give more dissucssion on this ?
The text was updated successfully, but these errors were encountered: