Text prompt - object detection and segmentation of all things #7129

KTXKIKI · 2023-11-12T15:38:45Z

Actions before raising this issue

I searched the existing issues and did not find anything similar.
I read/searched the docs

Is your feature request related to a problem? Please describe.

pytorch.zip
It would be useful to be able to enter the desired text or natural-language description in the CVAT front end and use it for open-vocabulary object detection and segmentation.
https://github.com/IDEA-Research/GroundingDINO/tree/main
https://github.com/autodistill/autodistill-grounded-sam

Describe the solution you'd like

No response

Describe alternatives you've considered

No response

Additional context

No response

Sayanjones · 2024-02-22T17:58:48Z

Hi @KTXKIKI, I am interested in working on this project. Can we discuss this further?

adkbbx · 2024-02-23T01:14:16Z

Hey @KTXKIKI, do let me know how I can get started on this enhancement task.

KTXKIKI · 2024-02-23T01:20:29Z

"Hi, I am interested in working on this project. Can we discuss this further?"

Hello, I apologize for the late reply; I am in China, so there is a time difference.

Okay, I think we should start with serverless functions and modify some code in both the front end and the back end to support real-time text input, prompt categories, and automatic annotation of everything.

KTXKIKI · 2024-02-23T01:24:41Z

"Hey, please let me know how I can get started on this enhancement task."

Hello, I apologize for the late reply; I am in China, so there is a time difference.

I think we should start with serverless functions and modify some code in both the front end and the back end to support real-time text input, prompt categories, and automatic annotation of everything.

The idea is to package the inference code, inference environment, and model into a Docker image and run it as a container that communicates with the CVAT server for automatic annotation. In fact, I have already written some serverless functions (see above), but that work has been on hold recently and has not progressed further.
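
As a rough illustration of the text-prompt flow described above, here is a minimal sketch of how a prompt could be split into categories and how detector output could be turned into rectangle annotations. Everything in it is hypothetical: `Detection`, `parse_prompt`, `to_annotations`, `annotate`, and `stub_detector` are names invented for this example, and the result dictionary shape (`label`, `confidence`, `points`, `type`) is an assumption about what a CVAT serverless detector returns, not something confirmed in this thread. A real function would call a model such as GroundingDINO in place of the stub and would run inside the container.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List, Sequence


@dataclass
class Detection:
    """One box from a (hypothetical) open-vocabulary detector."""
    label: str
    score: float
    box: Sequence[float]  # xtl, ytl, xbr, ybr in pixels


def parse_prompt(prompt: str) -> List[str]:
    """Split a prompt such as "cat . dog" or "cat, dog" into unique category names."""
    categories: List[str] = []
    for part in prompt.replace(",", ".").split("."):
        name = part.strip().lower()
        if name and name not in categories:
            categories.append(name)
    return categories


def to_annotations(detections: Sequence[Detection],
                   categories: Sequence[str],
                   threshold: float = 0.3) -> List[Dict[str, Any]]:
    """Keep confident detections of requested categories as rectangle dicts.

    The dictionary layout is an assumed annotation format, not a confirmed API.
    """
    results: List[Dict[str, Any]] = []
    for det in detections:
        if det.label not in categories or det.score < threshold:
            continue
        results.append({
            "label": det.label,
            "confidence": str(round(det.score, 3)),
            "points": [float(v) for v in det.box],
            "type": "rectangle",
        })
    return results


def annotate(image: Any,
             prompt: str,
             detector: Callable[[Any, List[str]], List[Detection]],
             threshold: float = 0.3) -> List[Dict[str, Any]]:
    """Run the full flow: prompt -> categories -> detector -> annotations."""
    categories = parse_prompt(prompt)
    return to_annotations(detector(image, categories), categories, threshold)


def stub_detector(image: Any, categories: List[str]) -> List[Detection]:
    """Stand-in for a real model; returns fixed boxes for illustration only."""
    fake = [
        Detection("cat", 0.91, (10, 20, 110, 220)),
        Detection("dog", 0.12, (5, 5, 50, 50)),
        Detection("tree", 0.80, (0, 0, 30, 90)),
    ]
    return [d for d in fake if d.label in categories]


if __name__ == "__main__":
    print(annotate(None, "Cat . dog", stub_detector))
```

With the stub, only the "cat" box is kept: the "dog" box falls below the threshold and "tree" was not requested in the prompt. Passing the detector in as a callable keeps the prompt handling independent of whichever model is packaged into the image.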

KTXKIKI · 2024-02-23T01:27:26Z

Some reference points:
https://github.com/AILab-CVC/YOLO-World
https://docs.autodistill.com/
https://github.com/IDEA-Research/GroundingDINO/tree/main
https://github.com/autodistill/autodistill-grounded-sam
https://github.com/mbzuai-oryx/groundingLMM
https://github.com/hardikdava/cvat_plugins/tree/main

adkbbx · 2024-02-23T02:09:50Z

@KTXKIKI
Thank you for your prompt response and for sharing the resources. Since I am a new contributor to CVAT, I am currently trying to set up the development environment locally on my Windows 11 PC using this documentation link. I will look into your resources as soon as the environment is set up on my machine. Do let me know if you have any suggestions or extra resources I can use to set up the development environment locally and start contributing to this issue.

Sayanjones · 2024-02-23T09:49:28Z

@KTXKIKI Thank you for responding. I'm a new contributor to CVAT as well. Thank you for sharing the resources. I'll go through them once I have my Windows system up and running smoothly. Let's work together on this contribution and share any other advice or vent about the project. I'm excited to collaborate on CVAT with you!

ak4721269 · 2024-02-24T11:44:42Z

Hey @KTXKIKI, I am a new contributor to CVAT. Initially, I used CVAT.ai to manually label plastics for this project. I would like to contribute to this project. Currently, I am setting up the environment on my Windows system. After that, I will refer to the links mentioned above to get started with the project.

arch-adi21 · 2024-02-27T14:04:44Z

Hello @KTXKIKI, I am interested in being part of this journey. I have domain expertise in machine learning and am now moving into deep learning, where I find data augmentation very interesting. Please treat me as a beginner and suggest some initial tasks or learning resources I should go through to get started.

<3

kmh03214 · 2024-06-06T05:41:39Z

I strongly agree with the necessity of this project.
I am currently developing a CVAT serverless model.
Recently, I think it is necessary to have an interface in CVAT's Auto annotation that can receive text prompts to address the open vocabulary problem.

I hope it gets completed quickly and successfully! Thank you. 😃

KTXKIKI added the enhancement New feature or request label Nov 12, 2023
