Can metaseg input a video and output the class label? #91

CR400AF-A · 2023-08-02T12:45:26Z

Thanks for your great work!

I have a specific requirement for my project and I'm wondering if metaseg can cater to it. I need to input an image with dimensions HW3 (height * width * 3 channels) and obtain an "image" output with class labels in the form of HW1 (height * width * 1 channel). The "1" in this context represents that the pixels belong to different classes, rather than representing exact semantic labels.

Before I proceed, I'd like to confirm if metaseg has the capability to handle such a task. Your response would be highly valuable to me. Thank you for your time, and I'm looking forward to hearing from you.

CR400AF-A · 2023-08-02T15:37:16Z

I found the solution, but a new problem has emerged.

What I want to do is to segment a video and label each class. My first idea is to assign different class labels to different mask_image colors (you can see what I did for this below). However, I noticed that the output mask video changes the colors between different frames, making it difficult for me to track the labels (such as cookie/person and so on). I checked your code and found that you did the same thing to the video as the images. So, it is not surprising to get such a result.

Therefore, I wonder if you could share some of your ideas regarding this. Thanks!

What I did (In sam_predictor.py line 139):
'''
combined_mask = mask_image # combined_mask = cv2.add(frame, mask_image)
out.write(combined_mask)
'''

CR400AF-A · 2023-08-02T15:46:04Z

maybe this video can help you understand what happened. Take the person's arm as an example. I want to give these pixels a label according to something (here is the mask color, but the color changes with time). So is there some methods to fix it?
Thanks!

output_video_mask.mp4

CR400AF-A · 2023-08-02T15:56:43Z

The video is too large (46M) to preview on the github. Here is an link:
https://cloud.tsinghua.edu.cn/d/fefe751e32d549ad8aab/

Snnier · 2023-08-19T02:29:09Z

How did you do it: "1" means the pixel belongs to a different class, not the exact semantic label？

CR400AF-A · 2023-08-19T03:31:43Z

Hello, I can't make it through this method. Maybe you can have a look at issue #92 . I provide some methods for this issue.

CR400AF-A changed the title ~~Can metaseg output the class label?~~ Can metaseg input a video and output the class label? Aug 2, 2023

CR400AF-A mentioned this issue Aug 16, 2023

SegAutoMaskPredictor producing random color #92

Open

Snnier mentioned this issue Aug 19, 2023

I found the solution, but a new problem has emerged. #93

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can metaseg input a video and output the class label? #91

Can metaseg input a video and output the class label? #91

CR400AF-A commented Aug 2, 2023

CR400AF-A commented Aug 2, 2023 •

edited

Loading

CR400AF-A commented Aug 2, 2023 •

edited

Loading

CR400AF-A commented Aug 2, 2023

Snnier commented Aug 19, 2023

CR400AF-A commented Aug 19, 2023

Can metaseg input a video and output the class label? #91

Can metaseg input a video and output the class label? #91

Comments

CR400AF-A commented Aug 2, 2023

CR400AF-A commented Aug 2, 2023 • edited Loading

CR400AF-A commented Aug 2, 2023 • edited Loading

CR400AF-A commented Aug 2, 2023

Snnier commented Aug 19, 2023

CR400AF-A commented Aug 19, 2023

CR400AF-A commented Aug 2, 2023 •

edited

Loading

CR400AF-A commented Aug 2, 2023 •

edited

Loading