add video support to segment-anything-2 pipeline #181
base: main
Conversation
```python
    labels=kwargs.get('point_labels', None),
)
video_segments = {}  # video_segments contains the per-frame segmentation results
for out_frame_idx, out_obj_ids, out_mask_logits in self.tm_vid.propagate_in_video(inference_state):
```
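For context, the loop body isn't shown in the quoted hunk; in the upstream SAM 2 video example this loop typically fills `video_segments` along these lines (a hedged reconstruction, not the exact code under review):

```python
# Collect a boolean mask per object for each propagated frame.
video_segments[out_frame_idx] = {
    out_obj_id: (out_mask_logits[i] > 0.0).cpu().numpy()
    for i, out_obj_id in enumerate(out_obj_ids)
}
```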
I have a feeling we should return the full triple instead of creating a video segment, leaving post-processing to the consumer of the API (though I recognize this is a good quick way to validate the sanity of the mask outputs).
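As a minimal sketch of that suggestion (the method name here is illustrative, not the actual pipeline code), the pipeline could simply yield the raw triples and let the caller decide how to post-process:

```python
# Hedged sketch: expose the raw (frame_idx, obj_ids, mask_logits) triples
# and leave per-frame post-processing to the consumer of the API.
def propagate(self, inference_state):
    for out_frame_idx, out_obj_ids, out_mask_logits in self.tm_vid.propagate_in_video(inference_state):
        yield out_frame_idx, out_obj_ids, out_mask_logits
```

A consumer could then rebuild the per-frame dict only if it actually needs it.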
Thanks for the guidance on this, I see how we can just return the results of `self.tm_vid.propagate_in_video(inference_state)`.
> I have a feeling we should return the full triple instead of creating a video segment, leaving post-processing to the consumer of the API (though I recognize this is a good quick way to validate the sanity of the mask outputs).
I had some issues trying to return the correct values. I added a frame index as an input parameter: normally `propagate_in_video` loops over the video, returning results for each frame from the starting frame index to the end, but now it should only return a single frame. The data doesn't look correct, though. Can you take a look? @pschroedl
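One hedged way to restrict this to a single frame, assuming `propagate_in_video` accepts a `start_frame_idx` argument as in the upstream SAM 2 video predictor (worth double-checking against the pinned SAM 2 version):

```python
# Illustrative sketch: return results for only the requested frame.
def propagate_single_frame(self, inference_state, frame_idx):
    # propagate_in_video yields frames starting at start_frame_idx, so the
    # first yielded result should correspond to frame_idx.
    for out_frame_idx, out_obj_ids, out_mask_logits in self.tm_vid.propagate_in_video(
        inference_state, start_frame_idx=frame_idx
    ):
        return out_frame_idx, out_obj_ids, out_mask_logits
    return None
```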
This change updates the request field `image` to `media_file` and loads the appropriate segment-anything-2 inference model based on the content type of the file. It uses ffmpeg to process the video into image frames and loads them in with inference values from the request.

Some adjustments still need to be made to the request/response parameters. I think "frame index" and "object id" may be two request parameters to add to this pipeline for video requests; I've hard-coded some values for now.
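A rough sketch of the video path described above, assuming ffmpeg is called through `subprocess` and the SAM 2 video predictor exposes `init_state(video_path=...)` over a directory of JPEG frames as in the upstream repo (`extract_frames` and the `media_file` handling are illustrative names, not the actual pipeline code):

```python
import subprocess
import tempfile

def extract_frames(video_path: str) -> str:
    """Decode a video into numbered JPEG frames with ffmpeg; return the frame directory."""
    frame_dir = tempfile.mkdtemp(prefix="sam2_frames_")
    subprocess.run(
        ["ffmpeg", "-i", video_path, "-q:v", "2", f"{frame_dir}/%05d.jpg"],
        check=True,
    )
    return frame_dir

# Illustrative dispatch on the uploaded file's content type:
# if media_file.content_type.startswith("video/"):
#     frame_dir = extract_frames(saved_video_path)
#     inference_state = video_predictor.init_state(video_path=frame_dir)
# else:
#     fall back to the single-image segment-anything-2 path
```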