
Which dataset was used to train EVF-SAM2? #45

Open
iseunghoon opened this issue Feb 2, 2025 · 4 comments

Comments

@iseunghoon

Which dataset was used to train EVF-SAM2?

@CoderZhangYx
Collaborator

The same as for EVF-SAM1: we used RefCOCO/+/g, ADE20K, Objects365 (filtered and machine-annotated), PartImageNet, Humanparsing, and Pascal-part.

@yunjeongch

Hi,

I’m a bit confused about the training scheme of EVF-SAM2.

From what I see in the code, the EvfSam2Model class is implemented in both evf_sam2.py and evf_sam2_video.py. As far as I understand, the difference between them lies in the visual model: SAM2Base in evf_sam2.py and SAM2VideoPredictor in evf_sam2_video.py.

Did you train the evf_sam2.py version with SAM2Base using the aforementioned image dataset and then perform inference with evf_sam2_video.py using the trained parameters?

Thanks in advance!

@CoderZhangYx
Copy link
Collaborator

Yes, you are right. Both SAM2Base and SAM2VideoPredictor are wrappers around the same SAM2 components (image encoder + prompt encoder + mask decoder). We train EVF-SAM2 on image datasets while keeping all SAM2 parameters frozen; the model can then perform zero-shot video prediction.
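To make the scheme concrete, here is a minimal sketch of the idea described above: train with all SAM2 parameters frozen, then reuse the identical weights in a video-oriented wrapper for zero-shot inference. All class and attribute names here (Sam2ComponentsStub, text_projector, etc.) are illustrative stand-ins, not the actual EVF-SAM repository API.

```python
import torch
import torch.nn as nn

class Sam2ComponentsStub(nn.Module):
    """Stand-in for the shared SAM2 components (image encoder +
    prompt encoder + mask decoder) that both wrappers reuse."""
    def __init__(self):
        super().__init__()
        self.image_encoder = nn.Linear(8, 8)
        self.mask_decoder = nn.Linear(8, 1)

class EvfSam2ImageSketch(nn.Module):
    """Image-level wrapper (the role evf_sam2.py plays): this is the
    version that gets trained on the image datasets."""
    def __init__(self):
        super().__init__()
        self.sam2 = Sam2ComponentsStub()
        self.text_projector = nn.Linear(8, 8)  # the trainable part
        # Freeze every SAM2 parameter: only the projector learns.
        for p in self.sam2.parameters():
            p.requires_grad = False

class EvfSam2VideoSketch(EvfSam2ImageSketch):
    """Video-level wrapper (the role evf_sam2_video.py plays): same
    parameters, different inference path (memory/propagation omitted)."""
    pass

image_model = EvfSam2ImageSketch()
# ... image training would update only image_model.text_projector ...

# Zero-shot video use: load the image-trained weights unchanged.
video_model = EvfSam2VideoSketch()
video_model.load_state_dict(image_model.state_dict())
```

Because `load_state_dict` copies tensor values without touching `requires_grad`, the SAM2 stub stays frozen in both wrappers, matching the description of training only the non-SAM2 parts on images.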

@yunjeongch

Thanks for the clarification!
