SAMWISE: Infusing wisdom in SAM2 for Text-Driven Video Segmentation

Claudia Cuttano, Gabriele Trivigno, Gabriele Rosi, Carlo Masone, Giuseppe Averta

Official repository for the paper: "SAMWISE: Infusing wisdom in SAM2 for Text-Driven Video Segmentation". In this work we build upon the Segment-Anything 2 (SAM2) model and make it wiser, by empowering it with natural language understanding and explicit temporal modeling at the feature extraction stage, without fine-tuning its weights, and without outsourcing modality interaction to external models. Our proposed method, SAMWISE, achieves state-of-the-art across various benchmarks, by adding a negligible overhead of just 4.2 M parameters.

📄[arXiv]

🚀 Code and Trained Models Coming Soon! 🚀

Our proposed SAMWISE.

SAMWISE in Action 👀

Our approach integrates natural language knowledge and temporal cues for streaming-based Referring Video Segmentation (RVOS). We mitigate tracking bias—where the model may overlook an identifiable object while tracking another—through a learnable mechanism. This enables efficient streaming processing, leveraging memory from previous frames to maintain context and ensure accurate object segmentation.

SAMWISE for streaming-based RVOS.

SAMWISE (our model, not the hobbit) segments objects from The Lord of the Rings in zero-shot—no extra training, just living up to its namesake! 🧙‍♂️✨

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
assets		assets
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SAMWISE: Infusing wisdom in SAM2 for Text-Driven Video Segmentation

SAMWISE in Action 👀

About

Releases

Packages

Contributors 2

License

ClaudiaCuttano/SAMWISE

Folders and files

Latest commit

History

Repository files navigation

SAMWISE: Infusing wisdom in SAM2 for Text-Driven Video Segmentation

SAMWISE in Action 👀

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages