Request: Open-source code for "Using OpenVCLIP features with AWT for zero-shot video recognition" #1

RichardHuang0001 · 2024-12-22T15:32:24Z

Thank you for your excellent work and contributions to the community!

I was wondering if you have any plans to release the code or provide guidance on how to use OpenVCLIP to extract features and combine them with AWT for zero-shot video recognition?

Best regards

zyuhan1999 · 2024-12-23T20:45:03Z

Hi,

Thank you for your interest in our work! AWT comprises three key components: augmentation, weighting, and transportation. The only difference between zero-shot image classification and zero-shot video classification lies in the augmentation step. For videos, in addition to randomly cropped and flipped images, frames sampled at different timestamps of the video are also used as views.
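To make the video augmentation step concrete, here is a minimal sketch of how such views could be built. It is not the AWT or Open-VCLIP code: the function names (`sample_frame_indices`, `random_crop_flip`, `augment_video`), the evenly spaced sampling scheme, and the parameters are all illustrative assumptions, and the video is assumed to be already decoded into a NumPy array of shape (T, H, W, C).

```python
import numpy as np


def sample_frame_indices(num_frames: int, num_samples: int) -> np.ndarray:
    """Pick num_samples frame indices spread evenly over the video.

    Assumption: the centre of each equal-length segment is used; the
    original repository may sample timestamps differently.
    """
    edges = np.linspace(0, num_frames, num_samples + 1)
    centres = (edges[:-1] + edges[1:]) / 2
    return np.clip(centres.astype(int), 0, num_frames - 1)


def random_crop_flip(frame: np.ndarray, crop: int,
                     rng: np.random.Generator) -> np.ndarray:
    """Return one randomly cropped, randomly h-flipped view of an HxWxC frame.

    Requires crop <= min(H, W).
    """
    h, w = frame.shape[:2]
    top = rng.integers(0, h - crop + 1)
    left = rng.integers(0, w - crop + 1)
    view = frame[top:top + crop, left:left + crop]
    if rng.random() < 0.5:
        view = view[:, ::-1]  # horizontal flip
    return view


def augment_video(video: np.ndarray, num_samples: int, views_per_frame: int,
                  crop: int, seed: int = 0) -> np.ndarray:
    """Build the set of augmented views for one video.

    video: array of shape (T, H, W, C).
    Returns an array of shape (num_samples * views_per_frame, crop, crop, C).
    """
    rng = np.random.default_rng(seed)
    views = []
    for idx in sample_frame_indices(len(video), num_samples):
        for _ in range(views_per_frame):
            views.append(random_crop_flip(video[idx], crop, rng))
    return np.stack(views)
```

Each returned view would then be passed through the Open-VCLIP image encoder, and the resulting per-video features saved in whatever layout the repository's instructions specify. That encoding and storage step is deliberately left out here, since its exact format is not described in this thread.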

You can download the Open-VCLIP pre-trained checkpoint and directly perform inference. The only manual effort required is organizing the image features of each video in the specified format, as outlined here. Once organized, you can use AWT_zero_shot/evaluate.py for AWT inference.

I hope this helps!

Best regards,
Yuhan

zyuhan1999 closed this as completed Jan 13, 2025
