Untrimmed Video Feature Extractor

Long and untrimmed video learning has received increasing attention in recent years. This repo provides simple and effective scripts for extracting features from long, untrimmed videos. We adopt the video processing pipeline from TSP and adapt it to several awesome pretrained vision backbones.

Environment

Run conda env create -f base_environment.yaml to set up the base environment. For model-specific setup, please check the corresponding project:

Video Swin Transformer

Omnivore

CLIP

Usage

Run bash Scripts/generate_video_metada.sh to extract metadata from the videos, where VIDEO_FOLDER is the directory that contains the raw videos and OUTPUT_CSV_PATH is the output CSV file that will hold the generated video metadata.
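For readers who want a feel for what this step produces, here is a minimal Python sketch of a metadata pass over VIDEO_FOLDER. It is not the repo's script. The extension list, the column names (filename, fps, num_frames), and the probe callback are all assumptions made for illustration. The actual CSV layout is whatever Scripts/generate_video_metada.sh writes.

```python
import csv
from pathlib import Path

# Hypothetical set of extensions treated as videos; adjust to your dataset.
VIDEO_EXTENSIONS = {".mp4", ".mkv", ".avi", ".webm"}


def list_videos(video_folder):
    """Return the sorted file names of all videos directly inside video_folder."""
    folder = Path(video_folder)
    return sorted(
        p.name
        for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() in VIDEO_EXTENSIONS
    )


def write_metadata_csv(video_folder, output_csv_path, probe):
    """Write one CSV row per video found in video_folder.

    `probe` is a caller-supplied (hypothetical) function that takes a video
    path and returns (fps, num_frames), e.g. a wrapper around ffprobe or OpenCV.
    """
    folder = Path(video_folder)
    with open(output_csv_path, "w", newline="") as f:
        writer = csv.writer(f)
        # Illustrative column names, not necessarily those used by the repo.
        writer.writerow(["filename", "fps", "num_frames"])
        for name in list_videos(folder):
            fps, num_frames = probe(folder / name)
            writer.writerow([name, fps, num_frames])
```

Keeping the decoder behind a probe callback means the directory scan and the CSV writing can be checked without any video library installed.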

Then run the following script to extract features:

bash Scripts/extract_${MODEL_NAME}_feat.sh

Before running, remember to set the variables defined in the script.

Finally, run bash Scripts/merge_pkl_to_h5.sh to merge the video features into a single h5 file.
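The merge step can be pictured roughly as follows. This is a sketch under stated assumptions, not the code behind Scripts/merge_pkl_to_h5.sh. It assumes one pickle file per video, named after the video id, each holding a single array-like of features. The function names load_pkl_features and write_h5 are made up for this example.

```python
import pickle
from pathlib import Path


def load_pkl_features(feature_dir):
    """Load every <video_id>.pkl in feature_dir into a {video_id: features} dict."""
    features = {}
    for pkl_path in sorted(Path(feature_dir).glob("*.pkl")):
        with open(pkl_path, "rb") as f:
            # Assumption: each file stores one array-like object for one video.
            features[pkl_path.stem] = pickle.load(f)
    return features


def write_h5(features, output_h5_path):
    """Store each video's features as one HDF5 dataset keyed by video id."""
    import h5py  # third-party; imported lazily so loading works without it

    with h5py.File(output_h5_path, "w") as h5:
        for video_id, array in features.items():
            h5.create_dataset(video_id, data=array)
```

With a file written this way, the features of one video could be read back with h5py.File(path, "r")[video_id][:].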

Acknowledgement

This repo is mainly based on the pipeline provided by TSP.
