The PyTorch code of the TEA Module.
- PyTorch >= 1.1.0
Please refer to TSN repo and TSM repo for the detailed guide of data pre-processing.
A list file is utilized to specify the video data information, including a tuple of extracted video frame folder path (absolute path), video frame number, and video label. A typical line in the file look like:
/data/xxx/xxx/something-something/video_frame_folder 100 12
Finally, the absolute path of your own generated list files should be added into ops/dataset_configs.py
We have provided several examples for training TEA models on different datasets. Please refer to the Appendix B of our paper for more training details.
- To train TEA on Something-Something V1 dataset with 8 frames:
bash ./scripts/train_tea_something_rgb_8f.sh
- To train TEA on HMDB dataset with 16 frames from the Kinetics-400 pre-trained models:
bash ./scripts/finetune_tea_hmdb_rgb_16f.sh
Two inference protocols are utilized in our paper: 1) efficient protocol and 2) accuracy protocol. For both protocols we provide the example scripts for testing TEA models:
- Efficient Protocol
bash ./scripts/single_test_tea_something_rgb_8f.sh
- Accuracy Protocol
bash ./scripts/multi_test_tea_something_rgb_8f.sh