Jia-Run Du, Jia-Chang Feng, Kun-Yu Lin, Fa-Ting Hong, Zhongang Qi, Ying Shan, Jian-Fang Hu, and Wei-Shi Zheng
Official repository of the TCSVT 2024 paper "Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning".
We propose a Progressive Complementary Learning (ProCL) method that progressively enhances the snippet-level supervision from the perspective of category exclusion. Specifically, our ProCL gradually excludes the categories that snippets should not belong to, based on different confidence levels. Moreover, we propose three snippet-level losses for weakly-supervised temporal action localization, bridging the gap between video-level supervision and unavailable snippet-level supervision, without using additional auxiliary models or information.
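For intuition, the sketch below shows a generic complementary-label ("negative learning") loss of the kind this idea builds on: it suppresses the probability of categories that have been excluded for each snippet. It is a minimal illustration with names of our own choosing, not the paper's exact three losses.

```python
import torch

def complementary_loss(logits, excluded_mask):
    """Generic complementary-label loss sketch (illustrative only).

    logits:        (T, C) snippet-level class logits.
    excluded_mask: (T, C) float mask, 1 where a category has been
                   confidently excluded for that snippet, else 0.
    Maximizes log(1 - p) on excluded entries, pushing the predicted
    probability of excluded categories toward zero.
    """
    probs = torch.softmax(logits, dim=-1)
    neg_log = -torch.log((1.0 - probs).clamp(min=1e-6))
    return (neg_log * excluded_mask).sum() / excluded_mask.sum().clamp(min=1.0)
```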
- [2024.10.09] 🎊 The ckpts of ProCL are available.
- [2024.10.09] 🥳 Code of ProCL is released.
- [2024.09.03] 🎉 Our work is accepted by TCSVT 2024.
- Python 3.6 and PyTorch 1.3.0 are used.
- We run the experiments on a single 1080Ti GPU.
- Create the anaconda environment we used, as below:
```bash
# NOTE: before executing this command, change the "prefix" in environment.yaml to your own path
conda env create -f environment.yaml
```
- The features for [Thumos14] and [ActivityNet1.3] can be downloaded; the annotations are already included in this package.
- After downloading, modify the `--path-dataset` option in your running script to point to your own path.
- Download the pre-trained [checkpoints].
- Create the default folder `./ckpt` and put the downloaded pre-trained models into it.
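As a quick sanity check that a downloaded checkpoint is intact, you can inspect it with standard PyTorch loading; the file name below is illustrative, so substitute whichever file you actually placed in `./ckpt`:

```python
# Minimal checkpoint sanity check (the file name is an assumption --
# use the actual file downloaded into ./ckpt).
import torch

state = torch.load("./ckpt/Train_Thumos14.pkl", map_location="cpu")
print(type(state))
if isinstance(state, dict):
    # typically a state_dict mapping parameter names to tensors
    for name, value in list(state.items())[:5]:
        print(name, getattr(value, "shape", None))
```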
- Run the test scripts:
```bash
# Thumos14
CUDA_VISIBLE_DEVICES=2 python test.py --use-model Model_Thumos --dataset-name Thumos14reduced --path-dataset /mnt/Datasets/TAL_dataset/Thumos14 --model-name Train_Thumos14 --seed 355 --delta 0.2 --max_seqlen_list 560 1120 280 --att_thresh_params 0.1 0.925 0.025 --test_proposal_method 'multiple_threshold_hamnet_v3' --test_proposal_mode 'att' --PLG_logits_mode 'norm' --PLG_thres 0.69

# ActivityNet1.3
CUDA_VISIBLE_DEVICES=2 python test.py --use-model Model_Ant --dataset-name ActivityNet1.3 --path-dataset /mnt/Datasets/TAL_dataset/ActivityNet1.3/ --dataset Ant13_SampleDataset --model-name Train_ActivityNet13 --num-class 200 --seed 3552 --delta 0.3 --t 10 --max_seqlen_list 90 180 50 --test_proposal_method 'multiple_threshold_hamnet_ant' --PLG_logits_mode 'norm' --PLG_thres 0.685
```
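For context on the test-time flags: `--att_thresh_params 0.1 0.925 0.025` appears to specify a (start, stop, step) sweep of attention thresholds, and the `multiple_threshold_hamnet*` proposal methods threshold the attention sequence at each level, keeping contiguous above-threshold runs as candidate proposals. The sketch below illustrates that general scheme; the function name, the scoring rule, and the snippet-to-seconds conversion are our assumptions, not the repo's implementation.

```python
import numpy as np

def multi_threshold_proposals(att, start=0.1, stop=0.925, step=0.025,
                              fps=25, stride=16):
    """Illustrative multi-threshold proposal generation.

    att: (T,) per-snippet attention/actionness scores in [0, 1].
    fps/stride convert snippet indices to seconds (example values).
    Returns (t_start_sec, t_end_sec, score) tuples collected over the sweep.
    """
    proposals = []
    for thresh in np.arange(start, stop, step):
        above = att > thresh
        # locate contiguous runs of snippets above the threshold
        edges = np.diff(above.astype(np.int8), prepend=0, append=0)
        run_starts = np.flatnonzero(edges == 1)
        run_ends = np.flatnonzero(edges == -1)
        for s, e in zip(run_starts, run_ends):
            score = float(att[s:e].mean())  # e.g. mean attention as the score
            proposals.append((s * stride / fps, e * stride / fps, score))
    return proposals
```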
- Run the training scripts:
```bash
# Thumos14
CUDA_VISIBLE_DEVICES=2 python main.py --use-model Model_Thumos --dataset-name Thumos14reduced --path-dataset /mnt/Datasets/TAL_dataset/Thumos14 --model-name Train_Thumos14 --seed 355 --delta 0.2 --max_seqlen_list 560 1120 280 --use_ms --k 7 --max-iter 20000 --att_thresh_params 0.1 0.925 0.025 --test_proposal_method 'multiple_threshold_hamnet_v3' --test_proposal_mode 'att' --lambda_cll 1 --lambda_lpl 1 --PLG_logits_mode 'norm' --PLG_thres 0.69 --rescale_mode 'nearest' --ensemble_weight 0.33 0.33 0.33 --lpl_norm 'none' --alpha 0 --multi_back --lambda_mscl 1 --SCL_alpha 1 --pseudo_iter -1 --interval 50

# ActivityNet1.3
CUDA_VISIBLE_DEVICES=2 python main.py --use-model Model_Ant --path-dataset /mnt/Datasets/TAL_dataset/ActivityNet1.3/ --dataset-name ActivityNet1.3 --dataset Ant13_SampleDataset --model-name Train_ActivityNet13 --num-class 200 --seed 3552 --delta 0.3 --t 10 --max_seqlen_list 90 180 50 --use_ms --k 10 --lr 1e-5 --max-iter 30000 --test_proposal_method 'multiple_threshold_hamnet_ant' --lambda_cll 1 --lambda_lpl 1 --PLG_logits_mode 'norm' --PLG_thres 0.685 --rescale_mode 'nearest' --ensemble_weight 0.33 0.33 0.33 --lpl_norm 'none' --alpha 1 --multi_back --pseudo_iter 20000
```
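For orientation, the `--lambda_cll`, `--lambda_lpl`, and `--lambda_mscl` flags plausibly weight the proposed snippet-level losses against the base video-level classification loss, roughly as sketched below. The names are ours and the actual combination is defined in the training code; note the MSCL term appears only in the Thumos14 command above.

```python
# Hedged sketch of how the lambda flags may combine the loss terms.
def total_loss(l_video, l_cll, l_lpl, l_mscl=0.0,
               lambda_cll=1.0, lambda_lpl=1.0, lambda_mscl=1.0):
    return (l_video
            + lambda_cll * l_cll
            + lambda_lpl * l_lpl
            + lambda_mscl * l_mscl)
```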
We would like to thank the contributors of CO2-Net and ASM-Loc for their open research and exploration.
If you find ProCL useful for your research and applications, please cite using this BibTeX:
```bibtex
@article{du2024weakly,
  title={Weakly-supervised temporal action localization by progressive complementary learning},
  author={Du, Jia-Run and Feng, Jia-Chang and Lin, Kun-Yu and Hong, Fa-Ting and Qi, Zhongang and Shan, Ying and Hu, Jian-Fang and Zheng, Wei-Shi},
  journal={IEEE Transactions on Circuits and Systems for Video Technology},
  year={2024},
  publisher={IEEE}
}
```