Extended Analysis: Continual Learning: Forget-free Winning Subnetworks for Video Representations (Preprint)
Quick-start scripts (replace #gpu, #sparsity, and #bit with the GPU id, target sparsity, and quantization bit width):
./scripts/run.sh #gpu train #sparsity
./scripts/run.sh #gpu eval #sparsity #bit
./scripts/run_fso.sh #gpu train #sparsity
./scripts/run_fso.sh #gpu eval #sparsity #bit
Dataset downloads:
- DAVIS download
- UVG8/17 download
Hao Chen, Bo He, Hanyu Wang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava
This is the official implementation of the paper "NeRV: Neural Representations for Videos".
🔥 An improved codebase, based on HNeRV, has been released.
We run with Python 3.8; you can set up a conda environment and install all dependencies like so:
pip install -r requirements.txt
The code is organized as follows:
- train_nerv.py includes a generic training routine.
- model_nerv.py contains the dataloader and neural network architecture.
- data/ contains the video/image dataset; we provide Big Buck Bunny here.
- checkpoints/ contains pre-trained models on the Big Buck Bunny dataset.
- Log files (tensorboard, txt, state_dict, etc.) will be saved in the output directory (specified by --outf).
The NeRV-S experiment on 'Big Buck Bunny' can be reproduced with the command below; NeRV-M and NeRV-L use 9_16_58 and 9_16_112 for --fc_hw_dim (the height, width, and channel dimension of the first feature map), respectively.
python train_nerv.py -e 300 --lower-width 96 --num-blocks 1 --dataset bunny --frame_gap 1 \
--outf bunny_ab --embed 1.25_40 --stem_dim_num 512_1 --reduction 2 --fc_hw_dim 9_16_26 --expansion 1 \
--single_res --loss Fusion6 --warmup 0.2 --lr_type cosine --strides 5 2 2 2 2 --conv_type conv \
-b 1 --lr 0.0005 --norm none --act swish
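For orientation, here is how we read these flags against model_nerv.py: --embed 1.25_40 sets a positional encoding with base b = 1.25 and length l = 40, --stem_dim_num 512_1 a one-layer 512-wide stem MLP, --fc_hw_dim 9_16_26 the 9×16×26 feature map the stem produces, and --strides the per-block upsampling factors. The sketch below is an illustrative paraphrase of that pipeline, not the repo's exact code:

import math
import torch
import torch.nn as nn

class TinyNeRV(nn.Module):
    """Illustrative paraphrase of NeRV: map a frame index t to an RGB frame."""
    def __init__(self, pe_base=1.25, pe_len=40, stem_dim=512,
                 fc_h=9, fc_w=16, fc_dim=26, strides=(5, 2, 2, 2, 2)):
        super().__init__()
        self.pe_base, self.pe_len = pe_base, pe_len
        self.fc_h, self.fc_w, self.fc_dim = fc_h, fc_w, fc_dim
        # Stem MLP: positional encoding (2 * pe_len dims) -> first feature map
        self.stem = nn.Sequential(
            nn.Linear(2 * pe_len, stem_dim), nn.SiLU(),          # swish activation
            nn.Linear(stem_dim, fc_h * fc_w * fc_dim))
        # One conv + PixelShuffle block per stride (spatial upsampling)
        layers, c = [], fc_dim
        for s in strides:
            layers += [nn.Conv2d(c, c * s * s, 3, padding=1),
                       nn.PixelShuffle(s), nn.SiLU()]
        self.blocks = nn.Sequential(*layers)
        self.head = nn.Conv2d(c, 3, 3, padding=1)                # RGB output layer

    def forward(self, t):  # t: normalized frame indices in [0, 1], shape (B,)
        freqs = self.pe_base ** torch.arange(self.pe_len, dtype=torch.float32,
                                             device=t.device) * math.pi
        pe = torch.cat([torch.sin(t[:, None] * freqs),
                        torch.cos(t[:, None] * freqs)], dim=1)
        x = self.stem(pe).view(-1, self.fc_dim, self.fc_h, self.fc_w)
        return torch.sigmoid(self.head(self.blocks(x)))          # (B, 3, 720, 1280)

With the defaults above the strides multiply to 80, so the 9×16 stem map decodes to 720×1280 frames, matching the bunny video resolution.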
To evaluate a pre-trained model, add --eval_only and specify the model path with --weight. You can specify model quantization with --quant_bit [bit_length] and test decoding speed with --eval_fps. Below is a sample command for NeRV-S on the bunny dataset:
python train_nerv.py -e 300 --lower-width 96 --num-blocks 1 --dataset bunny --frame_gap 1 \
--outf bunny_ab --embed 1.25_40 --stem_dim_num 512_1 --reduction 2 --fc_hw_dim 9_16_26 --expansion 1 \
--single_res --loss Fusion6 --warmup 0.2 --lr_type cosine --strides 5 2 2 2 2 --conv_type conv \
-b 1 --lr 0.0005 --norm none --act swish \
--weight checkpoints/nerv_S.pth --eval_only
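For a rough idea of what --quant_bit does to the weights, here is a minimal post-training quantization sketch (min-max affine quantization per tensor; the repo's exact scheme and storage format may differ):

import torch

def quantize_tensor(w: torch.Tensor, bit: int = 8):
    """Min-max affine quantization of one weight tensor to `bit` bits.
    Returns the dequantized weights used at eval time, plus the integer
    codes and (scale, offset) that determine storage cost. Illustrative
    only; not the repo's exact scheme."""
    levels = 2 ** bit - 1
    w_min, w_max = w.min(), w.max()
    scale = (w_max - w_min).clamp(min=1e-8) / levels
    q = torch.round((w - w_min) / scale).clamp(0, levels)   # integer codes
    w_hat = q * scale + w_min                               # dequantized weights
    return w_hat, q.to(torch.int32), (scale, w_min)

# Hypothetical usage: quantize all conv/linear weights of a loaded model in place.
# with torch.no_grad():
#     for p in model.parameters():
#         if p.dim() > 1:                                   # skip biases
#             p.copy_(quantize_tensor(p, bit=8)[0])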
To dump predictions with a pre-trained model, add --dump_images in addition to --eval_only, and specify the model path with --weight:
python train_nerv.py -e 300 --lower-width 96 --num-blocks 1 --dataset bunny --frame_gap 1 \
--outf bunny_ab --embed 1.25_40 --stem_dim_num 512_1 --reduction 2 --fc_hw_dim 9_16_26 --expansion 1 \
--single_res --loss Fusion6 --warmup 0.2 --lr_type cosine --strides 5 2 2 2 2 --conv_type conv \
-b 1 --lr 0.0005 --norm none --act swish \
--weight checkpoints/nerv_S.pth --eval_only --dump_images
To prune a pre-trained model and fine-tune it to recover performance, use --prune_ratio to specify the fraction of model parameters to prune, --weight to specify the pre-trained model, and --not_resume_epoch to skip loading the pre-trained checkpoint's epoch so fine-tuning restarts from epoch 0:
python train_nerv.py -e 100 --lower-width 96 --num-blocks 1 --dataset bunny --frame_gap 1 \
--outf prune_ab --embed 1.25_40 --stem_dim_num 512_1 --reduction 2 --fc_hw_dim 9_16_26 --expansion 1 \
--single_res --loss Fusion6 --warmup 0. --lr_type cosine --strides 5 2 2 2 2 --conv_type conv \
-b 1 --lr 0.0005 --norm none --suffix 107 --act swish \
--weight checkpoints/nerv_S.pth --not_resume_epoch --prune_ratio 0.4
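Conceptually, pruning at --prune_ratio 0.4 zeroes out the 40% of weights with the smallest magnitude and fine-tunes the rest. A minimal sketch using PyTorch's built-in pruning utilities (the repo may implement its own masking and per-layer policy):

import torch.nn as nn
import torch.nn.utils.prune as prune

def magnitude_prune(model: nn.Module, prune_ratio: float = 0.4):
    """Zero out the globally smallest-magnitude weights across conv/linear
    layers. Each pruned module then carries `weight_orig` and `weight_mask`
    buffers, with `weight` recomputed as their product, so fine-tuning only
    updates the surviving weights. Illustrative, not the repo's exact routine."""
    params = [(m, "weight") for m in model.modules()
              if isinstance(m, (nn.Conv2d, nn.Linear))]
    prune.global_unstructured(params, pruning_method=prune.L1Unstructured,
                              amount=prune_ratio)
    total = sum(m.weight_mask.numel() for m, _ in params)
    kept = int(sum(m.weight_mask.sum() for m, _ in params))
    print(f"kept {kept}/{total} weights ({kept / total:.1%})")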
To evaluate a pruned model, use --weight to specify the pruned model weights, --prune_ratio to initialize the weight_mask needed for checkpoint loading, --eval_only for evaluation mode, --quant_bit to specify the quantization bit length, and --quant_axis to specify the quantization axis:
python train_nerv.py -e 100 --lower-width 96 --num-blocks 1 --dataset bunny --frame_gap 1 \
--outf dbg --embed 1.25_40 --stem_dim_num 512_1 --reduction 2 --fc_hw_dim 9_16_26 --expansion 1 \
--single_res --loss Fusion6 --warmup 0. --lr_type cosine --strides 5 2 2 2 2 --conv_type conv \
-b 1 --lr 0.0005 --norm none --suffix 107 --act swish \
--weight checkpoints/nerv_S_pruned.pth --prune_ratio 0.4 --eval_only --quant_bit 8 --quant_axis 0
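Our reading of why --prune_ratio must be passed again here: a checkpoint saved after pruning stores weight_orig/weight_mask entries, so the same pruning reparameterization has to be applied to a freshly built model before load_state_dict will accept those keys (see the pruning sketch above). --quant_axis then selects per-axis rather than per-tensor quantization statistics; a sketch, reusing the min-max scheme from the earlier snippet:

import torch

def quantize_per_axis(w: torch.Tensor, bit: int = 8, axis: int = 0):
    """Min-max quantization with a separate (scale, offset) per slice along
    `axis` (e.g. per output channel when axis=0). Illustrative only."""
    levels = 2 ** bit - 1
    reduce_dims = [d for d in range(w.dim()) if d != axis]
    w_min = w.amin(dim=reduce_dims, keepdim=True)
    w_max = w.amax(dim=reduce_dims, keepdim=True)
    scale = (w_max - w_min).clamp(min=1e-8) / levels
    q = torch.round((w - w_min) / scale).clamp(0, levels)   # integer codes
    return q * scale + w_min                                # dequantized weights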
The final bits-per-pixel (bpp) is computed by dividing the total model size in bits by the number of pixels the model represents: bpp = total_model_bits / (T × H × W), where T is the frame count and H × W the frame resolution.
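A small helper for that bookkeeping; the parameter count, bit width, and video shape below are hypothetical placeholders:

def bits_per_pixel(num_params: int, quant_bit: int,
                   num_frames: int, height: int, width: int) -> float:
    """bpp = total model bits / total pixels, ignoring mask/entropy-coding overhead."""
    return num_params * quant_bit / (num_frames * height * width)

# Hypothetical example: a 3.2M-parameter model at 8 bits, 132 frames of 720p video.
print(bits_per_pixel(3_200_000, 8, 132, 720, 1280))  # ~0.21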
If you find our work useful in your research, please cite:
@misc{kang2024progressive,
title={Progressive Fourier Neural Representation for Sequential Video Compilation},
author={Haeyong Kang and Jaehong Yoon and DaHyun Kim and Sung Ju Hwang and Chang D. Yoo},
year={2024},
eprint={2306.11305},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
Extended Analysis:
@misc{kang2024continual,
title={Continual Learning: Forget-free Winning Subnetworks for Video Representations},
author={Haeyong Kang and Jaehong Yoon and Sung Ju Hwang and Chang D. Yoo},
year={2024},
eprint={2312.11973},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
If you have any questions, please feel free to email the authors.