Towards Lighter and Faster: Learning Wavelets Progressively for Image Super-Resolution (accepted by ACMMM2020) [PAPER]

This repository is the official PyTorch implementation of our proposed WSR. The code was developed by supercaoO (Huanrong Zhang) based on SRFBN_CVPR19. Future updates will be released in supercaoO/WSR first.

Introduction

We propose a lightweight and fast network that learns wavelet coefficients progressively for single image super-resolution (WSR). The network comprises two main branches. One predicts the second-level low-frequency wavelet coefficients, and the other is designed in a recurrent way to predict the remaining wavelet coefficients at the first and second levels. An inverse wavelet transformation then reconstructs the SR image from these coefficients. In addition, we propose a deformable convolution kernel (side window) to construct the side-information multi-distillation block (S-IMDB), which is the basic unit of the recurrent blocks (RBs). Moreover, we train WSR with loss constraints in both the wavelet and spatial domains.

The RNN-based framework of our proposed 4× WSR. Note that the two recurrent blocks (RBs) share the same set of weights. Details of the proposed S-IMDB can be found in our main paper.
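
To make the two-level decomposition described in the introduction concrete, here is a minimal NumPy sketch of a two-level 2-D wavelet transform and its inverse. It assumes the orthonormal Haar wavelet purely for illustration; this section does not state which wavelet WSR uses, and the function names are hypothetical rather than taken from the repository.

```python
import numpy as np


def haar_dwt2(x):
    """One level of the 2-D orthonormal Haar transform (illustrative only).

    Returns (LL, (LH, HL, HH)); each sub-band has half the height and width.
    Sub-band naming conventions vary between libraries.
    """
    a = x[0::2, 0::2]  # top-left pixel of every 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0  # low-frequency approximation
    lh = (a + b - c - d) / 2.0  # difference between rows
    hl = (a - b + c - d) / 2.0  # difference between columns
    hh = (a - b - c + d) / 2.0  # diagonal difference
    return ll, (lh, hl, hh)


def haar_idwt2(ll, details):
    """Exact inverse of haar_dwt2."""
    lh, hl, hh = details
    h, w = ll.shape
    x = np.empty((2 * h, 2 * w), dtype=ll.dtype)
    x[0::2, 0::2] = (ll + lh + hl + hh) / 2.0
    x[0::2, 1::2] = (ll + lh - hl - hh) / 2.0
    x[1::2, 0::2] = (ll - lh + hl - hh) / 2.0
    x[1::2, 1::2] = (ll - lh - hl + hh) / 2.0
    return x


def decompose_two_levels(img):
    """Split an image into level-2 LL, level-2 details, and level-1 details."""
    ll1, details1 = haar_dwt2(img)
    ll2, details2 = haar_dwt2(ll1)
    return ll2, details2, details1


def reconstruct_two_levels(ll2, details2, details1):
    """Invert decompose_two_levels: level 2 first, then level 1."""
    ll1 = haar_idwt2(ll2, details2)
    return haar_idwt2(ll1, details1)


# Round trip on a random 16x16 "HR" image (height and width divisible by 4).
hr = np.random.default_rng(0).random((16, 16))
ll2, d2, d1 = decompose_two_levels(hr)
restored = reconstruct_two_levels(ll2, d2, d1)
```

In this sketch the second-level sub-bands are a quarter of the HR size along each axis, which for a 4× factor matches the spatial size of the LR input. In WSR the coefficients are predicted by the network rather than computed from the HR image, so only the final inverse step corresponds to what happens at inference time.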

If you find our work useful in your research or publications, please consider citing:

@inproceedings{zhang2020wsr,
    author = {Zhang, Huanrong and Jin, Zhi and Tan, Xiaojun and Li, Xiying},
    title = {Towards Lighter and Faster: Learning Wavelets Progressively for Image Super-Resolution},
    booktitle = {Proceedings of the 28th ACM International Conference on Multimedia},
    year= {2020}
}

Contents

  1. Requirements
  2. Test
  3. Train
  4. Results
  5. Acknowledgements

Requirements

Test

Quick start

  1. Clone this repository and cd to WSR:

    git clone https://github.com/supercaoO/WSR.git
    cd WSR
  2. Check if the pre-trained model WSR_x4_BI.pth exists in ./models.

  3. Then run the following command to evaluate on Set5:

    CUDA_VISIBLE_DEVICES=0 python test.py -opt options/test/test_WSR_Set5.json
  4. Finally, the PSNR/SSIM values for Set5 are printed to your terminal, and the reconstructed images are saved in ./results/SR/BI.
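
The quick-start steps above can also be scripted. The following sketch checks that the pre-trained model and the option file exist before building the test command. The helper names `build_test_command` and `gpu_env` are hypothetical and not part of the repository; only the paths and the command line come from the steps above.

```python
import os
from pathlib import Path


def build_test_command(repo_root,
                       option_file="options/test/test_WSR_Set5.json",
                       checkpoint="models/WSR_x4_BI.pth"):
    """Return the test command after verifying the files it depends on.

    Hypothetical helper: raises FileNotFoundError if the checkpoint or the
    option file is missing under repo_root.
    """
    root = Path(repo_root)
    missing = [p for p in (checkpoint, option_file) if not (root / p).is_file()]
    if missing:
        raise FileNotFoundError(f"missing under {root}: {', '.join(missing)}")
    return ["python", "test.py", "-opt", option_file]


def gpu_env(device="0"):
    """Copy the current environment and select the visible GPU."""
    env = dict(os.environ)
    env["CUDA_VISIBLE_DEVICES"] = device
    return env


# Example (run from the WSR directory):
#   import subprocess
#   subprocess.run(build_test_command("."), cwd=".", env=gpu_env("0"), check=True)
```

Failing early on a missing checkpoint gives a clearer error than letting the test script start and fail while loading the model.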

Test on standard SR benchmark

  1. After cloning this repository, download the SR benchmarks (Set5, Set14, B100, Urban100, and Manga109) from GoogleDrive (provided by SRFBN_CVPR19) or BaiduYun (code: p9pf).

  2. Run ./results/Prepare_TestData_HR_LR.m in Matlab to generate HR/LR images with the BI degradation model.

  3. Edit ./options/test/test_WSR_x4_BI.json to suit your needs, following ./options/test/README.md.

  4. Then run the command:

    cd WSR
    CUDA_VISIBLE_DEVICES=0 python test.py -opt options/test/test_WSR_x4_BI.json
  5. Finally, the PSNR/SSIM values are printed to your terminal, and the reconstructed images are saved in ./results/SR/BI. You can further evaluate the SR results using ./results/Evaluate_PSNR_SSIM.m.
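
For readers who want to sanity-check a reported number, PSNR is defined as 10·log10(MAX² / MSE). The sketch below is a generic NumPy implementation, not the repository's evaluation code. The repository's scripts may differ, for example in colour-channel handling or border cropping, so values can deviate from those printed by test.py or Evaluate_PSNR_SSIM.m. The `border` parameter is an assumption added for illustration.

```python
import numpy as np


def psnr(reference, estimate, max_value=255.0, border=0):
    """Peak signal-to-noise ratio in dB between two same-sized images.

    border (hypothetical option): number of pixels to crop from each edge
    of the first two axes before computing the error.
    """
    ref = np.asarray(reference, dtype=np.float64)
    est = np.asarray(estimate, dtype=np.float64)
    if ref.shape != est.shape:
        raise ValueError("images must have the same shape")
    if border > 0:
        ref = ref[border:-border, border:-border]
        est = est[border:-border, border:-border]
    mse = np.mean((ref - est) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(max_value ** 2 / mse))
```

For 8-bit images, a uniform error of one grey level gives MSE = 1 and therefore a PSNR of 20·log10(255), roughly 48.13 dB.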

Test on your own images

  1. After cloning this repository, place your own images in ./results/LR/MyImage.

  2. Edit ./options/test/test_WSR_own.json to suit your needs, following ./options/test/README.md.

  3. Then run the command:

    cd WSR
    CUDA_VISIBLE_DEVICES=0 python test.py -opt options/test/test_WSR_own.json
  4. Finally, you can find the reconstructed images in ./results/SR/MyImage.

Train

  1. Download the DIV2K training set from the official link or BaiduYun (code: m84q).

  2. Run ./scripts/Prepare_TrainData_HR_LR.m in Matlab to generate HR/LR training pairs with the BI degradation model and the corresponding scale factor.

  3. Run ./results/Prepare_TestData_HR_LR.m in Matlab to generate HR/LR test images with the BI degradation model and the corresponding scale factor, and choose one of the SR benchmarks for evaluation during training.

  4. Edit ./options/train/train_WSR.json to suit your needs, following ./options/train/README.md.

  5. Then run the command:

    cd WSR
    CUDA_VISIBLE_DEVICES=0 python train.py -opt options/train/train_WSR.json
  6. You can monitor the training process in ./experiments.

  7. Finally, follow the Test instructions above to evaluate your model.
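
Step 4 asks you to edit the option file by hand. If you would rather leave the original untouched, a small script can write a modified copy instead. The sketch below assumes the option file is plain JSON without comments, which is not confirmed here, and the helper `write_option_variant` is hypothetical. No real option names are assumed: consult ./options/train/README.md for the actual keys.

```python
import json
from pathlib import Path


def write_option_variant(src, dst, overrides):
    """Copy a JSON option file to dst with dotted-key overrides applied.

    overrides maps "section.key" strings to new values. Unknown keys raise
    KeyError so that a typo cannot silently add an unused option.
    """
    options = json.loads(Path(src).read_text())
    for dotted_key, value in overrides.items():
        *parents, leaf = dotted_key.split(".")
        node = options
        for key in parents:
            node = node[key]  # KeyError if a parent section is missing
        if leaf not in node:
            raise KeyError(f"unknown option: {dotted_key}")
        node[leaf] = value
    Path(dst).write_text(json.dumps(options, indent=4) + "\n")
    return options


# Example with placeholder key names (replace them with real ones):
#   write_option_variant("options/train/train_WSR.json",
#                        "options/train/train_WSR_variant.json",
#                        {"section.some_key": 123})
```

The resulting file can then be passed to train.py with `-opt` in place of the original.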

Results

The inference time is measured on the B100 dataset (100 images) using an Intel(R) Xeon(R) Silver 4210 CPU @ 2.20GHz (CPU time) and an NVIDIA TITAN RTX GPU (GPU time).
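
If you want to take comparable measurements on your own hardware, a per-image average can be collected along the following lines. This is a generic sketch and not the authors' measurement protocol. The function name, the warm-up count, and the `sync` hook are assumptions made for illustration.

```python
import time


def mean_inference_time(run_once, inputs, warmup=1, sync=None):
    """Average wall-clock seconds per input for the callable run_once.

    warmup: number of leading inputs to run once, untimed, beforehand.
    sync:   optional zero-argument callable invoked before the timer starts
            and before it stops. On a GPU, pass torch.cuda.synchronize,
            because CUDA kernels are launched asynchronously.
    """
    inputs = list(inputs)
    if not inputs:
        raise ValueError("need at least one input")
    for x in inputs[:warmup]:
        run_once(x)  # untimed warm-up pass
    if sync is not None:
        sync()
    start = time.perf_counter()
    for x in inputs:
        run_once(x)
    if sync is not None:
        sync()
    return (time.perf_counter() - start) / len(inputs)
```

Here `run_once` would wrap a single forward pass of the model on one LR image, and `inputs` would be the 100 B100 images.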

Quantitative Results

Comparisons of the number of network parameters, inference time, and PSNR/SSIM of different 4× SR methods. The best and second-best PSNR/SSIM results are marked in red and blue, respectively.

Comparisons of the number of network parameters and inference time of different 4× SR methods. The best results are highlighted. Note that the compared methods achieve better PSNR/SSIM results than our WSR does.

Qualitative Results

Visual comparisons with different advanced 4× SR methods on “img018” and “img024” from the Urban100 dataset. The inference time shown is CPU time.

Trade-off Results

Relationship between the number of network parameters, inference time, and reconstruction performance of different advanced 4× SR methods. The color represents the PSNR achieved by each 4× network on the B100 dataset. The inference time is CPU time in the left figure and GPU time in the right figure.

TODO

  • Option files for more scales (i.e., 2×, 8×, 16×).

Acknowledgements