
S-ECAN

This repository is for S-ECAN introduced in the following paper:

Image Super-Resolution Reconstruction Using Swin Transformer with Efficient Channel Attention Networks [Paper Link]

The code was tested on Ubuntu 20.04 (Python 3.8, PyTorch 1.7.1, CUDA 12.0) with GeForce RTX 3090 and GeForce GTX TITAN GPUs.

1. Overview

Image super-resolution reconstruction (SR) is an important ill-posed problem in low-level vision that aims to reconstruct high-resolution images from low-resolution inputs. Although current state-of-the-art methods exhibit impressive performance, their recovery of image detail and edge information is still unsatisfactory. To address this problem, this paper proposes a shifted-window Transformer (Swin Transformer) with an efficient channel attention network (S-ECAN), which combines convolution-based channel attention with the self-attention of the Swin Transformer to exploit the advantages of both and to focus on learning the high-frequency features of images. In addition, to address the problem that Convolutional Neural Network (CNN) based channel attention consumes a large number of parameters to achieve good performance, this paper proposes the Efficient Channel Attention Block (ECAB), which involves only a handful of parameters while bringing a clear performance gain. Extensive experimental validation shows that the proposed model recovers more high-frequency details and texture information. The model is validated on the Set5, Set14, B100, Urban100, and Manga109 datasets, where it outperforms state-of-the-art methods by 0.03-0.13 dB, 0.04-0.09 dB, 0.01-0.06 dB, 0.13-0.20 dB, and 0.06-0.17 dB respectively in terms of objective metrics. Ultimately, the substantial performance gains and enhanced visual results over prior arts validate the effectiveness and competitiveness of our approach, which achieves an improved performance-complexity trade-off.
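To make the ECAB idea concrete, here is a minimal PyTorch sketch of an ECA-style channel attention block, assuming ECAB follows the ECA-Net recipe of a cheap 1-D convolution over pooled channel descriptors in place of a parameter-heavy fully-connected bottleneck; the authoritative definition is in secan/archs/secan_arch.py.

import torch
import torch.nn as nn

class ECABlockSketch(nn.Module):
    """Illustrative ECA-style channel attention: only k_size learnable weights."""
    def __init__(self, k_size=3):
        super().__init__()
        self.avg_pool = nn.AdaptiveAvgPool2d(1)  # (B, C, H, W) -> (B, C, 1, 1)
        self.conv = nn.Conv1d(1, 1, kernel_size=k_size, padding=k_size // 2, bias=False)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x):
        y = self.avg_pool(x)                                 # per-channel descriptor
        y = self.conv(y.squeeze(-1).transpose(-1, -2))       # mix neighboring channels
        y = self.sigmoid(y.transpose(-1, -2).unsqueeze(-1))  # attention weights
        return x * y                                         # channel-wise rescaling

print(ECABlockSketch()(torch.randn(1, 64, 32, 32)).shape)  # torch.Size([1, 64, 32, 32])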

  • The overall network architecture of our Swin Transformer with Efficient Channel Attention Network (S-ECAN), together with the structure of the Residual Multi-Attention Group (RMAG) and the Multi-Attention Block (MAB); a toy structural sketch follows this list. (Figure: S-ECAN architecture)

  • Super-resolution visual results (4×) on "img 030" from Urban100. (Figure: Visual0)
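The nesting of these components can be pictured with the purely illustrative sketch below; the stand-in modules and the assumed MAB = Swin layer + channel attention composition are expository guesses, not the paper's exact design (see secan/archs/secan_arch.py for the real definitions).

import torch
import torch.nn as nn

class MABSketch(nn.Module):
    """Hypothetical Multi-Attention Block: Swin-style self-attention
    followed by convolutional channel attention (both stand-ins here)."""
    def __init__(self, dim):
        super().__init__()
        self.swin_layer = nn.Identity()  # stand-in for a Swin Transformer layer
        self.ecab = nn.Identity()        # stand-in for the ECAB sketched above

    def forward(self, x):
        return self.ecab(self.swin_layer(x))

class RMAGSketch(nn.Module):
    """Hypothetical Residual Multi-Attention Group: a stack of MABs, a 3x3
    convolution, and a residual connection over the whole group."""
    def __init__(self, dim, num_blocks=6):  # num_blocks is an assumption
        super().__init__()
        self.blocks = nn.Sequential(*[MABSketch(dim) for _ in range(num_blocks)])
        self.conv = nn.Conv2d(dim, dim, 3, padding=1)

    def forward(self, x):
        return x + self.conv(self.blocks(x))

print(RMAGSketch(64)(torch.randn(1, 64, 32, 32)).shape)  # torch.Size([1, 64, 32, 32])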

2. Environment Setup

This project uses Conda to manage its Python environment. Create it as follows:

  • Step 1: Create and activate the Conda environment
conda create -n secan python=3.8
conda activate secan
  • Step 2: Install the PyTorch packages (install the build that matches your CUDA version; see the PyTorch website)
conda install pytorch==1.7.1 torchvision==0.8.2 torchaudio==0.7.2 cudatoolkit=11.0 -c pytorch
  • Step 3: Install the packages necessary for the program to run
pip install -r requirements.txt

At this point, the environment is fully configured and ready to use for this project.
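You can optionally sanity-check the installation from Python before moving on:

import torch
import torchvision

print(torch.__version__)          # expect 1.7.1
print(torchvision.__version__)    # expect 0.8.2
print(torch.cuda.is_available())  # expect True on a GPU machine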

3. Dataset and Model

  • Dataset: Download the benchmark_datasets archive from this repository's Releases, unzip it into the datasets directory, and run scripts/matlab_scripts/generate_bicubic_img.m to generate the GTmodx folders (used as HR) alongside the LRbicx folders (used as LR); a Python sketch of the same process follows this list.

  • Model: Download the pretrained models from this repository's Releases and place them in the experiments/pretrained_models folder.
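The MATLAB script above is the authoritative way to build the HR/LR pairs. For readers without MATLAB, the Python sketch below (using Pillow, with hypothetical Set5 paths) illustrates the same idea: crop each HR image so its size is divisible by the scale factor, then downscale with bicubic interpolation. Pillow's bicubic kernel differs slightly from MATLAB's imresize, so metrics computed on pairs generated this way may deviate from the paper's numbers.

import os
from PIL import Image

def make_pairs(src_dir, gt_dir, lr_dir, scale=4):
    os.makedirs(gt_dir, exist_ok=True)
    os.makedirs(lr_dir, exist_ok=True)
    for name in sorted(os.listdir(src_dir)):
        hr = Image.open(os.path.join(src_dir, name)).convert("RGB")
        w, h = hr.size
        hr = hr.crop((0, 0, w - w % scale, h - h % scale))  # the "GTmod" crop
        hr.save(os.path.join(gt_dir, name))
        lr = hr.resize((hr.width // scale, hr.height // scale), Image.BICUBIC)
        lr.save(os.path.join(lr_dir, name))  # the "LRbicx" image

# hypothetical paths for illustration
make_pairs("datasets/Set5/original", "datasets/Set5/GTmod4", "datasets/Set5/LRbicx4")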

4. Test and Validation

Once the environment, dataset, and models are ready, you can run testing and validation. The entry point is run.sh; reproduce the paper's results directly with:

/bin/bash run.sh

When the program finishes successfully, the inference results for each dataset are saved to the results folder and the objective metrics are printed in the terminal.
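For reference, the PSNR numbers reported during validation come from the mean squared error between the SR output and the ground truth. The standalone sketch below shows the core formula on 8-bit arrays; note that SR benchmarks typically evaluate PSNR/SSIM on the luminance (Y) channel after cropping a scale-sized border, which this sketch omits, so it is illustrative rather than a drop-in replacement for the repo's validation code.

import numpy as np

def psnr(img1, img2):
    """PSNR in dB between two uint8 images of identical shape."""
    mse = np.mean((img1.astype(np.float64) - img2.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(255.0 ** 2 / mse)

img = np.random.randint(0, 256, (64, 64, 3), dtype=np.uint8)
print(psnr(img, img))  # inf for identical inputs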

Note 1: Results may deviate slightly under different hardware or software environments.

Note 2: This repository only provides code for testing and validation. If you want to train or build on our network structure, you can do so by combining secan/archs/secan_arch.py with the BasicSR repository, as sketched below.
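For orientation, BasicSR discovers custom architectures through its registry, so a training setup typically starts by registering the network class. Below is a hedged sketch of that pattern; the class name SECANDemo and its body are placeholders for illustration, not the actual S-ECAN definition (which lives in secan/archs/secan_arch.py).

import torch.nn as nn
from basicsr.utils.registry import ARCH_REGISTRY

@ARCH_REGISTRY.register()
class SECANDemo(nn.Module):
    """Placeholder network; the real architecture is in secan/archs/secan_arch.py."""
    def __init__(self, num_feat=64, scale=4):
        super().__init__()
        self.scale = scale
        self.body = nn.Conv2d(3, 3, 3, padding=1)  # stand-in for the real body

    def forward(self, x):
        return self.body(x)

Once registered, the class can be referenced by name (network_g: type: SECANDemo) in a BasicSR training YAML.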

5. Results

  • Quantitative comparison (average PSNR/SSIM) with state-of-the-art methods on benchmark datasets. (Figure: quantitative comparison)
  • Visual comparison for 4× SR on the Urban100 dataset. (Figure: Visual1)
  • Visual comparison for 4× SR on the Manga109 dataset. (Figure: Visual2)
  • PSNR vs. total number of parameters of different image SR methods (×4) on Set5. (Figure: PSNR and parameters)

6. Citations

@article{SUN2024108859,
    title     = {Image super-resolution reconstruction using Swin Transformer with efficient channel attention networks},
    journal   = {Engineering Applications of Artificial Intelligence},
    volume    = {136},
    pages     = {108859},
    year      = {2024},
    issn      = {0952-1976},
    doi       = {10.1016/j.engappai.2024.108859},
    url       = {https://www.sciencedirect.com/science/article/pii/S0952197624010170},
    author    = {Zhenxi Sun and Jin Zhang and Ziyi Chen and Lu Hong and Rui Zhang and Weishi Li and Haojie Xia},
    keywords  = {Image super-resolution, Swin Transformer, Efficient channel attention, Multi-attention fusion},
}

7. Acknowledgement

The code is based on BasicSR. Please also follow its license. Thanks for their awesome work.
