Autonomous underwater vehicles (AUVs) depend on a variety of sensors for decision-making, among which vision-based sensing is an attractive modality. However, raw underwater imagery needs to be enhanced: red light attenuates rapidly with depth, giving images a bluish cast; small changes in the vehicle's altitude above the seafloor alter image brightness; and refraction, absorption, suspended particles, and color distortion further result in noisy, distorted visual data.
To tackle this, Pix2Pix GANs have been used to restore the images. Pix2Pix is a conditional adversarial network that performs image-to-image translation, mapping an image from an arbitrary domain X to another arbitrary domain Y. By letting X be a set of distorted underwater images and Y the corresponding undistorted images, the network can generate an enhanced version of a given underwater image.
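As a rough illustration of how the conditional GAN is trained, the sketch below combines the adversarial loss with an L1 reconstruction term, following the Pix2Pix formulation. The discriminator module and the `L1_LAMBDA` weight here are assumptions standing in for whatever this repository defines; this is a minimal sketch, not the repository's exact training code.

```python
import torch
import torch.nn as nn

bce = nn.BCEWithLogitsLoss()  # adversarial loss on the discriminator logits
l1 = nn.L1Loss()              # reconstruction loss toward the clean image
L1_LAMBDA = 100               # weight from the Pix2Pix paper; this repo may use another value

def generator_loss(disc, x, y_fake, y_real):
    # The generator wants D(x, G(x)) to be classified as real ...
    d_fake = disc(x, y_fake)
    adv = bce(d_fake, torch.ones_like(d_fake))
    # ... while staying close (in L1) to the undistorted ground truth.
    return adv + L1_LAMBDA * l1(y_fake, y_real)

def discriminator_loss(disc, x, y_fake, y_real):
    # Real pairs (distorted input, clean target) vs. fake pairs (input, G(x)).
    d_real = disc(x, y_real)
    d_fake = disc(x, y_fake.detach())
    return 0.5 * (bce(d_real, torch.ones_like(d_real))
                  + bce(d_fake, torch.zeros_like(d_fake)))
```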
The dataset can be downloaded here (428 MB).
For more information on the dataset, refer to:
- https://ieeexplore.ieee.org/document/8460552
- http://irvlab.cs.umn.edu/enhancing-underwater-imagery-using-gans
First row: the ground-truth images we want; second row: the same scenes as seen underwater.
- Link to the zip file containing the pretrained weights of the generator and discriminator: Mega Link (586.1 MB).
- Extract the zip file and put the `.pth.tar` files in the directory with all the Python files. Make sure to set `LOAD_MODEL=True` in the `config.py` file (see the checkpoint-loading sketch after this list).
- The code can run on a single machine with multiple GPUs.
- If a GPU is being used, then in `config.py`:
  - Set `DEVICE` to the main device you want to use (by default `cuda:5`).
  - Set `DEVICE_IDs` to the list of GPU ids you want to use; keep the first entry of this list the same as the device set in `DEVICE`.
- If the CPU is being used, then in `config.py`:
  - Set `DEVICE` to `"cpu"`.
  - Set `DEVICE_IDs` to `["cpu"]`.
- Edit the other parameters in `config.py` to match the setup you want to use and run `train.py` (see the `config.py` sketch after this list).
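For reference, a `config.py` set up along the lines described above might look roughly like this. Only `DEVICE`, `DEVICE_IDs`, and `LOAD_MODEL` are named in this README; everything else here, including the exact format of the GPU id list, is an assumption to be checked against the repository's code.

```python
# config.py -- illustrative values only; adjust to your hardware.
import torch

if torch.cuda.is_available():
    DEVICE = "cuda:0"      # the repository's default is "cuda:5"
    DEVICE_IDs = [0, 1]    # first entry must match the device in DEVICE;
                           # integer ids vs. device strings is an assumption
else:
    DEVICE = "cpu"
    DEVICE_IDs = ["cpu"]

LOAD_MODEL = True          # resume from the pretrained .pth.tar checkpoints
```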
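When `LOAD_MODEL` is enabled, restoring the `.pth.tar` checkpoints typically follows the pattern below. The checkpoint key names and the helper's signature are assumptions based on common Pix2Pix training code, not necessarily what this repository implements.

```python
import torch

def load_checkpoint(path, model, optimizer, lr, device):
    """Restore model/optimizer state from a .pth.tar file (assumed key names)."""
    checkpoint = torch.load(path, map_location=device)
    model.load_state_dict(checkpoint["state_dict"])
    optimizer.load_state_dict(checkpoint["optimizer"])
    # Reapply the configured learning rate so the restored optimizer state
    # does not carry over a stale value.
    for group in optimizer.param_groups:
        group["lr"] = lr
```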
1st column: Underwater Image | 2nd column: Ground Truth | 3rd column: Generated Image
@misc{isola2018imagetoimage,
title={Image-to-Image Translation with Conditional Adversarial Networks},
author={Phillip Isola and Jun-Yan Zhu and Tinghui Zhou and Alexei A. Efros},
year={2018},
eprint={1611.07004},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
@inproceedings{Fabbri2018ICRA,
author = {Cameron Fabbri and Md Jahidul Islam and Junaed Sattar},
title = {{Enhancing Underwater Imagery using Generative Adversarial Networks}},
booktitle = {Proceedings of the {IEEE International Conference on Robotics and Automation (ICRA)}, to appear},
year = {2018},
address = {Brisbane, Queensland, Australia},
month = {May}
}