Image processing

This repository is a hands-on exploration of how images are processed. It covers the following topics:

generating artificial images with a GAN and a DDPM,
image classification with a Vision Transformer (ViT).

Generative Adversarial Network (GAN)

Using a dataset of pumpkin cake images, we train a generator and a discriminator. The generator tries to produce realistic pumpkin cake images, while the discriminator tries to distinguish real images from generated ones. To speed up training, the images are downscaled to 32x32 pixels.
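
The setup described above can be sketched in PyTorch as follows. This is a minimal illustration, not the architecture used in this repository: the layer sizes, `LATENT_DIM`, the optimizer settings, and the helper names `downscale` and `train_step` are all assumptions, and a random tensor stands in for a batch of real photos.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

LATENT_DIM = 64  # assumed size of the generator's noise input


def downscale(images: torch.Tensor, size: int = 32) -> torch.Tensor:
    """Resize a batch of images (B, C, H, W) to size x size pixels."""
    return F.interpolate(images, size=(size, size), mode="bilinear", align_corners=False)


class Generator(nn.Module):
    """Maps a latent vector to a 3x32x32 image with values in [-1, 1]."""

    def __init__(self, latent_dim: int = LATENT_DIM):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(latent_dim, 128, 4, 1, 0),  # 1x1 -> 4x4
            nn.BatchNorm2d(128), nn.ReLU(),
            nn.ConvTranspose2d(128, 64, 4, 2, 1),  # 4x4 -> 8x8
            nn.BatchNorm2d(64), nn.ReLU(),
            nn.ConvTranspose2d(64, 32, 4, 2, 1),  # 8x8 -> 16x16
            nn.BatchNorm2d(32), nn.ReLU(),
            nn.ConvTranspose2d(32, 3, 4, 2, 1),  # 16x16 -> 32x32
            nn.Tanh(),
        )

    def forward(self, z: torch.Tensor) -> torch.Tensor:
        return self.net(z.view(z.size(0), -1, 1, 1))


class Discriminator(nn.Module):
    """Maps a 3x32x32 image to a single real-vs-fake logit."""

    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 32, 4, 2, 1), nn.LeakyReLU(0.2),  # 32x32 -> 16x16
            nn.Conv2d(32, 64, 4, 2, 1), nn.LeakyReLU(0.2),  # 16x16 -> 8x8
            nn.Conv2d(64, 128, 4, 2, 1), nn.LeakyReLU(0.2),  # 8x8 -> 4x4
            nn.Flatten(),
            nn.Linear(128 * 4 * 4, 1),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)


def train_step(gen, disc, opt_g, opt_d, real):
    """Run one discriminator update followed by one generator update."""
    batch = real.size(0)
    ones = torch.ones(batch, 1, device=real.device)
    zeros = torch.zeros(batch, 1, device=real.device)
    fake = gen(torch.randn(batch, LATENT_DIM, device=real.device))

    # Discriminator: label real images 1 and generated images 0.
    loss_d = (F.binary_cross_entropy_with_logits(disc(real), ones)
              + F.binary_cross_entropy_with_logits(disc(fake.detach()), zeros))
    opt_d.zero_grad()
    loss_d.backward()
    opt_d.step()

    # Generator: try to make the discriminator output 1 for generated images.
    loss_g = F.binary_cross_entropy_with_logits(disc(fake), ones)
    opt_g.zero_grad()
    loss_g.backward()
    opt_g.step()
    return loss_d.item(), loss_g.item()


gen, disc = Generator(), Discriminator()
opt_g = torch.optim.Adam(gen.parameters(), lr=2e-4, betas=(0.5, 0.999))
opt_d = torch.optim.Adam(disc.parameters(), lr=2e-4, betas=(0.5, 0.999))
# Stand-in for a batch of real photos, scaled to [-1, 1] to match the Tanh output.
real = downscale(torch.rand(8, 3, 128, 128) * 2 - 1)
loss_d, loss_g = train_step(gen, disc, opt_g, opt_d, real)
```

In a real run, `train_step` would be called once per batch over many epochs, with `real` coming from the pumpkin cake dataset.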

Results (upscaled)

GAN model structure

Denoising Diffusion Probabilistic Models (DDPM)

The goal of this lab is to train a PyTorch model to denoise images. To keep the problem small, we model an image as a vector of shape (2,). The model is first trained on the bicycle.txt dataset. We then use the trained model to generate a bicycle image by removing noise sampled from a normal distribution.
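
The idea can be sketched as below for vectors of shape (2,). This is an illustration under stated assumptions, not the code from this lab: the number of steps `T`, the linear noise schedule, the `NoisePredictor` network, and the helper names are hypothetical, and points on a circle stand in for the contents of bicycle.txt, whose format is not described here.

```python
import math

import torch
import torch.nn as nn
import torch.nn.functional as F

T = 1000  # assumed number of diffusion steps
betas = torch.linspace(1e-4, 0.02, T)  # assumed linear noise schedule
alphas = 1.0 - betas
alpha_bars = torch.cumprod(alphas, dim=0)


def add_noise(x0, t, eps):
    """Forward process: compute x_t from x_0 and Gaussian noise eps in closed form."""
    signal = alpha_bars[t].sqrt().unsqueeze(1)
    noise = (1.0 - alpha_bars[t]).sqrt().unsqueeze(1)
    return signal * x0 + noise * eps


class NoisePredictor(nn.Module):
    """Small MLP that predicts the noise in x_t given x_t and the step t."""

    def __init__(self, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(3, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 2),
        )

    def forward(self, x_t, t):
        t_feature = (t.float() / T).unsqueeze(1)  # step index scaled to [0, 1)
        return self.net(torch.cat([x_t, t_feature], dim=1))


def training_loss(model, x0):
    """Noise each point at a random step and score the predicted noise with MSE."""
    t = torch.randint(0, T, (x0.size(0),))
    eps = torch.randn_like(x0)
    return F.mse_loss(model(add_noise(x0, t, eps), t), eps)


@torch.no_grad()
def sample(model, n):
    """Reverse process: start from Gaussian noise and denoise for T steps."""
    x = torch.randn(n, 2)
    for step in reversed(range(T)):
        t = torch.full((n,), step, dtype=torch.long)
        eps_hat = model(x, t)
        mean = (x - betas[step] / (1.0 - alpha_bars[step]).sqrt() * eps_hat) / alphas[step].sqrt()
        noise = torch.randn_like(x) if step > 0 else torch.zeros_like(x)
        x = mean + betas[step].sqrt() * noise
    return x


# Stand-in data: points on a unit circle instead of the bicycle.txt points.
angles = torch.rand(256) * 2 * math.pi
points = torch.stack([angles.cos(), angles.sin()], dim=1)

model = NoisePredictor()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
for _ in range(200):  # far too few steps for good samples, enough to show the loop
    loss = training_loss(model, points)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

generated = sample(model, 16)  # 16 new points of shape (2,)
```

Each call to `sample` starts from pure noise, so plotting many generated points should trace the shape of the training data once the model has been trained for long enough.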

Bike generated from noise

Image classification with Vision Transformers (ViT)

An image classification model built on a Vision Transformer.
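
For orientation, a Vision Transformer classifier can be sketched as below: the image is cut into patches, each patch becomes a token, a class token is prepended, and a transformer encoder processes the sequence. The class name `TinyViT`, the image size, patch size, number of classes, and model width are assumptions for illustration, not the configuration used in this repository.

```python
import torch
import torch.nn as nn


class TinyViT(nn.Module):
    """Minimal Vision Transformer: patch embedding, class token, encoder, linear head."""

    def __init__(self, image_size=32, patch_size=8, num_classes=10,
                 dim=64, depth=2, heads=4):
        super().__init__()
        num_patches = (image_size // patch_size) ** 2
        # A strided convolution splits the image into patches and projects each to `dim`.
        self.patch_embed = nn.Conv2d(3, dim, kernel_size=patch_size, stride=patch_size)
        self.cls_token = nn.Parameter(torch.zeros(1, 1, dim))
        self.pos_embed = nn.Parameter(torch.randn(1, num_patches + 1, dim) * 0.02)
        layer = nn.TransformerEncoderLayer(
            d_model=dim, nhead=heads, dim_feedforward=dim * 2, batch_first=True
        )
        self.encoder = nn.TransformerEncoder(layer, num_layers=depth)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, images: torch.Tensor) -> torch.Tensor:
        x = self.patch_embed(images)          # (B, dim, H/P, W/P)
        x = x.flatten(2).transpose(1, 2)      # (B, num_patches, dim)
        cls = self.cls_token.expand(x.size(0), -1, -1)
        x = torch.cat([cls, x], dim=1) + self.pos_embed
        x = self.encoder(x)
        return self.head(x[:, 0])             # classify from the class token


vit = TinyViT()
logits = vit(torch.randn(4, 3, 32, 32))  # random stand-in for a batch of images
predicted_classes = logits.argmax(dim=1)
```

With the assumed 32x32 input and 8x8 patches, each image becomes 16 patch tokens plus one class token.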

Other

To export a notebook to PDF:

jupyter nbconvert --to webpdf --allow-chromium-download lab.ipynb
