Denoising Diffusion Probabilistic Model in Flax

This implementation is based on lucidrains's denoising-diffusion-pytorch, where he implemented the original DDPM model proposed from paper Denoising Diffusion Probabilistic Models, as well as latest research findings

I will keep adding new research findings to this repo, let me know if you have any suggestions!

end-to-end training on colab notebook

You can run this code and even modify it directly in Google Colab, no installation required:

[https://github.com/yiyixuxu/denoising-diffusion-flax/blob/main/ddpm_flax_oxford102_end_to_end.ipynb]

The Colab also demonstrates how to configure your own training and load pre-trained checkpoint to generate samples on your own!

generated sample from oxford102 flower dataset

on going at 27k steps (self-conditioning + P2 weighting)

on going at 85k steps (self-conditioning + P2 weighting)

300k steps!

To-do list

write a wandb report about the p2-weighting, self-conditioning and predict_from_x0
implement gradient accumulation
implement ddim

Examples

All examples use read-to-use tensorflow dataset, and have the training process and model checkpoint available on W&B so it is very easy to reproduce

cifar10

python3 main.py --workdir=./cifar10 --mode=train --config=configs/cifar10.py

W&B project page: ddpm-flax-cifar10

fashion-mnist

python3 main.py --workdir=./fashion-mnist --mode=train --config=configs/fashion_mnist.py

W&B project page: ddpm-flax-fashion-mnist

oxford_flowers102

python3 main.py --workdir=./flower102--mode=train --config=configs/oxford102_p2_selfcondition.py

W&B project page: ddpm-flax-flower102

Load a model checkpoint from W&B

By default, we log our model as W&B artifact at end of the training, you can restore your checkpoint from wandb artifact directly by pass the --wandb_artifact argument on commend line; In the example below, we will load our model checkpint from the wandb artifact yiyixu/ddpm-flax-fashion-mnist/model-3j8xvqwf:v0 and continue our training from there

python main.py --workdir=./fashion_mnist_wandb --mode=train --wandb_artifact=yiyixu/ddpm-flax-fashion-mnist/model-3j8xvqwf:v0 --config=configs/fashion_mnist_cpu.py

Train your own model

You can customize your training either by update the config file or overriding parameters on the command line

see more details on how to configure your training from the notebook

Update the config file

You can find example configuration files under configs/ folder - you can create your own configuration file and run

python3 main.py --workdir=./your_test_folder --mode=train --config=configs/your_config_file.py

Overriding parameters on the command line

Specify a hyperparameter configuration by the means of setting --config flag. Configuration flag is defined using config_flags. config_flags allows overriding configuration fields. This can be done as follows:

python main.py --workdir=./fashion_mnist_cpu --config=configs/fashion_mnist_cpu.py  \
--config.training.num_train_steps=100

Configuration

Dataset

the script can run directly on any TensorFlow dataset, just set the configuration field data.dataset to the desired dataset name. You can find a list of ready-to-use dataset [here](tensorflow dataset name https://www.tensorflow.org/datasets/catalog/overview)

See below the list of hyperparameters for data processing; If you are using TPU with 8 devices, make sure your batch_size is dividable by 8; If you set data.image_size to a different size than your actual image, it will be resized, so make sure to set the size properly

data.dataset           
data.batch_size              
data.cache                   
data.image_size
data.channels

W&B Logging

It use Weights and Bias logging by default, if you don't already have an W&B acccount, you can sign up here - you will also be given option to create an account when you run the script on comand line

To disable W&B logging, you can override with --config flag on command line

python3 main.py --workdir=./fashion-mnist --mode=train --config=configs/fashion_mnist.py --config.wandb.log_train=False

You can find below list of hyperparameters for W&B logging in config file

  wandb.entity = None
  wandb.project = "ddpm-flax-flower102"
  wandb.job_type = "training"
  wandb.name = None 
  wandb.log_train = True
  wandb.log_sample = True
  wandb.log_model = True

wandb.entity, wandb.project, wandb.job_type and wandb.name is used to initialize the wandb run; wandb.project is required field because we will create a project with that name to send the run to; all the other fields can be left as None

read more about how to set up these values in Weights & Biase documentation about wandb.init() here

by default, we will log training metrics (wandb.log_train = True), generated samples (wandb.log_sample = True), as well as the final model checkpoint (wandb.log_model = True);

Predict x0

By default, we train our model to predict noise by modifying its parameterization, if you want to predict x_0 directly from x_t, set config.ddpm.pred_x0=True;

The authors of DDPM paper claimed that they it lead to worse sample quality in their experiments

Self-Conditioning

Self-Conditioning is a useful technique for improving diffusion models. In a typical diffusion sampling process, the model iteratively predict x0 in order to gradually denoise the image, and the x0 estimated from previous step is discard in the new step; with self-conditioning, the model will also take previously generated samples as input.

You read more about the technique in the paper Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning

By default, we do not apply self-conditioning; If you wish to apply self-conditioning, set config.ddpm.self_condition=True;

P2 Weighting

P2 (perception prioritized) weighting optimizes the weighting scheme of the training objective function to improve sample quality. It encourages the diffusion model to focus on recovering signals from highly corrupted data, where the model learns global and perceptually rich concepts.

You can read more about P2 weighting in the paper and check out the github repo

By default, we do not apply P2 weighting. However you can apply it by change the values of p2 hyperparameters in config file, i.e. config.ddpm.p2_loss_weight_gamma and config.ddpm.p2_loss_weight_k;

the paper recomend use p2_loss_weight_gamma=1 and p2_loss_weight_k=1

Model EMA

By default, we will keep track of an exponential moving average version of the model and use it to generate samples. You can find the list of hyperparameters with default values for ema calculation in config file config.ema

  ema.beta = 0.995
  ema.update_every = 10
  ema.update_after_step = 100
  ema.inv_gamma = 1.0
  ema.power = 2 / 3
  ema.min_value = 0.0

ema.inv_gamma and ema.power is used to calculate ema_decay rate for each training step. i.e. ema_decay = (1 + steps / config.inv_gamma) ** - config.power ; ema.min_value and ema.beta determine the minimum and maximum decay rate

by default, we start to average the parameters after 100 steps (ema.update_after_step = 100) and we update the average every 10 steps (ema.update_every = 10)

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
denoising_diffusion_flax		denoising_diffusion_flax
images		images
LICENSE		LICENSE
README.md		README.md
ddpm_flax_oxford102_end_to_end.ipynb		ddpm_flax_oxford102_end_to_end.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Denoising Diffusion Probabilistic Model in Flax

end-to-end training on colab notebook

generated sample from oxford102 flower dataset

To-do list

Contents

Running locally

Google Cloud TPU

Examples

cifar10

fashion-mnist

oxford_flowers102

Load a model checkpoint from W&B

Train your own model

Update the config file

Overriding parameters on the command line

Configuration

Dataset

W&B Logging

Predict x0

Self-Conditioning

P2 Weighting

Model EMA

About

Releases

Packages

Languages

License

yiyixuxu/denoising-diffusion-flax

Folders and files

Latest commit

History

Repository files navigation

Denoising Diffusion Probabilistic Model in Flax

end-to-end training on colab notebook

generated sample from oxford102 flower dataset

To-do list

Contents

Running locally

Google Cloud TPU

Examples

cifar10

fashion-mnist

oxford_flowers102

Load a model checkpoint from W&B

Train your own model

Update the config file

Overriding parameters on the command line

Configuration

Dataset

W&B Logging

Predict x0

Self-Conditioning

P2 Weighting

Model EMA

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages