Sampling algorithm differ from paper. #5

ariel415el · 2021-05-20T09:41:44Z

Hi,
I want to elaborate on #2:
The sampling algorithm in your code is a bit different from what is shown in the paper.

The paper suggests one sampling step (Algorithm 2), while your code performs a different one that includes a clipping operation.

The clipping is done here, in diffusion/diffusion_tf/diffusion_utils.py (line 172 in 1e0dceb):

    x_recon = tf.clip_by_value(x_recon, -1., 1.)

Now I checked and indeed, without the clipping, the two equations are the same.
Can you give any interpretation or intuition for the clipping and why it is needed?
It seems to be crucial for training, yet it is not mentioned in the paper.

Thanks
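
For reference, the equivalence described in this comment can be sketched as follows. This assumes the paper's step is line 4 of Algorithm 2 and that the code's step is the posterior mean of q(x_{t-1} | x_t, x_0) evaluated at a clipped estimate of x_0; the notation (α_t = 1 − β_t, ᾱ_t as the cumulative product of the α's, z as standard normal noise) follows the paper, and the name `clip` is shorthand for `tf.clip_by_value`.

```latex
\begin{align}
\text{paper:}\quad
x_{t-1} &= \frac{1}{\sqrt{\alpha_t}}
  \left(x_t - \frac{1-\alpha_t}{\sqrt{1-\bar\alpha_t}}\,\epsilon_\theta(x_t,t)\right)
  + \sigma_t z \\
\text{code:}\quad
\hat{x}_0 &= \operatorname{clip}\!\left(
  \frac{x_t - \sqrt{1-\bar\alpha_t}\,\epsilon_\theta(x_t,t)}{\sqrt{\bar\alpha_t}},\,-1,\,1\right) \\
x_{t-1} &= \frac{\sqrt{\bar\alpha_{t-1}}\,\beta_t}{1-\bar\alpha_t}\,\hat{x}_0
  + \frac{\sqrt{\alpha_t}\,(1-\bar\alpha_{t-1})}{1-\bar\alpha_t}\,x_t
  + \sigma_t z
\end{align}
% Without the clip, substitute \hat{x}_0 into the last line.
% Coefficient of x_t:
%   \frac{\beta_t + \alpha_t(1-\bar\alpha_{t-1})}{\sqrt{\alpha_t}\,(1-\bar\alpha_t)}
%   = \frac{1-\bar\alpha_t}{\sqrt{\alpha_t}\,(1-\bar\alpha_t)} = \frac{1}{\sqrt{\alpha_t}}
% Coefficient of \epsilon_\theta:
%   -\frac{\sqrt{\bar\alpha_{t-1}}\,\beta_t\,\sqrt{1-\bar\alpha_t}}{(1-\bar\alpha_t)\sqrt{\bar\alpha_t}}
%   = -\frac{1}{\sqrt{\alpha_t}}\cdot\frac{1-\alpha_t}{\sqrt{1-\bar\alpha_t}}
```

So the two updates agree whenever the reconstruction already lies in [-1, 1], and differ only when the clip is active.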

malekinho8 · 2022-07-29T18:56:30Z

Is there any update on this? In my experience this detail has been crucial in determining sample quality, yet it seems to be largely unaddressed with regards to diffusion models. Does anyone have any insight on this?

Kaffaljidhmah2 · 2022-09-24T01:14:08Z

In https://huggingface.co/blog/annotated-diffusion, the author says:

Note that the code above is a simplified version of the original implementation. We found our simplification (which is in line with Algorithm 2 in the paper) to work just as well as the original, more complex implementation, which employs clipping.

varun-ml · 2022-10-19T12:35:00Z

The issue is that the predictions are often out of range, so the authors impose a correction to get meaningful samples: they restrict x_recon to the range -1 to +1 by clipping. Here is how they generate samples:

1. Get the error (noise) prediction at step t.
2. Get the reconstructed image, i.e. x_recon, from the error prediction.
3. Clip x_recon, since we know x is in the range -1 to 1.
4. Using the clipped x_recon, generate x_{t-1}.

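Step 2 is done using the equation that recovers x_0 from x_t and the predicted noise.

Step 4 is done using the mean of the posterior q(x_{t-1} | x_t, x_0).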

This is a hack and will lead to increased density at 1 and -1
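
The steps above can be sketched in NumPy roughly as follows. This is an illustration rather than the repository's TensorFlow code: the function names (`make_schedule`, `reverse_mean`, `paper_mean`, `p_sample`) are made up for this example, the linear beta schedule and its endpoints are assumed, and the posterior variance is assumed for sigma_t squared. `eps_pred` stands in for the network's noise prediction.

```python
import numpy as np

def make_schedule(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Assumed linear beta schedule plus the cumulative products it implies."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    alphas = 1.0 - betas
    acp = np.cumprod(alphas)                 # alpha-bar_t
    acp_prev = np.append(1.0, acp[:-1])      # alpha-bar_{t-1}, with alpha-bar_0 := 1
    return betas, alphas, acp, acp_prev

def reverse_mean(x_t, t, eps_pred, schedule, clip=True):
    """Steps 2-4: reconstruct x_0, optionally clip it, take the posterior mean."""
    betas, alphas, acp, acp_prev = schedule
    # Step 2: invert x_t = sqrt(acp) * x_0 + sqrt(1 - acp) * eps for x_0.
    x_recon = (x_t - np.sqrt(1.0 - acp[t]) * eps_pred) / np.sqrt(acp[t])
    # Step 3: images are scaled to [-1, 1], so force the estimate into that range.
    if clip:
        x_recon = np.clip(x_recon, -1.0, 1.0)
    # Step 4: mean of q(x_{t-1} | x_t, x_0) with x_0 replaced by x_recon.
    coef_x0 = betas[t] * np.sqrt(acp_prev[t]) / (1.0 - acp[t])
    coef_xt = (1.0 - acp_prev[t]) * np.sqrt(alphas[t]) / (1.0 - acp[t])
    return coef_x0 * x_recon + coef_xt * x_t

def paper_mean(x_t, t, eps_pred, schedule):
    """Mean used in line 4 of Algorithm 2 (no x_0 reconstruction, no clip)."""
    betas, alphas, acp, _ = schedule
    return (x_t - betas[t] / np.sqrt(1.0 - acp[t]) * eps_pred) / np.sqrt(alphas[t])

def p_sample(x_t, t, eps_pred, schedule, rng, clip=True):
    """One reverse step: the mean above plus Gaussian noise (none at t == 0)."""
    betas, _, acp, acp_prev = schedule
    var = betas[t] * (1.0 - acp_prev[t]) / (1.0 - acp[t])  # assumed sigma_t^2
    noise = rng.standard_normal(x_t.shape) if t > 0 else np.zeros_like(x_t)
    return reverse_mean(x_t, t, eps_pred, schedule, clip) + np.sqrt(var) * noise
```

With `clip=False`, `reverse_mean` and `paper_mean` agree up to floating-point error, which is the equivalence noted at the top of the thread; with `clip=True` they differ whenever the reconstruction leaves [-1, 1].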

ndvbd · 2023-01-24T09:11:42Z

I don't see the definition of σt in the paper. Where is it mentioned and defined? Why do we need to add noise in the reverse process?

wanghao-cst · 2023-04-17T05:59:09Z

> I don't see in the paper the definition of σt - where is it mentioned and defined? Why do we need to add noise in the reverse process?

So that each reverse step is a draw from a normal distribution.
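
On where σt is defined: as far as I can tell, Section 3.2 of the paper fixes the reverse-process variance to one of two untrained choices, σt² = βt or the posterior variance σt² = (1 − ᾱ_{t−1}) / (1 − ᾱ_t) · βt, and reports similar results for both. A small sketch of the two options follows; the function name `reverse_variances` and the linear schedule endpoints are assumptions for illustration.

```python
import numpy as np

def reverse_variances(betas):
    """Return the two fixed choices of sigma_t^2 for a given beta schedule."""
    alphas_cumprod = np.cumprod(1.0 - betas)                       # alpha-bar_t
    alphas_cumprod_prev = np.append(1.0, alphas_cumprod[:-1])      # alpha-bar_{t-1}
    sigma2_large = betas                                           # sigma_t^2 = beta_t
    sigma2_small = betas * (1.0 - alphas_cumprod_prev) / (1.0 - alphas_cumprod)
    return sigma2_large, sigma2_small

betas = np.linspace(1e-4, 0.02, 1000)  # assumed linear schedule
sigma2_large, sigma2_small = reverse_variances(betas)
```

The posterior variance is never larger than βt and is exactly zero at the first timestep, so under that choice no noise is added on the final reverse step.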

ariel415el mentioned this issue May 20, 2021

Clipping hmdolatabadi/denoising_diffusion#2

Open

kashif mentioned this issue Apr 20, 2022

Fix denoising sampling lucidrains/DALLE2-pytorch#16

Closed

buttercutter mentioned this issue Jan 16, 2023

Question about diffusion rate β_t #17

Open

guevara mentioned this issue Feb 19, 2023

The Annotated Diffusion Model guevara/read-it-later#9197

Open

gmingjie mentioned this issue Jun 7, 2023

p_mean_variance mean calculation openai/improved-diffusion#64

Open

jamesheald mentioned this issue Jun 24, 2024

Variance of the reverse process explodes openai/improved-diffusion#105

Closed
