Diffusion Models
Last updated
Last updated
Repetitive refining of a sample to get a final good image.
This repetitive sampling makes the generation process slow.
They take the input image and gradually add Gaussian noise to it through a series of steps. We will call this the forward process. Notably, this is unrelated to the forward pass of a neural network.
Basically, at each time step you add some noise to the sample. So you start with the image, and then keep adding Gaussian noise until it becomes just noise.
In this, a neural network is trained to reverse the noise-adding process. In this the network is trained to predict a "less" noise sample given a noise sample, basically the opposite of the forward process at a time .
Very good way to understand image generation from auto-regression to diffusion point of view.