DDPM: Denoising Diffusion Probabilistic Models¶

Mathematical Formulation¶

Forward Diffusion Process¶

The distribution $q$ in the forward diffusion process is defined as Markov Chain given by $$ q(x_1,...x_T|x_0) = \Pi_{t=1}^T q(x_t|x_{t-1}) (1) $$ $$ q(x_t|x_{t-1}) = \mathcal{N}(x_t;\sqrt{1-\beta_t}x_{t-1},\beta_t I) (2) $$

From (2) we can sample $x_t$ from $q(x_t|x_{t-1})$ by $$ x_t = \sqrt{1-\beta_t}x_{t-1} + \sqrt{\beta_t}z_t (3) $$ where $z_t \sim \mathcal{N}(0,I)$.
We want to sample at any time $t$ from the distribution $q(x_t|x_0)$ without knowing the previous samples $x_{t-1},...,x_1$.

$$\alpha_t = 1-\beta_t (4)\ $$ $\(\bar{\alpha}_t = \Pi_{i=1}^t \alpha_i \ \ (5)\\$\) $$ q(x_t|x_0) = \mathcal{N}(x_t;\sqrt{\bar{\alpha}_t}x_0,(1-\bar{\alpha}_t) I) (6)$$

Therefore, we can sample $x_t$ from $q(x_t|x_0)$ by $x_t = \sqrt{\bar{\alpha}_t}x_0 + \sqrt{1-\bar{\alpha}_t}z_t$ where $z_t \sim \mathcal{N}(0,I)$ and $\bar{\alpha}_t$ is the cumulative product of $\alpha_i$,which means $\bar{\alpha}_t = \Pi_{i=1}^t \alpha_i$.

Reverse Diffusion Process¶

The reverse process is a Markov chain where a neural network predicts the parameters for the reverse diffusion kernel at each timestep.

maximum likelihood estimation (MLE) is used to train the neural network.
Maximize the lower bound of the log-likelihood of the data given the model.
After some math -- our goal is to minimize the noise difference between the predicted noise and the true noise. $\(||(\epsilon_{\theta}-\epsilon)^2_2||$\)

Algorithm¶

Implementation¶

Noise Scheduling¶

References¶

what are diffusion models

An In-Depth Guide to Denoising Diffusion Probabilistic Models DDPM – Theory to Implementation

Youtube: Denoising Diffusion Probabilistic Models | DDPM Explained

最后更新: 2024年9月3日 11:00:52
创建日期: 2024年8月28日 16:21:40