Saved in:
Bibliographic Details
Main Author: Li, Henry
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2412.07935
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917865818423296
author Li, Henry
author_facet Li, Henry
contents Diffusion models generate samples by incrementally reversing a process that turns data into noise. We show that when the step size goes to zero, the reversed process is invariant to the distribution of these increments. This reveals a previously unconsidered parameter in the design of diffusion models: the distribution of the diffusion step $Δx_k := x_{k} - x_{k + 1}$. This parameter is implicitly set by default to be normally distributed in most diffusion models. By lifting this assumption, we generalize the framework for designing diffusion models and establish an expanded class of diffusion processes with greater flexibility in the choice of loss function used during training. We demonstrate the effectiveness of these models on density estimation and generative modeling tasks on standard image datasets, and show that different choices of the distribution of $Δx_k$ result in qualitatively different generated samples.
format Preprint
id arxiv_https___arxiv_org_abs_2412_07935
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Non-Normal Diffusion Models
Li, Henry
Machine Learning
Artificial Intelligence
Diffusion models generate samples by incrementally reversing a process that turns data into noise. We show that when the step size goes to zero, the reversed process is invariant to the distribution of these increments. This reveals a previously unconsidered parameter in the design of diffusion models: the distribution of the diffusion step $Δx_k := x_{k} - x_{k + 1}$. This parameter is implicitly set by default to be normally distributed in most diffusion models. By lifting this assumption, we generalize the framework for designing diffusion models and establish an expanded class of diffusion processes with greater flexibility in the choice of loss function used during training. We demonstrate the effectiveness of these models on density estimation and generative modeling tasks on standard image datasets, and show that different choices of the distribution of $Δx_k$ result in qualitatively different generated samples.
title Non-Normal Diffusion Models
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2412.07935