Saved in:
Bibliographic Details
Main Authors: Gerdes, Mathis, Welling, Max, Cheng, Miranda C. N.
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.02667
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917793734066176
author Gerdes, Mathis
Welling, Max
Cheng, Miranda C. N.
author_facet Gerdes, Mathis
Welling, Max
Cheng, Miranda C. N.
contents Diffusion generative models transform noise into data by inverting a process that progressively adds noise to data samples. Inspired by concepts from the renormalization group in physics, which analyzes systems across different scales, we revisit diffusion models by exploring three key design aspects: 1) the choice of representation in which the diffusion process operates (e.g. pixel-, PCA-, Fourier-, or wavelet-basis), 2) the prior distribution that data is transformed into during diffusion (e.g. Gaussian with covariance $Σ$), and 3) the scheduling of noise levels applied separately to different parts of the data, captured by a component-wise noise schedule. Incorporating the flexibility in these choices, we develop a unified framework for diffusion generative models with greatly enhanced design freedom. In particular, we introduce soft-conditioning models that smoothly interpolate between standard diffusion models and autoregressive models (in any basis), conceptually bridging these two approaches. Our framework opens up a wide design space which may lead to more efficient training and data generation, and paves the way to novel architectures integrating different generative approaches and generation tasks.
format Preprint
id arxiv_https___arxiv_org_abs_2410_02667
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle GUD: Generation with Unified Diffusion
Gerdes, Mathis
Welling, Max
Cheng, Miranda C. N.
Machine Learning
High Energy Physics - Theory
Diffusion generative models transform noise into data by inverting a process that progressively adds noise to data samples. Inspired by concepts from the renormalization group in physics, which analyzes systems across different scales, we revisit diffusion models by exploring three key design aspects: 1) the choice of representation in which the diffusion process operates (e.g. pixel-, PCA-, Fourier-, or wavelet-basis), 2) the prior distribution that data is transformed into during diffusion (e.g. Gaussian with covariance $Σ$), and 3) the scheduling of noise levels applied separately to different parts of the data, captured by a component-wise noise schedule. Incorporating the flexibility in these choices, we develop a unified framework for diffusion generative models with greatly enhanced design freedom. In particular, we introduce soft-conditioning models that smoothly interpolate between standard diffusion models and autoregressive models (in any basis), conceptually bridging these two approaches. Our framework opens up a wide design space which may lead to more efficient training and data generation, and paves the way to novel architectures integrating different generative approaches and generation tasks.
title GUD: Generation with Unified Diffusion
topic Machine Learning
High Energy Physics - Theory
url https://arxiv.org/abs/2410.02667