Saved in:
Bibliographic Details
Main Authors: Galashov, Alexandre, de Bortoli, Valentin, Gretton, Arthur
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2405.06780
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914791608549376
author Galashov, Alexandre
de Bortoli, Valentin
Gretton, Arthur
author_facet Galashov, Alexandre
de Bortoli, Valentin
Gretton, Arthur
contents We propose a gradient flow procedure for generative modeling by transporting particles from an initial source distribution to a target distribution, where the gradient field on the particles is given by a noise-adaptive Wasserstein Gradient of the Maximum Mean Discrepancy (MMD). The noise-adaptive MMD is trained on data distributions corrupted by increasing levels of noise, obtained via a forward diffusion process, as commonly used in denoising diffusion probabilistic models. The result is a generalization of MMD Gradient Flow, which we call Diffusion-MMD-Gradient Flow or DMMD. The divergence training procedure is related to discriminator training in Generative Adversarial Networks (GAN), but does not require adversarial training. We obtain competitive empirical performance in unconditional image generation on CIFAR10, MNIST, CELEB-A (64 x64) and LSUN Church (64 x 64). Furthermore, we demonstrate the validity of the approach when MMD is replaced by a lower bound on the KL divergence.
format Preprint
id arxiv_https___arxiv_org_abs_2405_06780
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Deep MMD Gradient Flow without adversarial training
Galashov, Alexandre
de Bortoli, Valentin
Gretton, Arthur
Machine Learning
Artificial Intelligence
We propose a gradient flow procedure for generative modeling by transporting particles from an initial source distribution to a target distribution, where the gradient field on the particles is given by a noise-adaptive Wasserstein Gradient of the Maximum Mean Discrepancy (MMD). The noise-adaptive MMD is trained on data distributions corrupted by increasing levels of noise, obtained via a forward diffusion process, as commonly used in denoising diffusion probabilistic models. The result is a generalization of MMD Gradient Flow, which we call Diffusion-MMD-Gradient Flow or DMMD. The divergence training procedure is related to discriminator training in Generative Adversarial Networks (GAN), but does not require adversarial training. We obtain competitive empirical performance in unconditional image generation on CIFAR10, MNIST, CELEB-A (64 x64) and LSUN Church (64 x 64). Furthermore, we demonstrate the validity of the approach when MMD is replaced by a lower bound on the KL divergence.
title Deep MMD Gradient Flow without adversarial training
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2405.06780