Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Galashov, Alexandre, de Bortoli, Valentin, Gretton, Arthur
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2405.06780
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914791608549376
author	Galashov, Alexandre de Bortoli, Valentin Gretton, Arthur
author_facet	Galashov, Alexandre de Bortoli, Valentin Gretton, Arthur
contents	We propose a gradient flow procedure for generative modeling by transporting particles from an initial source distribution to a target distribution, where the gradient field on the particles is given by a noise-adaptive Wasserstein Gradient of the Maximum Mean Discrepancy (MMD). The noise-adaptive MMD is trained on data distributions corrupted by increasing levels of noise, obtained via a forward diffusion process, as commonly used in denoising diffusion probabilistic models. The result is a generalization of MMD Gradient Flow, which we call Diffusion-MMD-Gradient Flow or DMMD. The divergence training procedure is related to discriminator training in Generative Adversarial Networks (GAN), but does not require adversarial training. We obtain competitive empirical performance in unconditional image generation on CIFAR10, MNIST, CELEB-A (64 x64) and LSUN Church (64 x 64). Furthermore, we demonstrate the validity of the approach when MMD is replaced by a lower bound on the KL divergence.
format	Preprint
id	arxiv_https___arxiv_org_abs_2405_06780
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Deep MMD Gradient Flow without adversarial training Galashov, Alexandre de Bortoli, Valentin Gretton, Arthur Machine Learning Artificial Intelligence We propose a gradient flow procedure for generative modeling by transporting particles from an initial source distribution to a target distribution, where the gradient field on the particles is given by a noise-adaptive Wasserstein Gradient of the Maximum Mean Discrepancy (MMD). The noise-adaptive MMD is trained on data distributions corrupted by increasing levels of noise, obtained via a forward diffusion process, as commonly used in denoising diffusion probabilistic models. The result is a generalization of MMD Gradient Flow, which we call Diffusion-MMD-Gradient Flow or DMMD. The divergence training procedure is related to discriminator training in Generative Adversarial Networks (GAN), but does not require adversarial training. We obtain competitive empirical performance in unconditional image generation on CIFAR10, MNIST, CELEB-A (64 x64) and LSUN Church (64 x 64). Furthermore, we demonstrate the validity of the approach when MMD is replaced by a lower bound on the KL divergence.
title	Deep MMD Gradient Flow without adversarial training
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2405.06780

Similar Items