Saved in:
Bibliographic Details
Main Authors: Hertrich, Johannes, Gruhlke, Robert
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.20444
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911103061065728
author Hertrich, Johannes
Gruhlke, Robert
author_facet Hertrich, Johannes
Gruhlke, Robert
contents In order to sample from an unnormalized probability density function, we propose to combine continuous normalizing flows (CNFs) with rejection-resampling steps based on importance weights. We relate the iterative training of CNFs with regularized velocity fields to a JKO scheme and prove convergence of the involved velocity fields to the velocity field of the Wasserstein gradient flow (WGF). The alternation of local flow steps and non-local rejection-resampling steps allows to overcome local minima or slow convergence of the WGF for multimodal distributions. Since the proposal of the rejection step is generated by the model itself, they do not suffer from common drawbacks of classical rejection schemes. The arising model can be trained iteratively, reduces the reverse Kullback-Leibler (KL) loss function in each step, allows to generate iid samples and moreover allows for evaluations of the generated underlying density. Numerical examples show that our method yields accurate results on various test distributions including high-dimensional multimodal targets and outperforms the state of the art in almost all cases significantly.
format Preprint
id arxiv_https___arxiv_org_abs_2407_20444
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Importance Corrected Neural JKO Sampling
Hertrich, Johannes
Gruhlke, Robert
Machine Learning
Probability
In order to sample from an unnormalized probability density function, we propose to combine continuous normalizing flows (CNFs) with rejection-resampling steps based on importance weights. We relate the iterative training of CNFs with regularized velocity fields to a JKO scheme and prove convergence of the involved velocity fields to the velocity field of the Wasserstein gradient flow (WGF). The alternation of local flow steps and non-local rejection-resampling steps allows to overcome local minima or slow convergence of the WGF for multimodal distributions. Since the proposal of the rejection step is generated by the model itself, they do not suffer from common drawbacks of classical rejection schemes. The arising model can be trained iteratively, reduces the reverse Kullback-Leibler (KL) loss function in each step, allows to generate iid samples and moreover allows for evaluations of the generated underlying density. Numerical examples show that our method yields accurate results on various test distributions including high-dimensional multimodal targets and outperforms the state of the art in almost all cases significantly.
title Importance Corrected Neural JKO Sampling
topic Machine Learning
Probability
url https://arxiv.org/abs/2407.20444