Affichage MARC: :: Library Catalog

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Holderrieth, Peter, Albergo, Michael S., Jaakkola, Tommi
Format:	Preprint
Publié:	2025
Sujets:	Machine Learning Computation
Accès en ligne:	https://arxiv.org/abs/2502.10843
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

_version_	1866916894677663744
author	Holderrieth, Peter Albergo, Michael S. Jaakkola, Tommi
author_facet	Holderrieth, Peter Albergo, Michael S. Jaakkola, Tommi
contents	We propose "LEAPS", an algorithm to sample from discrete distributions known up to normalization by learning a rate matrix of a continuous-time Markov chain (CTMC). LEAPS can be seen as a continuous-time formulation of annealed importance sampling and sequential Monte Carlo methods, extended so that the variance of the importance weights is offset by the inclusion of the CTMC. To derive these importance weights, we introduce a set of Radon-Nikodym derivatives of CTMCs over their path measures. Because the computation of these weights is intractable with standard neural network parameterizations of rate matrices, we devise a new compact representation for rate matrices via what we call "locally equivariant" functions. To parameterize them, we introduce a family of locally equivariant multilayer perceptrons, attention layers, and convolutional networks, and provide an approach to make deep networks that preserve the local equivariance. This property allows us to propose a scalable training algorithm for the rate matrix such that the variance of the importance weights associated to the CTMC are minimal. We demonstrate the efficacy of LEAPS on problems in statistical physics.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_10843
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	LEAPS: A discrete neural sampler via locally equivariant networks Holderrieth, Peter Albergo, Michael S. Jaakkola, Tommi Machine Learning Computation We propose "LEAPS", an algorithm to sample from discrete distributions known up to normalization by learning a rate matrix of a continuous-time Markov chain (CTMC). LEAPS can be seen as a continuous-time formulation of annealed importance sampling and sequential Monte Carlo methods, extended so that the variance of the importance weights is offset by the inclusion of the CTMC. To derive these importance weights, we introduce a set of Radon-Nikodym derivatives of CTMCs over their path measures. Because the computation of these weights is intractable with standard neural network parameterizations of rate matrices, we devise a new compact representation for rate matrices via what we call "locally equivariant" functions. To parameterize them, we introduce a family of locally equivariant multilayer perceptrons, attention layers, and convolutional networks, and provide an approach to make deep networks that preserve the local equivariance. This property allows us to propose a scalable training algorithm for the rate matrix such that the variance of the importance weights associated to the CTMC are minimal. We demonstrate the efficacy of LEAPS on problems in statistical physics.
title	LEAPS: A discrete neural sampler via locally equivariant networks
topic	Machine Learning Computation
url	https://arxiv.org/abs/2502.10843

Documents similaires