Saved in:
Bibliographic Details
Main Authors: Fogliani, Luigi, Loureiro, Bruno, Gabrié, Marylou
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.12923
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911446304030720
author Fogliani, Luigi
Loureiro, Bruno
Gabrié, Marylou
author_facet Fogliani, Luigi
Loureiro, Bruno
Gabrié, Marylou
contents Mode collapse, the failure to capture one or more modes when targetting a multimodal distribution, is a central challenge in modern variational inference. In this work, we provide a mathematical analysis of annealing based strategies for mitigating mode collapse in a tractable setting: learning a Gaussian mixture, where mode collapse is known to arise. Leveraging a low dimensional summary statistics description, we precisely characterize the interplay between the initial temperature and the annealing rate, and derive a sharp formula for the probability of mode collapse. Our analysis shows that an appropriately chosen annealing scheme can robustly prevent mode collapse. Finally, we present numerical evidence that these theoretical tradeoffs qualitatively extend to neural network based models, RealNVP normalizing flows, providing guidance for designing annealing strategies mitigating mode collapse in practical variational inference pipelines.
format Preprint
id arxiv_https___arxiv_org_abs_2602_12923
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Annealing in variational inference mitigates mode collapse: A theoretical study on Gaussian mixtures
Fogliani, Luigi
Loureiro, Bruno
Gabrié, Marylou
Machine Learning
Mode collapse, the failure to capture one or more modes when targetting a multimodal distribution, is a central challenge in modern variational inference. In this work, we provide a mathematical analysis of annealing based strategies for mitigating mode collapse in a tractable setting: learning a Gaussian mixture, where mode collapse is known to arise. Leveraging a low dimensional summary statistics description, we precisely characterize the interplay between the initial temperature and the annealing rate, and derive a sharp formula for the probability of mode collapse. Our analysis shows that an appropriately chosen annealing scheme can robustly prevent mode collapse. Finally, we present numerical evidence that these theoretical tradeoffs qualitatively extend to neural network based models, RealNVP normalizing flows, providing guidance for designing annealing strategies mitigating mode collapse in practical variational inference pipelines.
title Annealing in variational inference mitigates mode collapse: A theoretical study on Gaussian mixtures
topic Machine Learning
url https://arxiv.org/abs/2602.12923