Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Chen, Qi, Zhu, Jierui, Shkurti, Florian
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2506.00849
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908389352669184
author	Chen, Qi Zhu, Jierui Shkurti, Florian
author_facet	Chen, Qi Zhu, Jierui Shkurti, Florian
contents	Despite the empirical success of Diffusion Models (DMs) and Variational Autoencoders (VAEs), their generalization performance remains theoretically underexplored, especially lacking a full consideration of the shared encoder-generator structure. Leveraging recent information-theoretic tools, we propose a unified theoretical framework that provides guarantees for the generalization of both the encoder and generator by treating them as randomized mappings. This framework further enables (1) a refined analysis for VAEs, accounting for the generator's generalization, which was previously overlooked; (2) illustrating an explicit trade-off in generalization terms for DMs that depends on the diffusion time $T$; and (3) providing computable bounds for DMs based solely on the training data, allowing the selection of the optimal $T$ and the integration of such bounds into the optimization process to improve model performance. Empirical results on both synthetic and real datasets illustrate the validity of the proposed theory.
format	Preprint
id	arxiv_https___arxiv_org_abs_2506_00849
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Generalization in VAE and Diffusion Models: A Unified Information-Theoretic Analysis Chen, Qi Zhu, Jierui Shkurti, Florian Machine Learning Artificial Intelligence Despite the empirical success of Diffusion Models (DMs) and Variational Autoencoders (VAEs), their generalization performance remains theoretically underexplored, especially lacking a full consideration of the shared encoder-generator structure. Leveraging recent information-theoretic tools, we propose a unified theoretical framework that provides guarantees for the generalization of both the encoder and generator by treating them as randomized mappings. This framework further enables (1) a refined analysis for VAEs, accounting for the generator's generalization, which was previously overlooked; (2) illustrating an explicit trade-off in generalization terms for DMs that depends on the diffusion time $T$; and (3) providing computable bounds for DMs based solely on the training data, allowing the selection of the optimal $T$ and the integration of such bounds into the optimization process to improve model performance. Empirical results on both synthetic and real datasets illustrate the validity of the proposed theory.
title	Generalization in VAE and Diffusion Models: A Unified Information-Theoretic Analysis
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2506.00849

Similar Items