Saved in:
Bibliographic Details
Main Authors: Li, Sixu, Chen, Shi, Li, Qin
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2401.04856
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913212382838784
author Li, Sixu
Chen, Shi
Li, Qin
author_facet Li, Sixu
Chen, Shi
Li, Qin
contents Score-based Generative Models (SGMs) is one leading method in generative modeling, renowned for their ability to generate high-quality samples from complex, high-dimensional data distributions. The method enjoys empirical success and is supported by rigorous theoretical convergence properties. In particular, it has been shown that SGMs can generate samples from a distribution that is close to the ground-truth if the underlying score function is learned well, suggesting the success of SGM as a generative model. We provide a counter-example in this paper. Through the sample complexity argument, we provide one specific setting where the score function is learned well. Yet, SGMs in this setting can only output samples that are Gaussian blurring of training data points, mimicking the effects of kernel density estimation. The finding resonates a series of recent finding that reveal that SGMs can demonstrate strong memorization effect and fail to generate.
format Preprint
id arxiv_https___arxiv_org_abs_2401_04856
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle A Good Score Does not Lead to A Good Generative Model
Li, Sixu
Chen, Shi
Li, Qin
Machine Learning
Score-based Generative Models (SGMs) is one leading method in generative modeling, renowned for their ability to generate high-quality samples from complex, high-dimensional data distributions. The method enjoys empirical success and is supported by rigorous theoretical convergence properties. In particular, it has been shown that SGMs can generate samples from a distribution that is close to the ground-truth if the underlying score function is learned well, suggesting the success of SGM as a generative model. We provide a counter-example in this paper. Through the sample complexity argument, we provide one specific setting where the score function is learned well. Yet, SGMs in this setting can only output samples that are Gaussian blurring of training data points, mimicking the effects of kernel density estimation. The finding resonates a series of recent finding that reveal that SGMs can demonstrate strong memorization effect and fail to generate.
title A Good Score Does not Lead to A Good Generative Model
topic Machine Learning
url https://arxiv.org/abs/2401.04856