Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Elbrächter, Dennis, Alberti, Giovanni S., Santacesaria, Matteo
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Numerical Analysis
Online Access:	https://arxiv.org/abs/2509.24710
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909814644277248
author	Elbrächter, Dennis Alberti, Giovanni S. Santacesaria, Matteo
author_facet	Elbrächter, Dennis Alberti, Giovanni S. Santacesaria, Matteo
contents	Score-based diffusion models are a highly effective method for generating samples from a distribution of images. We consider scenarios where the training data comes from a noisy version of the target distribution, and present an efficiently implementable modification of the inference procedure to generate noiseless samples. Our approach is motivated by the manifold hypothesis, according to which meaningful data is concentrated around some low-dimensional manifold of a high-dimensional ambient space. The central idea is that noise manifests as low magnitude variation in off-manifold directions in contrast to the relevant variation of the desired distribution which is mostly confined to on-manifold directions. We introduce the notion of an extended score and show that, in a simplified setting, it can be used to reduce small variations to zero, while leaving large variations mostly unchanged. We describe how its approximation can be computed efficiently from an approximation to the standard score and demonstrate its efficacy on toy problems, synthetic data, and real data.
format	Preprint
id	arxiv_https___arxiv_org_abs_2509_24710
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	MAD: Manifold Attracted Diffusion Elbrächter, Dennis Alberti, Giovanni S. Santacesaria, Matteo Machine Learning Numerical Analysis Score-based diffusion models are a highly effective method for generating samples from a distribution of images. We consider scenarios where the training data comes from a noisy version of the target distribution, and present an efficiently implementable modification of the inference procedure to generate noiseless samples. Our approach is motivated by the manifold hypothesis, according to which meaningful data is concentrated around some low-dimensional manifold of a high-dimensional ambient space. The central idea is that noise manifests as low magnitude variation in off-manifold directions in contrast to the relevant variation of the desired distribution which is mostly confined to on-manifold directions. We introduce the notion of an extended score and show that, in a simplified setting, it can be used to reduce small variations to zero, while leaving large variations mostly unchanged. We describe how its approximation can be computed efficiently from an approximation to the standard score and demonstrate its efficacy on toy problems, synthetic data, and real data.
title	MAD: Manifold Attracted Diffusion
topic	Machine Learning Numerical Analysis
url	https://arxiv.org/abs/2509.24710

Similar Items