Saved in:
Bibliographic Details
Main Authors: Elbrächter, Dennis, Alberti, Giovanni S., Santacesaria, Matteo
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2509.24710
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909814644277248
author Elbrächter, Dennis
Alberti, Giovanni S.
Santacesaria, Matteo
author_facet Elbrächter, Dennis
Alberti, Giovanni S.
Santacesaria, Matteo
contents Score-based diffusion models are a highly effective method for generating samples from a distribution of images. We consider scenarios where the training data comes from a noisy version of the target distribution, and present an efficiently implementable modification of the inference procedure to generate noiseless samples. Our approach is motivated by the manifold hypothesis, according to which meaningful data is concentrated around some low-dimensional manifold of a high-dimensional ambient space. The central idea is that noise manifests as low magnitude variation in off-manifold directions in contrast to the relevant variation of the desired distribution which is mostly confined to on-manifold directions. We introduce the notion of an extended score and show that, in a simplified setting, it can be used to reduce small variations to zero, while leaving large variations mostly unchanged. We describe how its approximation can be computed efficiently from an approximation to the standard score and demonstrate its efficacy on toy problems, synthetic data, and real data.
format Preprint
id arxiv_https___arxiv_org_abs_2509_24710
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle MAD: Manifold Attracted Diffusion
Elbrächter, Dennis
Alberti, Giovanni S.
Santacesaria, Matteo
Machine Learning
Numerical Analysis
Score-based diffusion models are a highly effective method for generating samples from a distribution of images. We consider scenarios where the training data comes from a noisy version of the target distribution, and present an efficiently implementable modification of the inference procedure to generate noiseless samples. Our approach is motivated by the manifold hypothesis, according to which meaningful data is concentrated around some low-dimensional manifold of a high-dimensional ambient space. The central idea is that noise manifests as low magnitude variation in off-manifold directions in contrast to the relevant variation of the desired distribution which is mostly confined to on-manifold directions. We introduce the notion of an extended score and show that, in a simplified setting, it can be used to reduce small variations to zero, while leaving large variations mostly unchanged. We describe how its approximation can be computed efficiently from an approximation to the standard score and demonstrate its efficacy on toy problems, synthetic data, and real data.
title MAD: Manifold Attracted Diffusion
topic Machine Learning
Numerical Analysis
url https://arxiv.org/abs/2509.24710