Bibliographic Details
Main Authors: Liu, Hao; Ye, Junze Tony; Blanchet, Jose; Si, Nian
Format: Preprint
Published: 2024
Subjects: Machine Learning; Statistics Theory
Online Access: https://arxiv.org/abs/2406.19619
author Liu, Hao
Ye, Junze Tony
Blanchet, Jose
Si, Nian
contents We introduce ScoreFusion, a theoretically grounded method for fusing multiple pre-trained diffusion models that are assumed to generate from auxiliary populations. ScoreFusion is particularly useful for enhancing the generative modeling of a target population with limited observed data. Our starting point considers the family of KL barycenters of the auxiliary populations, which is proven to be an optimal parametric class in the KL sense, but difficult to learn. Nevertheless, by recasting the learning problem as score matching in denoising diffusion, we obtain a tractable way of computing the optimal KL barycenter weights. We prove a dimension-free sample complexity bound in total variation distance, provided that the auxiliary models are well-fitted for their own task and the auxiliary tasks combined capture the target well. The sample efficiency of ScoreFusion is demonstrated by learning handwritten digits. We also provide a simple adaptation of a Stable Diffusion denoising pipeline that enables sampling from the KL barycenter of two auxiliary checkpoints; on a portrait generation task, our method produces faces that enhance population heterogeneity relative to the auxiliary distributions.
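The weight-fitting idea in the abstract can be illustrated with a toy sketch. It is not the authors' implementation, and everything in it is an assumption made for illustration: two one-dimensional Gaussian "auxiliary models" with known scores, a fused score written as the weighted sum `lam*s1 + (1-lam)*s2` (the score of the normalized geometric mean of the two densities), and a single-noise-level implicit score matching objective in place of the time-dependent denoising objective a diffusion model would use. The helper names `gaussian_score` and `fit_fusion_weight` are hypothetical.

```python
import random


def gaussian_score(mean, var):
    """Return the score d/dx log N(x; mean, var) and its derivative in x."""
    def score(x):
        return -(x - mean) / var

    def dscore(x):
        return -1.0 / var

    return score, dscore


def fit_fusion_weight(samples, aux1, aux2):
    """Fit lam in [0, 1] for the fused score lam*s1 + (1 - lam)*s2.

    Minimizes the empirical implicit score matching objective
    mean(0.5 * s(x)**2 + s'(x)), which is a convex quadratic in lam,
    so the unconstrained minimizer can be clipped to [0, 1].
    Assumes the two auxiliary scores differ on the samples.
    """
    (s1, ds1), (s2, ds2) = aux1, aux2
    n = len(samples)
    diff = [s1(x) - s2(x) for x in samples]              # s1 - s2 at each sample
    cross = sum(s2(x) * d for x, d in zip(samples, diff)) / n
    curvature = sum(d * d for d in diff) / n
    linear = sum(ds1(x) - ds2(x) for x in samples) / n   # derivative term
    lam = -(cross + linear) / curvature
    return min(1.0, max(0.0, lam))


# Hypothetical setup: a small target sample and two frozen auxiliary models.
random.seed(0)
target_samples = [random.gauss(1.0, 1.0) for _ in range(50)]
aux1 = gaussian_score(-2.0, 1.0)
aux2 = gaussian_score(2.0, 1.0)

lam = fit_fusion_weight(target_samples, aux1, aux2)
print(f"fitted weight on auxiliary model 1: {lam:.3f}")
```

In this equal-variance toy case the fused density is again Gaussian with mean `lam*(-2) + (1 - lam)*2`, so the fitted weight simply moves that mean onto the sample mean of the target data. Only the single scalar weight is estimated while the auxiliary scores stay frozen, which gives a rough intuition for why few target samples can suffice.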
format Preprint
id arxiv_https___arxiv_org_abs_2406_19619
institution arXiv
publishDate 2024
record_format arxiv
title ScoreFusion: Fusing Score-based Generative Models via Kullback-Leibler Barycenters
topic Machine Learning
Statistics Theory
url https://arxiv.org/abs/2406.19619