Saved in:
Bibliographic Details
Main Authors: Bradley, Arwen, Nakkiran, Preetum, Berthelot, David, Thornton, James, Susskind, Joshua M.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.04549
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912605533110272
author Bradley, Arwen
Nakkiran, Preetum
Berthelot, David
Thornton, James
Susskind, Joshua M.
author_facet Bradley, Arwen
Nakkiran, Preetum
Berthelot, David
Thornton, James
Susskind, Joshua M.
contents We study the theoretical foundations of composition in diffusion models, with a particular focus on out-of-distribution extrapolation and length-generalization. Prior work has shown that composing distributions via linear score combination can achieve promising results, including length-generalization in some cases (Du et al., 2023; Liu et al., 2022). However, our theoretical understanding of how and why such compositions work remains incomplete. In fact, it is not even entirely clear what it means for composition to "work". This paper starts to address these fundamental gaps. We begin by precisely defining one possible desired result of composition, which we call projective composition. Then, we investigate: (1) when linear score combinations provably achieve projective composition, (2) whether reverse-diffusion sampling can generate the desired composition, and (3) the conditions under which composition fails. We connect our theoretical analysis to prior empirical observations where composition has either worked or failed, for reasons that were unclear at the time. Finally, we propose a simple heuristic to help predict the success or failure of new compositions.
format Preprint
id arxiv_https___arxiv_org_abs_2502_04549
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Mechanisms of Projective Composition of Diffusion Models
Bradley, Arwen
Nakkiran, Preetum
Berthelot, David
Thornton, James
Susskind, Joshua M.
Machine Learning
We study the theoretical foundations of composition in diffusion models, with a particular focus on out-of-distribution extrapolation and length-generalization. Prior work has shown that composing distributions via linear score combination can achieve promising results, including length-generalization in some cases (Du et al., 2023; Liu et al., 2022). However, our theoretical understanding of how and why such compositions work remains incomplete. In fact, it is not even entirely clear what it means for composition to "work". This paper starts to address these fundamental gaps. We begin by precisely defining one possible desired result of composition, which we call projective composition. Then, we investigate: (1) when linear score combinations provably achieve projective composition, (2) whether reverse-diffusion sampling can generate the desired composition, and (3) the conditions under which composition fails. We connect our theoretical analysis to prior empirical observations where composition has either worked or failed, for reasons that were unclear at the time. Finally, we propose a simple heuristic to help predict the success or failure of new compositions.
title Mechanisms of Projective Composition of Diffusion Models
topic Machine Learning
url https://arxiv.org/abs/2502.04549