Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Bai, Runsheng, Zhang, Chengyu, Deng, Yangdong
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2603.25872
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915893252980736
author	Bai, Runsheng Zhang, Chengyu Deng, Yangdong
author_facet	Bai, Runsheng Zhang, Chengyu Deng, Yangdong
contents	Diffusion models have achieved remarkable success in generating high-fidelity content but suffer from slow, iterative sampling, resulting in high latency that limits their use in interactive applications. We introduce DRiffusion, a parallel sampling framework that parallelizes diffusion inference through a draft-and-refine process. DRiffusion employs skip transitions to generate multiple draft states for future timesteps and computes their corresponding noises in parallel, which are then used in the standard denoising process to produce refined results. Theoretically, our method achieves an acceleration rate of $\tfrac{1}{n}$ or $\tfrac{2}{n+1}$, depending on whether the conservative or aggressive mode is used, where $n$ denotes the number of devices. Empirically, DRiffusion attains 1.4$\times$-3.7$\times$ speedup across multiple diffusion models while incur minimal degradation in generation quality: on MS-COCO dataset, both FID and CLIP remain largely on par with those of the original model, while PickScore and HPSv2.1 show only minor average drops of 0.17 and 0.43, respectively. These results verify that DRiffusion delivers substantial acceleration and preserves perceptual quality.
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_25872
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	DRiffusion: Draft-and-Refine Process Parallelizes Diffusion Models with Ease Bai, Runsheng Zhang, Chengyu Deng, Yangdong Machine Learning Diffusion models have achieved remarkable success in generating high-fidelity content but suffer from slow, iterative sampling, resulting in high latency that limits their use in interactive applications. We introduce DRiffusion, a parallel sampling framework that parallelizes diffusion inference through a draft-and-refine process. DRiffusion employs skip transitions to generate multiple draft states for future timesteps and computes their corresponding noises in parallel, which are then used in the standard denoising process to produce refined results. Theoretically, our method achieves an acceleration rate of $\tfrac{1}{n}$ or $\tfrac{2}{n+1}$, depending on whether the conservative or aggressive mode is used, where $n$ denotes the number of devices. Empirically, DRiffusion attains 1.4$\times$-3.7$\times$ speedup across multiple diffusion models while incur minimal degradation in generation quality: on MS-COCO dataset, both FID and CLIP remain largely on par with those of the original model, while PickScore and HPSv2.1 show only minor average drops of 0.17 and 0.43, respectively. These results verify that DRiffusion delivers substantial acceleration and preserves perceptual quality.
title	DRiffusion: Draft-and-Refine Process Parallelizes Diffusion Models with Ease
topic	Machine Learning
url	https://arxiv.org/abs/2603.25872

Similar Items