Saved in:
Bibliographic Details
Main Authors: Masuki, Kanta, Ashida, Yuto
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2501.09064
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913652109475840
author Masuki, Kanta
Ashida, Yuto
author_facet Masuki, Kanta
Ashida, Yuto
contents Diffusion models represent a class of generative models that produce data by denoising a sample corrupted by white noise. Despite the success of diffusion models in computer vision, audio synthesis, and point cloud generation, so far they overlook inherent multiscale structures in data and have a slow generation process due to many iteration steps. In physics, the renormalization group offers a fundamental framework for linking different scales and giving an accurate coarse-grained model. Here we introduce a renormalization group-based diffusion model that leverages multiscale nature of data distributions for realizing a high-quality data generation. In the spirit of renormalization group procedures, we define a flow equation that progressively erases data information from fine-scale details to coarse-grained structures. Through reversing the renormalization group flows, our model is able to generate high-quality samples in a coarse-to-fine manner. We validate the versatility of the model through applications to protein structure prediction and image generation. Our model consistently outperforms conventional diffusion models across standard evaluation metrics, enhancing sample quality and/or accelerating sampling speed by an order of magnitude. The proposed method alleviates the need for data-dependent tuning of hyperparameters in the generative diffusion models, showing promise for systematically increasing sample efficiency based on the concept of the renormalization group.
format Preprint
id arxiv_https___arxiv_org_abs_2501_09064
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Generative diffusion model with inverse renormalization group flows
Masuki, Kanta
Ashida, Yuto
Statistical Mechanics
Disordered Systems and Neural Networks
Machine Learning
Applied Physics
Biological Physics
Diffusion models represent a class of generative models that produce data by denoising a sample corrupted by white noise. Despite the success of diffusion models in computer vision, audio synthesis, and point cloud generation, so far they overlook inherent multiscale structures in data and have a slow generation process due to many iteration steps. In physics, the renormalization group offers a fundamental framework for linking different scales and giving an accurate coarse-grained model. Here we introduce a renormalization group-based diffusion model that leverages multiscale nature of data distributions for realizing a high-quality data generation. In the spirit of renormalization group procedures, we define a flow equation that progressively erases data information from fine-scale details to coarse-grained structures. Through reversing the renormalization group flows, our model is able to generate high-quality samples in a coarse-to-fine manner. We validate the versatility of the model through applications to protein structure prediction and image generation. Our model consistently outperforms conventional diffusion models across standard evaluation metrics, enhancing sample quality and/or accelerating sampling speed by an order of magnitude. The proposed method alleviates the need for data-dependent tuning of hyperparameters in the generative diffusion models, showing promise for systematically increasing sample efficiency based on the concept of the renormalization group.
title Generative diffusion model with inverse renormalization group flows
topic Statistical Mechanics
Disordered Systems and Neural Networks
Machine Learning
Applied Physics
Biological Physics
url https://arxiv.org/abs/2501.09064