Guardado en:
Detalles Bibliográficos
Autores principales: Yin, Bo, Hu, Xiaobin, Zhou, Xingyu, Jiang, Peng-Tao, Liao, Yue, Zhu, Junwei, Zhang, Jiangning, Tai, Ying, Wang, Chengjie, Yan, Shuicheng
Formato: Preprint
Publicado: 2025
Materias:
Acceso en línea:https://arxiv.org/abs/2511.17979
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
_version_ 1866915632921968640
author Yin, Bo
Hu, Xiaobin
Zhou, Xingyu
Jiang, Peng-Tao
Liao, Yue
Zhu, Junwei
Zhang, Jiangning
Tai, Ying
Wang, Chengjie
Yan, Shuicheng
author_facet Yin, Bo
Hu, Xiaobin
Zhou, Xingyu
Jiang, Peng-Tao
Liao, Yue
Zhu, Junwei
Zhang, Jiangning
Tai, Ying
Wang, Chengjie
Yan, Shuicheng
contents Diffusion models have achieved remarkable success in generative modeling, yet how to effectively adapt large pretrained models to new tasks remains challenging. We revisit the reconstruction behavior of diffusion models during denoising to unveil the underlying frequency energy mechanism governing this process. Building upon this observation, we propose FeRA, a frequency driven fine tuning framework that aligns parameter updates with the intrinsic frequency energy progression of diffusion. FeRA establishes a comprehensive frequency energy framework for effective diffusion adaptation fine tuning, comprising three synergistic components: (i) a compact frequency energy indicator that characterizes the latent bandwise energy distribution, (ii) a soft frequency router that adaptively fuses multiple frequency specific adapter experts, and (iii) a frequency energy consistency regularization that stabilizes diffusion optimization and ensures coherent adaptation across bands. Routing operates in both training and inference, with inference time routing dynamically determined by the latent frequency energy. It integrates seamlessly with adapter based tuning schemes and generalizes well across diffusion backbones and resolutions. By aligning adaptation with the frequency energy mechanism, FeRA provides a simple, stable, and compatible paradigm for effective and robust diffusion model adaptation.
format Preprint
id arxiv_https___arxiv_org_abs_2511_17979
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle FeRA: Frequency-Energy Constrained Routing for Effective Diffusion Adaptation Fine-Tuning
Yin, Bo
Hu, Xiaobin
Zhou, Xingyu
Jiang, Peng-Tao
Liao, Yue
Zhu, Junwei
Zhang, Jiangning
Tai, Ying
Wang, Chengjie
Yan, Shuicheng
Computer Vision and Pattern Recognition
Diffusion models have achieved remarkable success in generative modeling, yet how to effectively adapt large pretrained models to new tasks remains challenging. We revisit the reconstruction behavior of diffusion models during denoising to unveil the underlying frequency energy mechanism governing this process. Building upon this observation, we propose FeRA, a frequency driven fine tuning framework that aligns parameter updates with the intrinsic frequency energy progression of diffusion. FeRA establishes a comprehensive frequency energy framework for effective diffusion adaptation fine tuning, comprising three synergistic components: (i) a compact frequency energy indicator that characterizes the latent bandwise energy distribution, (ii) a soft frequency router that adaptively fuses multiple frequency specific adapter experts, and (iii) a frequency energy consistency regularization that stabilizes diffusion optimization and ensures coherent adaptation across bands. Routing operates in both training and inference, with inference time routing dynamically determined by the latent frequency energy. It integrates seamlessly with adapter based tuning schemes and generalizes well across diffusion backbones and resolutions. By aligning adaptation with the frequency energy mechanism, FeRA provides a simple, stable, and compatible paradigm for effective and robust diffusion model adaptation.
title FeRA: Frequency-Energy Constrained Routing for Effective Diffusion Adaptation Fine-Tuning
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2511.17979