Saved in:
Bibliographic Details
Main Authors: Namjoshi, Ruchir, Thadishetty, Nagasai, Kumar, Vignesh, Venkateshwara, Hemanth
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.22004
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914114459140096
author Namjoshi, Ruchir
Thadishetty, Nagasai
Kumar, Vignesh
Venkateshwara, Hemanth
author_facet Namjoshi, Ruchir
Thadishetty, Nagasai
Kumar, Vignesh
Venkateshwara, Hemanth
contents In recent years, diffusion models have demonstrated remarkable success in high-fidelity image synthesis. However, fine-tuning these models for specialized domains, such as medical imaging, remains challenging due to limited domain-specific data and the high computational cost of full model adaptation. In this paper, we introduce Lite-Diff (Lightweight Diffusion Model Adaptation), a novel finetuning approach that integrates lightweight adaptation layers into a frozen diffusion U-Net while enhancing training with a latent morphological autoencoder (for domain-specific latent consistency) and a pixel level discriminator(for adversarial alignment). By freezing weights of the base model and optimizing only small residual adapter modules, LiteDiff significantly reduces the computational overhead and mitigates overfitting, even in minimal-data settings. Additionally, we conduct ablation studies to analyze the effects of selectively integrating adaptation layers in different U-Net blocks, revealing an optimal balance between efficiency and performance. Experiments on three chest X-ray datasets - (1) Kaggle Chest X-Ray Pneumonia, (2) NIH Chest X-ray14 and (3) VinBigData Chest X_ray demonstrate that LiteDiff achieves superior adaptation efficiency compared to naive full fine-tuning. Our framework provides a promising direction for transfer learning in diffusion models, facilitating their deployment in diverse low data domains.
format Preprint
id arxiv_https___arxiv_org_abs_2510_22004
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle LiteDiff
Namjoshi, Ruchir
Thadishetty, Nagasai
Kumar, Vignesh
Venkateshwara, Hemanth
Computer Vision and Pattern Recognition
In recent years, diffusion models have demonstrated remarkable success in high-fidelity image synthesis. However, fine-tuning these models for specialized domains, such as medical imaging, remains challenging due to limited domain-specific data and the high computational cost of full model adaptation. In this paper, we introduce Lite-Diff (Lightweight Diffusion Model Adaptation), a novel finetuning approach that integrates lightweight adaptation layers into a frozen diffusion U-Net while enhancing training with a latent morphological autoencoder (for domain-specific latent consistency) and a pixel level discriminator(for adversarial alignment). By freezing weights of the base model and optimizing only small residual adapter modules, LiteDiff significantly reduces the computational overhead and mitigates overfitting, even in minimal-data settings. Additionally, we conduct ablation studies to analyze the effects of selectively integrating adaptation layers in different U-Net blocks, revealing an optimal balance between efficiency and performance. Experiments on three chest X-ray datasets - (1) Kaggle Chest X-Ray Pneumonia, (2) NIH Chest X-ray14 and (3) VinBigData Chest X_ray demonstrate that LiteDiff achieves superior adaptation efficiency compared to naive full fine-tuning. Our framework provides a promising direction for transfer learning in diffusion models, facilitating their deployment in diverse low data domains.
title LiteDiff
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2510.22004