Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Staniszewski, Łukasz, Zaleska, Katarzyna, Deja, Kamil
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2410.04891
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913734133284864
author	Staniszewski, Łukasz Zaleska, Katarzyna Deja, Kamil
author_facet	Staniszewski, Łukasz Zaleska, Katarzyna Deja, Kamil
contents	Recent personalization methods for diffusion models, such as Dreambooth and LoRA, allow fine-tuning pre-trained models to generate new concepts. However, applying these techniques across consecutive tasks in order to include, e.g., new objects or styles, leads to a forgetting of previous knowledge due to mutual interference between their adapters. In this work, we tackle the problem of continual customization under a rigorous regime with no access to past tasks' adapters. In such a scenario, we investigate how different adapters' initialization and merging methods can improve the quality of the final model. To that end, we evaluate the naive continual fine-tuning of customized models and compare this approach with three methods for consecutive adapters' training: sequentially merging new adapters, merging orthogonally initialized adapters, and updating only relevant task-specific weights. In our experiments, we show that the proposed techniques mitigate forgetting when compared to the naive approach. In our studies, we show different traits of selected techniques and their effect on the plasticity and stability of the continually adapted model. Repository with the code is available at https://github.com/luk-st/continual-lora.
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_04891
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Low-Rank Continual Personalization of Diffusion Models Staniszewski, Łukasz Zaleska, Katarzyna Deja, Kamil Machine Learning Recent personalization methods for diffusion models, such as Dreambooth and LoRA, allow fine-tuning pre-trained models to generate new concepts. However, applying these techniques across consecutive tasks in order to include, e.g., new objects or styles, leads to a forgetting of previous knowledge due to mutual interference between their adapters. In this work, we tackle the problem of continual customization under a rigorous regime with no access to past tasks' adapters. In such a scenario, we investigate how different adapters' initialization and merging methods can improve the quality of the final model. To that end, we evaluate the naive continual fine-tuning of customized models and compare this approach with three methods for consecutive adapters' training: sequentially merging new adapters, merging orthogonally initialized adapters, and updating only relevant task-specific weights. In our experiments, we show that the proposed techniques mitigate forgetting when compared to the naive approach. In our studies, we show different traits of selected techniques and their effect on the plasticity and stability of the continually adapted model. Repository with the code is available at https://github.com/luk-st/continual-lora.
title	Low-Rank Continual Personalization of Diffusion Models
topic	Machine Learning
url	https://arxiv.org/abs/2410.04891

Similar Items