Saved in:
Bibliographic Details
Main Authors: Thakral, Kartik, Glaser, Tamar, Hassner, Tal, Vatsa, Mayank, Singh, Richa
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.13769
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916659378257920
author Thakral, Kartik
Glaser, Tamar
Hassner, Tal
Vatsa, Mayank
Singh, Richa
author_facet Thakral, Kartik
Glaser, Tamar
Hassner, Tal
Vatsa, Mayank
Singh, Richa
contents How can we effectively unlearn selected concepts from pre-trained generative foundation models without resorting to extensive retraining? This research introduces `continual unlearning', a novel paradigm that enables the targeted removal of multiple specific concepts from foundational generative models, incrementally. We propose Decremental Unlearning without Generalization Erosion (DUGE) algorithm which selectively unlearns the generation of undesired concepts while preserving the generation of related, non-targeted concepts and alleviating generalization erosion. For this, DUGE targets three losses: a cross-attention loss that steers the focus towards images devoid of the target concept; a prior-preservation loss that safeguards knowledge related to non-target concepts; and a regularization loss that prevents the model from suffering from generalization erosion. Experimental results demonstrate the ability of the proposed approach to exclude certain concepts without compromising the overall integrity and performance of the model. This offers a pragmatic solution for refining generative models, adeptly handling the intricacies of model training and concept management lowering the risks of copyright infringement, personal or licensed material misuse, and replication of distinctive artistic styles. Importantly, it maintains the non-targeted concepts, thereby safeguarding the model's core capabilities and effectiveness.
format Preprint
id arxiv_https___arxiv_org_abs_2503_13769
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Continual Unlearning for Foundational Text-to-Image Models without Generalization Erosion
Thakral, Kartik
Glaser, Tamar
Hassner, Tal
Vatsa, Mayank
Singh, Richa
Computer Vision and Pattern Recognition
How can we effectively unlearn selected concepts from pre-trained generative foundation models without resorting to extensive retraining? This research introduces `continual unlearning', a novel paradigm that enables the targeted removal of multiple specific concepts from foundational generative models, incrementally. We propose Decremental Unlearning without Generalization Erosion (DUGE) algorithm which selectively unlearns the generation of undesired concepts while preserving the generation of related, non-targeted concepts and alleviating generalization erosion. For this, DUGE targets three losses: a cross-attention loss that steers the focus towards images devoid of the target concept; a prior-preservation loss that safeguards knowledge related to non-target concepts; and a regularization loss that prevents the model from suffering from generalization erosion. Experimental results demonstrate the ability of the proposed approach to exclude certain concepts without compromising the overall integrity and performance of the model. This offers a pragmatic solution for refining generative models, adeptly handling the intricacies of model training and concept management lowering the risks of copyright infringement, personal or licensed material misuse, and replication of distinctive artistic styles. Importantly, it maintains the non-targeted concepts, thereby safeguarding the model's core capabilities and effectiveness.
title Continual Unlearning for Foundational Text-to-Image Models without Generalization Erosion
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2503.13769