Saved in:
Bibliographic Details
Main Authors: Mandal, Pinak, Gottwald, Georg A.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2506.04712
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909805585629184
author Mandal, Pinak
Gottwald, Georg A.
author_facet Mandal, Pinak
Gottwald, Georg A.
contents As generative models become increasingly powerful and pervasive, the ability to unlearn specific data, whether due to privacy concerns, legal requirements, or the correction of harmful content, has become increasingly important. Unlike in conventional training, where data are accumulated and knowledge is reinforced, unlearning aims to selectively remove the influence of particular data points without costly retraining from scratch. To be effective and reliable, such algorithms need to achieve (i) forgetting of the undesired data, (ii) preservation of the quality of the generation, (iii) preservation of the influence of the desired training data on the model parameters, and (iv) small number of training steps. We propose fast unlearning algorithms based on loss gradient orthogonalization for unconditional and conditional generative models. We show that our algorithms are able to forget data while maintaining the fidelity of the original model. On standard image benchmarks, our algorithms achieve orders of magnitude faster unlearning times than their predecessors, such as gradient surgery. We demonstrate our algorithms with datasets of increasing complexity (MNIST, CelebA and ImageNet-1K) and for generative models of increasing complexity (VAEs and diffusion transformers).
format Preprint
id arxiv_https___arxiv_org_abs_2506_04712
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle UNO: Unlearning via Orthogonalization in Generative models
Mandal, Pinak
Gottwald, Georg A.
Machine Learning
Artificial Intelligence
As generative models become increasingly powerful and pervasive, the ability to unlearn specific data, whether due to privacy concerns, legal requirements, or the correction of harmful content, has become increasingly important. Unlike in conventional training, where data are accumulated and knowledge is reinforced, unlearning aims to selectively remove the influence of particular data points without costly retraining from scratch. To be effective and reliable, such algorithms need to achieve (i) forgetting of the undesired data, (ii) preservation of the quality of the generation, (iii) preservation of the influence of the desired training data on the model parameters, and (iv) small number of training steps. We propose fast unlearning algorithms based on loss gradient orthogonalization for unconditional and conditional generative models. We show that our algorithms are able to forget data while maintaining the fidelity of the original model. On standard image benchmarks, our algorithms achieve orders of magnitude faster unlearning times than their predecessors, such as gradient surgery. We demonstrate our algorithms with datasets of increasing complexity (MNIST, CelebA and ImageNet-1K) and for generative models of increasing complexity (VAEs and diffusion transformers).
title UNO: Unlearning via Orthogonalization in Generative models
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2506.04712