Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Miksa, Jan, Krukowski, Patryk, Spurek, Przemysław, Rymarczyk, Dawid Damian, Sendera, Marcin
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2605.15737
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914568888909824
author	Miksa, Jan Krukowski, Patryk Spurek, Przemysław Rymarczyk, Dawid Damian Sendera, Marcin
author_facet	Miksa, Jan Krukowski, Patryk Spurek, Przemysław Rymarczyk, Dawid Damian Sendera, Marcin
contents	Machine unlearning has reached a critical bottleneck. As traditional weight-space interventions focus primarily on erasing targeted concepts, they often fail to prevent the unintended suppression of other significant representations. This leads to substantial collateral damage, with essential knowledge being forgotten, because these methods lack formal mathematical guarantees for the preservation of neutral concepts. To avoid degradation, they are frequently forced into conservative updates. We propose BARRIER (Bounded Activation Regions for Robust Information Erasure), a paradigm-shifting framework that shifts the locus of intervention from static model weights to the dynamic geometry of hidden-layer activations. Unlike existing methods, BARRIER employs Interval Arithmetic (IA) on SVD-based projections of the activation space to encapsulate the specific target region within a bounding hypercube. By driving unlearning updates exclusively within this forget interval and mathematically bounding the model response on the complement, we ensure rigorous protection of the retain distribution. This geometric construction transforms the preservation of knowledge from an empirical heuristic into a formal optimization target with a probabilistic tail bound on functional drift. Crucially, this stability permits highly aggressive unlearning updates within the forget region. Empirical evaluations demonstrate that BARRIER matches state-of-the-art trade-offs across classifiers and diffusion models, maximizing targeted concept erasure while safeguarding the integrity of all other representations. Our code is available at https://github.com/OneAndZero24/BARRIER.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_15737
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	BARRIER: Bounded Activation Regions for Robust Information Erasure Miksa, Jan Krukowski, Patryk Spurek, Przemysław Rymarczyk, Dawid Damian Sendera, Marcin Computer Vision and Pattern Recognition Machine unlearning has reached a critical bottleneck. As traditional weight-space interventions focus primarily on erasing targeted concepts, they often fail to prevent the unintended suppression of other significant representations. This leads to substantial collateral damage, with essential knowledge being forgotten, because these methods lack formal mathematical guarantees for the preservation of neutral concepts. To avoid degradation, they are frequently forced into conservative updates. We propose BARRIER (Bounded Activation Regions for Robust Information Erasure), a paradigm-shifting framework that shifts the locus of intervention from static model weights to the dynamic geometry of hidden-layer activations. Unlike existing methods, BARRIER employs Interval Arithmetic (IA) on SVD-based projections of the activation space to encapsulate the specific target region within a bounding hypercube. By driving unlearning updates exclusively within this forget interval and mathematically bounding the model response on the complement, we ensure rigorous protection of the retain distribution. This geometric construction transforms the preservation of knowledge from an empirical heuristic into a formal optimization target with a probabilistic tail bound on functional drift. Crucially, this stability permits highly aggressive unlearning updates within the forget region. Empirical evaluations demonstrate that BARRIER matches state-of-the-art trade-offs across classifiers and diffusion models, maximizing targeted concept erasure while safeguarding the integrity of all other representations. Our code is available at https://github.com/OneAndZero24/BARRIER.
title	BARRIER: Bounded Activation Regions for Robust Information Erasure
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2605.15737

Similar Items