Saved in:
Bibliographic Details
Main Authors: Wang, Bin, Wang, Fan, Wang, Pingping, Cong, Jinyu, Yu, Yang, Yin, Yilong, Han, Zhongyi, Wei, Benzheng
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.17692
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912931337207808
author Wang, Bin
Wang, Fan
Wang, Pingping
Cong, Jinyu
Yu, Yang
Yin, Yilong
Han, Zhongyi
Wei, Benzheng
author_facet Wang, Bin
Wang, Fan
Wang, Pingping
Cong, Jinyu
Yu, Yang
Yin, Yilong
Han, Zhongyi
Wei, Benzheng
contents In this paper, we introduce \textbf{agentic unlearning} which removes specified information from both model parameters and persistent memory in agents with closed-loop interaction. Existing unlearning methods target parameters alone, leaving two critical gaps: (i) parameter-memory backflow, where retrieval reactivates parametric remnants or memory artifacts reintroduce sensitive content, and (ii) the absence of a unified strategy that covers both parameter and memory pathways. We present Synchronized Backflow Unlearning (SBU), a framework that unlearns jointly across parameter and memory pathways. The memory pathway performs dependency closure-based unlearning that prunes isolated entities while logically invalidating shared artifacts. The parameter pathway employs stochastic reference alignment to guide model outputs toward a high-entropy prior. These pathways are integrated via a synchronized dual-update protocol, forming a closed-loop mechanism where memory unlearning and parametric suppression reinforce each other to prevent cross-pathway recontamination. Experiments on medical QA benchmarks show that SBU reduces traces of targeted private information across both pathways with limited degradation on retained data.
format Preprint
id arxiv_https___arxiv_org_abs_2602_17692
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Agentic Unlearning: When LLM Agent Meets Machine Unlearning
Wang, Bin
Wang, Fan
Wang, Pingping
Cong, Jinyu
Yu, Yang
Yin, Yilong
Han, Zhongyi
Wei, Benzheng
Machine Learning
Artificial Intelligence
In this paper, we introduce \textbf{agentic unlearning} which removes specified information from both model parameters and persistent memory in agents with closed-loop interaction. Existing unlearning methods target parameters alone, leaving two critical gaps: (i) parameter-memory backflow, where retrieval reactivates parametric remnants or memory artifacts reintroduce sensitive content, and (ii) the absence of a unified strategy that covers both parameter and memory pathways. We present Synchronized Backflow Unlearning (SBU), a framework that unlearns jointly across parameter and memory pathways. The memory pathway performs dependency closure-based unlearning that prunes isolated entities while logically invalidating shared artifacts. The parameter pathway employs stochastic reference alignment to guide model outputs toward a high-entropy prior. These pathways are integrated via a synchronized dual-update protocol, forming a closed-loop mechanism where memory unlearning and parametric suppression reinforce each other to prevent cross-pathway recontamination. Experiments on medical QA benchmarks show that SBU reduces traces of targeted private information across both pathways with limited degradation on retained data.
title Agentic Unlearning: When LLM Agent Meets Machine Unlearning
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2602.17692