Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Chang, Hwan, Lee, Hwanhee
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2502.11441
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916763740930048
author	Chang, Hwan Lee, Hwanhee
author_facet	Chang, Hwan Lee, Hwanhee
contents	Large language models (LLMs) risk retaining unauthorized or sensitive information from their training data, which raises privacy concerns. LLM unlearning seeks to mitigate these risks by selectively removing specified data while maintaining overall model performance. However, most existing work focus on methods to achieve effective forgetting and does not provide a detailed analysis of the retain set, the portion of training data that is not targeted for removal. In this paper, we investigate the effects of unlearning on various subsets of the retain set through a case study on entity unlearning. We introduce the Syntactically Similar Neighbor Set, a group of queries that share similar syntactic structures with the data targeted for removal, and show that this subset suffers the greatest performance drop during unlearning. Moreover, when used for regularization, this set not only preserves performance on syntactically similar queries but also delivers comparable or improved results across other data subsets. Our results highlight that syntactic similarity is a critical factor, potentially more so than domain or entity relationships, in achieving effective and practical LLM unlearning.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_11441
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Which Retain Set Matters for LLM Unlearning? A Case Study on Entity Unlearning Chang, Hwan Lee, Hwanhee Computation and Language Large language models (LLMs) risk retaining unauthorized or sensitive information from their training data, which raises privacy concerns. LLM unlearning seeks to mitigate these risks by selectively removing specified data while maintaining overall model performance. However, most existing work focus on methods to achieve effective forgetting and does not provide a detailed analysis of the retain set, the portion of training data that is not targeted for removal. In this paper, we investigate the effects of unlearning on various subsets of the retain set through a case study on entity unlearning. We introduce the Syntactically Similar Neighbor Set, a group of queries that share similar syntactic structures with the data targeted for removal, and show that this subset suffers the greatest performance drop during unlearning. Moreover, when used for regularization, this set not only preserves performance on syntactically similar queries but also delivers comparable or improved results across other data subsets. Our results highlight that syntactic similarity is a critical factor, potentially more so than domain or entity relationships, in achieving effective and practical LLM unlearning.
title	Which Retain Set Matters for LLM Unlearning? A Case Study on Entity Unlearning
topic	Computation and Language
url	https://arxiv.org/abs/2502.11441

Similar Items