Saved in:
Bibliographic Details
Main Authors: Wang, Yangfan, Sun, Tianyang, Tang, Chen, Liu, Jie, Cai, Wei, Jiang, Jingchi
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.11214
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914468988977152
author Wang, Yangfan
Sun, Tianyang
Tang, Chen
Liu, Jie
Cai, Wei
Jiang, Jingchi
author_facet Wang, Yangfan
Sun, Tianyang
Tang, Chen
Liu, Jie
Cai, Wei
Jiang, Jingchi
contents Lifelong model editing (LME) aims to sequentially rectify outdated or inaccurate knowledge in deployed LLMs while minimizing side effects on unrelated inputs. However, existing approaches typically apply parameter perturbations to a static and dense set of LLM layers for all editing instances. This practice is counter-intuitive, as we hypothesize that different pieces of knowledge are stored in distinct layers of the model. Neglecting this layer-wise specificity can impede adaptability in integrating new knowledge and result in catastrophic forgetting for both general and previously edited knowledge. To address this, we propose HiEdit, a hierarchical reinforcement learning framework that adaptively identifies the most knowledge-relevant layers for each editing instance. By enabling dynamic, instance-aware layer selection and incorporating an intrinsic reward for sparsity, HiEdit achieves precise, localized updates. Experiments on various LLMs show that HiEdit boosts the performance of the competitive RLEdit by an average of 8.48% with perturbing only half of the layers per edit. Our code is available at: https://github.com/yangfanww/hiedit.
format Preprint
id arxiv_https___arxiv_org_abs_2604_11214
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle HiEdit: Lifelong Model Editing with Hierarchical Reinforcement Learning
Wang, Yangfan
Sun, Tianyang
Tang, Chen
Liu, Jie
Cai, Wei
Jiang, Jingchi
Computation and Language
Lifelong model editing (LME) aims to sequentially rectify outdated or inaccurate knowledge in deployed LLMs while minimizing side effects on unrelated inputs. However, existing approaches typically apply parameter perturbations to a static and dense set of LLM layers for all editing instances. This practice is counter-intuitive, as we hypothesize that different pieces of knowledge are stored in distinct layers of the model. Neglecting this layer-wise specificity can impede adaptability in integrating new knowledge and result in catastrophic forgetting for both general and previously edited knowledge. To address this, we propose HiEdit, a hierarchical reinforcement learning framework that adaptively identifies the most knowledge-relevant layers for each editing instance. By enabling dynamic, instance-aware layer selection and incorporating an intrinsic reward for sparsity, HiEdit achieves precise, localized updates. Experiments on various LLMs show that HiEdit boosts the performance of the competitive RLEdit by an average of 8.48% with perturbing only half of the layers per edit. Our code is available at: https://github.com/yangfanww/hiedit.
title HiEdit: Lifelong Model Editing with Hierarchical Reinforcement Learning
topic Computation and Language
url https://arxiv.org/abs/2604.11214