Saved in:
Bibliographic Details
Main Authors: Li, Shuaiyi, Deng, Yang, Cai, Deng, Lu, Hongyuan, Chen, Liang, Lam, Wai
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.05330
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916432901570560
author Li, Shuaiyi
Deng, Yang
Cai, Deng
Lu, Hongyuan
Chen, Liang
Lam, Wai
author_facet Li, Shuaiyi
Deng, Yang
Cai, Deng
Lu, Hongyuan
Chen, Liang
Lam, Wai
contents As the typical retraining paradigm is unacceptably time- and resource-consuming, researchers are turning to model editing to find an effective way that supports both consecutive and batch scenarios to edit the model behavior directly. Despite all these practical expectations, existing model editing methods fail to realize all of them. Furthermore, the memory demands for such sequential model editing approaches tend to be prohibitive, frequently necessitating an external memory that grows incrementally over time. To cope with these challenges, we propose CoachHooK, a model editing method that simultaneously supports sequential and batch editing. CoachHooK is memory-friendly as it only needs a small amount of it to store several hook layers whose size remains unchanged over time. Experimental results demonstrate the superiority of our method over other batch-supportive model editing methods under both single-round and consecutive batch editing scenarios. Extensive analyses of CoachHooK have been conducted to verify the stability of our method over a number of consecutive steps.
format Preprint
id arxiv_https___arxiv_org_abs_2403_05330
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Consecutive Batch Model Editing with HooK Layers
Li, Shuaiyi
Deng, Yang
Cai, Deng
Lu, Hongyuan
Chen, Liang
Lam, Wai
Computation and Language
As the typical retraining paradigm is unacceptably time- and resource-consuming, researchers are turning to model editing to find an effective way that supports both consecutive and batch scenarios to edit the model behavior directly. Despite all these practical expectations, existing model editing methods fail to realize all of them. Furthermore, the memory demands for such sequential model editing approaches tend to be prohibitive, frequently necessitating an external memory that grows incrementally over time. To cope with these challenges, we propose CoachHooK, a model editing method that simultaneously supports sequential and batch editing. CoachHooK is memory-friendly as it only needs a small amount of it to store several hook layers whose size remains unchanged over time. Experimental results demonstrate the superiority of our method over other batch-supportive model editing methods under both single-round and consecutive batch editing scenarios. Extensive analyses of CoachHooK have been conducted to verify the stability of our method over a number of consecutive steps.
title Consecutive Batch Model Editing with HooK Layers
topic Computation and Language
url https://arxiv.org/abs/2403.05330