Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Kim, Yejin, Hwang, Dongjun, Cha, Sungmin, Choe, Junsuk
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2601.21794
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915761179590656
author	Kim, Yejin Hwang, Dongjun Cha, Sungmin Choe, Junsuk
author_facet	Kim, Yejin Hwang, Dongjun Cha, Sungmin Choe, Junsuk
contents	Large Vision-Language Models (LVLMs) are widely adopted for their strong multimodal capabilities, yet they raise serious concerns such as privacy leakage and harmful content generation. Machine unlearning has emerged as a promising solution for removing the influence of specific data from trained models. However, existing approaches largely rely on gradient-based optimization, incurring substantial computational costs for large-scale LVLMs. To address this limitation, we propose Knowledge Vector Weakening (KVW), a training-free unlearning method that directly intervenes in the full model without gradient computation. KVW identifies knowledge vectors that are activated during the model's output generation on the forget set and progressively weakens their contributions, thereby preventing the model from exploiting undesirable knowledge. Experiments on the MLLMU and CLEAR benchmarks demonstrate that KVW achieves a stable forget-retain trade-off while significantly improving computational efficiency over gradient-based and LoRA-based unlearning methods.
format	Preprint
id	arxiv_https___arxiv_org_abs_2601_21794
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Knowledge Vector Weakening: Efficient Training-free Unlearning for Large Vision-Language Models Kim, Yejin Hwang, Dongjun Cha, Sungmin Choe, Junsuk Machine Learning Large Vision-Language Models (LVLMs) are widely adopted for their strong multimodal capabilities, yet they raise serious concerns such as privacy leakage and harmful content generation. Machine unlearning has emerged as a promising solution for removing the influence of specific data from trained models. However, existing approaches largely rely on gradient-based optimization, incurring substantial computational costs for large-scale LVLMs. To address this limitation, we propose Knowledge Vector Weakening (KVW), a training-free unlearning method that directly intervenes in the full model without gradient computation. KVW identifies knowledge vectors that are activated during the model's output generation on the forget set and progressively weakens their contributions, thereby preventing the model from exploiting undesirable knowledge. Experiments on the MLLMU and CLEAR benchmarks demonstrate that KVW achieves a stable forget-retain trade-off while significantly improving computational efficiency over gradient-based and LoRA-based unlearning methods.
title	Knowledge Vector Weakening: Efficient Training-free Unlearning for Large Vision-Language Models
topic	Machine Learning
url	https://arxiv.org/abs/2601.21794

Similar Items