Enregistré dans:
Détails bibliographiques
Auteurs principaux: Yang, Zongzhen, Qi, Binhang, Sun, Hailong, Long, Wenrui, Zhao, Ruobing, Gao, Xiang
Format: Preprint
Publié: 2025
Sujets:
Accès en ligne:https://arxiv.org/abs/2503.01874
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866929740298846208
author Yang, Zongzhen
Qi, Binhang
Sun, Hailong
Long, Wenrui
Zhao, Ruobing
Gao, Xiang
author_facet Yang, Zongzhen
Qi, Binhang
Sun, Hailong
Long, Wenrui
Zhao, Ruobing
Gao, Xiang
contents Model merging based on task vectors, i.e., the parameter differences between fine-tuned models and a shared base model, provides an efficient way to integrate multiple task-specific models into a multitask model without retraining. Recent works have endeavored to address the conflicts between task vectors, one of the significant challenges faced by model merging, through sparsification; however, two issues significantly limit their performance: high parameter overlap and unbalanced weight distribution. To address these issues, we propose a simple, yet effective framework called CABS (Conflict-Aware and Balanced Sparsification), consisting of Conflict-Aware Sparsification (CA) and Balanced Sparsification (BS). CA can reduce parameter overlap by applying masks during sequential pruning, ensuring that each task vector retains distinct, non-overlapping parameters. BS leverages $n$: $m$ pruning to preserve critical weights while maintaining an even distribution across layers. Our comprehensive experiments demonstrate that CABS outperforms state-of-the-art methods across diverse tasks and model sizes.
format Preprint
id arxiv_https___arxiv_org_abs_2503_01874
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging
Yang, Zongzhen
Qi, Binhang
Sun, Hailong
Long, Wenrui
Zhao, Ruobing
Gao, Xiang
Machine Learning
Artificial Intelligence
Model merging based on task vectors, i.e., the parameter differences between fine-tuned models and a shared base model, provides an efficient way to integrate multiple task-specific models into a multitask model without retraining. Recent works have endeavored to address the conflicts between task vectors, one of the significant challenges faced by model merging, through sparsification; however, two issues significantly limit their performance: high parameter overlap and unbalanced weight distribution. To address these issues, we propose a simple, yet effective framework called CABS (Conflict-Aware and Balanced Sparsification), consisting of Conflict-Aware Sparsification (CA) and Balanced Sparsification (BS). CA can reduce parameter overlap by applying masks during sequential pruning, ensuring that each task vector retains distinct, non-overlapping parameters. BS leverages $n$: $m$ pruning to preserve critical weights while maintaining an even distribution across layers. Our comprehensive experiments demonstrate that CABS outperforms state-of-the-art methods across diverse tasks and model sizes.
title CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2503.01874