Saved in:
| Main Authors: | Yang, Haoyu, Zhang, Zheng, Sathe, Saket |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.10416 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SE-Merging: A Self-Enhanced Approach for Dynamic Model Merging
by: Chen, Zijun, et al.
Published: (2025)
by: Chen, Zijun, et al.
Published: (2025)
Transport and Merge: Cross-Architecture Merging for Large Language Models
by: Cui, Chenhang, et al.
Published: (2026)
by: Cui, Chenhang, et al.
Published: (2026)
Model Merging by Uncertainty-Based Gradient Matching
by: Daheim, Nico, et al.
Published: (2023)
by: Daheim, Nico, et al.
Published: (2023)
Extrapolation Merging: Keep Improving With Extrapolation and Merging
by: Lin, Yiguan, et al.
Published: (2025)
by: Lin, Yiguan, et al.
Published: (2025)
Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging
by: Liu, Deyuan, et al.
Published: (2024)
by: Liu, Deyuan, et al.
Published: (2024)
LED-Merging: Mitigating Safety-Utility Conflicts in Model Merging with Location-Election-Disjoint
by: Ma, Qianli, et al.
Published: (2025)
by: Ma, Qianli, et al.
Published: (2025)
MergeME: Model Merging Techniques for Homogeneous and Heterogeneous MoEs
by: Zhou, Yuhang, et al.
Published: (2025)
by: Zhou, Yuhang, et al.
Published: (2025)
Model Merging for Knowledge Editing
by: Fu, Zichuan, et al.
Published: (2025)
by: Fu, Zichuan, et al.
Published: (2025)
Merge to Mix: Mixing Datasets via Model Merging
by: Tao, Zhixu Silvia, et al.
Published: (2025)
by: Tao, Zhixu Silvia, et al.
Published: (2025)
RCP-Merging: Merging Long Chain-of-Thought Models with Domain-Specific Models by Considering Reasoning Capability as Prior
by: Yang, Junyao, et al.
Published: (2025)
by: Yang, Junyao, et al.
Published: (2025)
Channel Merging: Preserving Specialization for Merged Experts
by: Zhang, Mingyang, et al.
Published: (2024)
by: Zhang, Mingyang, et al.
Published: (2024)
Sens-Merging: Sensitivity-Guided Parameter Balancing for Merging Large Language Models
by: Liu, Shuqi, et al.
Published: (2025)
by: Liu, Shuqi, et al.
Published: (2025)
AdaMergeX: Cross-Lingual Transfer with Large Language Models via Adaptive Adapter Merging
by: Zhao, Yiran, et al.
Published: (2024)
by: Zhao, Yiran, et al.
Published: (2024)
Bagging-Based Model Merging for Robust General Text Embeddings
by: Zhang, Hengran, et al.
Published: (2026)
by: Zhang, Hengran, et al.
Published: (2026)
Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
by: Lu, Zhenyi, et al.
Published: (2024)
by: Lu, Zhenyi, et al.
Published: (2024)
LoRE-Merging: Exploring Low-Rank Estimation For Large Language Model Merging
by: Liu, Zehua, et al.
Published: (2025)
by: Liu, Zehua, et al.
Published: (2025)
Arcee's MergeKit: A Toolkit for Merging Large Language Models
by: Goddard, Charles, et al.
Published: (2024)
by: Goddard, Charles, et al.
Published: (2024)
Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging
by: Yang, Jinluan, et al.
Published: (2025)
by: Yang, Jinluan, et al.
Published: (2025)
K-Merge: Online Continual Merging of Adapters for On-device Large Language Models
by: Shenaj, Donald, et al.
Published: (2025)
by: Shenaj, Donald, et al.
Published: (2025)
DPPA: Pruning Method for Large Language Model to Model Merging
by: Zhu, Yaochen, et al.
Published: (2024)
by: Zhu, Yaochen, et al.
Published: (2024)
Unlocking the Potential of Model Merging for Low-Resource Languages
by: Tao, Mingxu, et al.
Published: (2024)
by: Tao, Mingxu, et al.
Published: (2024)
Activation-Informed Merging of Large Language Models
by: Nobari, Amin Heyrani, et al.
Published: (2025)
by: Nobari, Amin Heyrani, et al.
Published: (2025)
Optimal Brain Iterative Merging: Mitigating Interference in LLM Merging
by: Wang, Zhixiang, et al.
Published: (2025)
by: Wang, Zhixiang, et al.
Published: (2025)
Batching BPE Tokenization Merges
by: Morgan, Alexander P.
Published: (2024)
by: Morgan, Alexander P.
Published: (2024)
The Thinking Spectrum: An Empirical Study of Tunable Reasoning in LLMs through Model Merging
by: Lan, Xiaochong, et al.
Published: (2025)
by: Lan, Xiaochong, et al.
Published: (2025)
HM3: Heterogeneous Multi-Class Model Merging
by: Hackmann, Stefan
Published: (2024)
by: Hackmann, Stefan
Published: (2024)
What Matters for Model Merging at Scale?
by: Yadav, Prateek, et al.
Published: (2024)
by: Yadav, Prateek, et al.
Published: (2024)
Dynamic Model Merging Made Slim
by: Du, Guodong, et al.
Published: (2026)
by: Du, Guodong, et al.
Published: (2026)
Merge-of-Thought Distillation
by: Shen, Zhanming, et al.
Published: (2025)
by: Shen, Zhanming, et al.
Published: (2025)
Have You Merged My Model? On The Robustness of Large Language Model IP Protection Methods Against Model Merging
by: Cong, Tianshuo, et al.
Published: (2024)
by: Cong, Tianshuo, et al.
Published: (2024)
RECALL: REpresentation-aligned Catastrophic-forgetting ALLeviation via Hierarchical Model Merging
by: Wang, Bowen, et al.
Published: (2025)
by: Wang, Bowen, et al.
Published: (2025)
Multi-objective Evolutionary Merging Enables Efficient Reasoning Models
by: Iacobelli, Mario, et al.
Published: (2026)
by: Iacobelli, Mario, et al.
Published: (2026)
Multi-task Code LLMs: Data Mix or Model Merge?
by: Zhu, Mingzhi, et al.
Published: (2026)
by: Zhu, Mingzhi, et al.
Published: (2026)
An Empirical Survey of Model Merging Algorithms for Social Bias Mitigation
by: Shirafuji, Daiki, et al.
Published: (2025)
by: Shirafuji, Daiki, et al.
Published: (2025)
Model Assembly Learning with Heterogeneous Layer Weight Merging
by: Zhang, Yi-Kai, et al.
Published: (2025)
by: Zhang, Yi-Kai, et al.
Published: (2025)
Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to Automation
by: Gauthier-Caron, Thomas, et al.
Published: (2024)
by: Gauthier-Caron, Thomas, et al.
Published: (2024)
Bone Soups: A Seek-and-Soup Model Merging Approach for Controllable Multi-Objective Generation
by: Xie, Guofu, et al.
Published: (2025)
by: Xie, Guofu, et al.
Published: (2025)
Fisher Mask Nodes for Language Model Merging
by: K, Thennal D, et al.
Published: (2024)
by: K, Thennal D, et al.
Published: (2024)
STAR: Spectral Truncation and Rescale for Model Merging
by: Lee, Yu-Ang, et al.
Published: (2025)
by: Lee, Yu-Ang, et al.
Published: (2025)
Parameter Competition Balancing for Model Merging
by: Du, Guodong, et al.
Published: (2024)
by: Du, Guodong, et al.
Published: (2024)
Similar Items
-
SE-Merging: A Self-Enhanced Approach for Dynamic Model Merging
by: Chen, Zijun, et al.
Published: (2025) -
Transport and Merge: Cross-Architecture Merging for Large Language Models
by: Cui, Chenhang, et al.
Published: (2026) -
Model Merging by Uncertainty-Based Gradient Matching
by: Daheim, Nico, et al.
Published: (2023) -
Extrapolation Merging: Keep Improving With Extrapolation and Merging
by: Lin, Yiguan, et al.
Published: (2025) -
Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging
by: Liu, Deyuan, et al.
Published: (2024)