Saved in:
| Main Authors: | Qu, Xingyu, Horvath, Samuel |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.05966 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Superpose Task-specific Features for Model Merging
by: Qiu, Haiquan, et al.
Published: (2025)
by: Qiu, Haiquan, et al.
Published: (2025)
FeatCal: Feature Calibration for Post-Merging Models
by: Gu, Yanggan, et al.
Published: (2026)
by: Gu, Yanggan, et al.
Published: (2026)
Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
by: Lu, Zhenyi, et al.
Published: (2024)
by: Lu, Zhenyi, et al.
Published: (2024)
MIN-Merging: Merge the Important Neurons for Model Merging
by: Liang, Yunfei
Published: (2025)
by: Liang, Yunfei
Published: (2025)
Can Muon Fine-tune Adam-Pretrained Models?
by: Qu, Xingyu, et al.
Published: (2026)
by: Qu, Xingyu, et al.
Published: (2026)
On Vanishing Variance in Transformer Length Generalization
by: Li, Ruining, et al.
Published: (2025)
by: Li, Ruining, et al.
Published: (2025)
FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization
by: Chen, Hao Mark, et al.
Published: (2025)
by: Chen, Hao Mark, et al.
Published: (2025)
PSO-Merging: Merging Models Based on Particle Swarm Optimization
by: Zhang, Kehao, et al.
Published: (2025)
by: Zhang, Kehao, et al.
Published: (2025)
Bayesian Model Merging
by: Li, Kaiyang, et al.
Published: (2026)
by: Li, Kaiyang, et al.
Published: (2026)
Vanishing Contributions: A Unified Framework for Smooth and Iterative Model Compression
by: Nikiforos, Lorenzo, et al.
Published: (2025)
by: Nikiforos, Lorenzo, et al.
Published: (2025)
Vanishing Gradients in Reinforcement Finetuning of Language Models
by: Razin, Noam, et al.
Published: (2023)
by: Razin, Noam, et al.
Published: (2023)
Merging Smarter, Generalizing Better: Enhancing Model Merging on OOD Data
by: Zhang, Bingjie, et al.
Published: (2025)
by: Zhang, Bingjie, et al.
Published: (2025)
Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking
by: Chaichana, Yuatyong, et al.
Published: (2025)
by: Chaichana, Yuatyong, et al.
Published: (2025)
Model Merging: Foundations and Algorithms
by: Crisostomi, Donato
Published: (2026)
by: Crisostomi, Donato
Published: (2026)
Model Merging in the Essential Subspace
by: Li, Longhua, et al.
Published: (2026)
by: Li, Longhua, et al.
Published: (2026)
Redefining Contributions: Shapley-Driven Federated Learning
by: Tastan, Nurbek, et al.
Published: (2024)
by: Tastan, Nurbek, et al.
Published: (2024)
CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging
by: Sun, Wenju, et al.
Published: (2025)
by: Sun, Wenju, et al.
Published: (2025)
CALM: Consensus-Aware Localized Merging for Multi-Task Learning
by: Yan, Kunda, et al.
Published: (2025)
by: Yan, Kunda, et al.
Published: (2025)
Beyond Interpretability: The Gains of Feature Monosemanticity on Model Robustness
by: Zhang, Qi, et al.
Published: (2024)
by: Zhang, Qi, et al.
Published: (2024)
Towards Minimizing Feature Drift in Model Merging: Layer-wise Task Vector Fusion for Adaptive Knowledge Integration
by: Sun, Wenju, et al.
Published: (2025)
by: Sun, Wenju, et al.
Published: (2025)
Markov Chain Decoders Overcome the Heavy-Tail Limitations of Lipschitz Generative Models
by: Ziani, Abdelhakim, et al.
Published: (2026)
by: Ziani, Abdelhakim, et al.
Published: (2026)
These Are Not All the Features You Are Looking For: A Fundamental Bottleneck in Supervised Pretraining
by: Yang, Xingyu Alice, et al.
Published: (2025)
by: Yang, Xingyu Alice, et al.
Published: (2025)
MergeMix: Optimizing Mid-Training Data Mixtures via Learnable Model Merging
by: Wang, Jiapeng, et al.
Published: (2026)
by: Wang, Jiapeng, et al.
Published: (2026)
BD-Merging: Bias-Aware Dynamic Model Merging with Evidence-Guided Contrastive Learning
by: Xie, Yuhan, et al.
Published: (2026)
by: Xie, Yuhan, et al.
Published: (2026)
Training-free Heterogeneous Model Merging
by: Xu, Zhengqi, et al.
Published: (2024)
by: Xu, Zhengqi, et al.
Published: (2024)
Revisiting Weight Averaging for Model Merging
by: Choi, Jiho, et al.
Published: (2024)
by: Choi, Jiho, et al.
Published: (2024)
Multi-Level Collaboration in Model Merging
by: Li, Qi, et al.
Published: (2025)
by: Li, Qi, et al.
Published: (2025)
Sparsity-Aware Evolution for Model Merging
by: Zhang, Huan, et al.
Published: (2026)
by: Zhang, Huan, et al.
Published: (2026)
Fine, I'll Merge It Myself: A Multi-Fidelity Framework for Automated Model Merging
by: Su, Guinan, et al.
Published: (2025)
by: Su, Guinan, et al.
Published: (2025)
Fair Streaming Feature Selection
by: Duan, Zhangling, et al.
Published: (2024)
by: Duan, Zhangling, et al.
Published: (2024)
Diagnosing Transformers: Illuminating Feature Spaces for Clinical Decision-Making
by: Hsu, Aliyah R., et al.
Published: (2023)
by: Hsu, Aliyah R., et al.
Published: (2023)
Merge to Mix: Mixing Datasets via Model Merging
by: Tao, Zhixu Silvia, et al.
Published: (2025)
by: Tao, Zhixu Silvia, et al.
Published: (2025)
MergeIT: From Selection to Merging for Efficient Instruction Tuning
by: Cai, Hongyi, et al.
Published: (2025)
by: Cai, Hongyi, et al.
Published: (2025)
Non-Uniform Parameter-Wise Model Merging
by: Camacho, Albert Manuel Orozco, et al.
Published: (2024)
by: Camacho, Albert Manuel Orozco, et al.
Published: (2024)
NegMerge: Sign-Consensual Weight Merging for Machine Unlearning
by: Kim, Hyo Seo, et al.
Published: (2024)
by: Kim, Hyo Seo, et al.
Published: (2024)
Vanishing Bias Heuristic-guided Reinforcement Learning Algorithm
by: Li, Qinru, et al.
Published: (2023)
by: Li, Qinru, et al.
Published: (2023)
MergeDNA: Context-aware Genome Modeling with Dynamic Tokenization through Token Merging
by: Li, Siyuan, et al.
Published: (2025)
by: Li, Siyuan, et al.
Published: (2025)
SimMerge: Learning to Select Merge Operators from Similarity Signals
by: Bolton, Oliver, et al.
Published: (2026)
by: Bolton, Oliver, et al.
Published: (2026)
ES-Merging: Biological MLLM Merging via Embedding Space Signals
by: Lee, Wonbin, et al.
Published: (2026)
by: Lee, Wonbin, et al.
Published: (2026)
Arcee's MergeKit: A Toolkit for Merging Large Language Models
by: Goddard, Charles, et al.
Published: (2024)
by: Goddard, Charles, et al.
Published: (2024)
Similar Items
-
Superpose Task-specific Features for Model Merging
by: Qiu, Haiquan, et al.
Published: (2025) -
FeatCal: Feature Calibration for Post-Merging Models
by: Gu, Yanggan, et al.
Published: (2026) -
Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
by: Lu, Zhenyi, et al.
Published: (2024) -
MIN-Merging: Merge the Important Neurons for Model Merging
by: Liang, Yunfei
Published: (2025) -
Can Muon Fine-tune Adam-Pretrained Models?
by: Qu, Xingyu, et al.
Published: (2026)