Saved in:
| Main Authors: | Zhou, Yuhang, Karamanolakis, Giannis, Soto, Victor, Rumshisky, Anna, Kulkarni, Mayank, Huang, Furong, Ai, Wei, Lu, Jianhua |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.00997 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MergeMoE: Efficient Compression of MoE Models via Expert Output Merging
by: Miao, Ruijie, et al.
Published: (2025)
by: Miao, Ruijie, et al.
Published: (2025)
Interactive Machine Teaching by Labeling Rules and Instances
by: Karamanolakis, Giannis, et al.
Published: (2024)
by: Karamanolakis, Giannis, et al.
Published: (2024)
Retraining-Free Merging of Sparse MoE via Hierarchical Clustering
by: Chen, I-Chun, et al.
Published: (2024)
by: Chen, I-Chun, et al.
Published: (2024)
Faster, Smaller, and Smarter: Task-Aware Expert Merging for Online MoE Inference
by: Han, Ziyi, et al.
Published: (2025)
by: Han, Ziyi, et al.
Published: (2025)
MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning
by: Sinha, Sanchit, et al.
Published: (2024)
by: Sinha, Sanchit, et al.
Published: (2024)
Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert Merging
by: Li, Lujun, et al.
Published: (2025)
by: Li, Lujun, et al.
Published: (2025)
UMM-RM: An Upcycle-and-Merge MoE Reward Model for Mitigating Reward Hacking
by: Fu, Lingling, et al.
Published: (2025)
by: Fu, Lingling, et al.
Published: (2025)
GFairHint: Improving Individual Fairness for Graph Neural Networks via Fairness Hint
by: Xu, Paiheng, et al.
Published: (2023)
by: Xu, Paiheng, et al.
Published: (2023)
Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
by: Lu, Zhenyi, et al.
Published: (2024)
by: Lu, Zhenyi, et al.
Published: (2024)
Sparse Crosscoders for diffing MoEs and Dense models
by: Chaudhari, Marmik, et al.
Published: (2026)
by: Chaudhari, Marmik, et al.
Published: (2026)
DC-Merge: Improving Model Merging with Directional Consistency
by: Zhang, Han-Chen, et al.
Published: (2026)
by: Zhang, Han-Chen, et al.
Published: (2026)
Explore Spurious Correlations at the Concept Level in Language Models for Text Classification
by: Zhou, Yuhang, et al.
Published: (2023)
by: Zhou, Yuhang, et al.
Published: (2023)
Training-free Heterogeneous Model Merging
by: Xu, Zhengqi, et al.
Published: (2024)
by: Xu, Zhengqi, et al.
Published: (2024)
Mix-MoE: Improving Multilingual Machine Translation of Large Language Models through Mixed MoEs
by: Li, Bo, et al.
Published: (2026)
by: Li, Bo, et al.
Published: (2026)
Enhancing Multi-modal Models with Heterogeneous MoE Adapters for Fine-tuning
by: Zhou, Sashuai, et al.
Published: (2025)
by: Zhou, Sashuai, et al.
Published: (2025)
MIN-Merging: Merge the Important Neurons for Model Merging
by: Liang, Yunfei
Published: (2025)
by: Liang, Yunfei
Published: (2025)
Polysemantic Experts, Monosemantic Paths: Routing as Control in MoEs
by: Ye, Charles, et al.
Published: (2026)
by: Ye, Charles, et al.
Published: (2026)
SonicMoE: Accelerating MoE with IO and Tile-aware Optimizations
by: Guo, Wentao, et al.
Published: (2025)
by: Guo, Wentao, et al.
Published: (2025)
CSRec: Rethinking Sequential Recommendation from A Causal Perspective
by: Liu, Xiaoyu, et al.
Published: (2024)
by: Liu, Xiaoyu, et al.
Published: (2024)
FedMerge: Federated Personalization via Model Merging
by: Chen, Shutong, et al.
Published: (2025)
by: Chen, Shutong, et al.
Published: (2025)
ASD-RSD; To Merge or Not to Merge
by: Sinclair, Dorothy
Published: (1971)
by: Sinclair, Dorothy
Published: (1971)
MergeTok: Unified Continuous and Discrete Visual Tokenization via Token Merging
by: Zhang, Luyuan, et al.
Published: (2026)
by: Zhang, Luyuan, et al.
Published: (2026)
PivotMerge: Bridging Heterogeneous Multimodal Pre-training via Post-Alignment Model Merging
by: Shao, Zibo, et al.
Published: (2026)
by: Shao, Zibo, et al.
Published: (2026)
Merge to Mix: Mixing Datasets via Model Merging
by: Tao, Zhixu Silvia, et al.
Published: (2025)
by: Tao, Zhixu Silvia, et al.
Published: (2025)
ATM: Improving Model Merging by Alternating Tuning and Merging
by: Zhou, Luca, et al.
Published: (2024)
by: Zhou, Luca, et al.
Published: (2024)
Merge-of-Thought Distillation
by: Shen, Zhanming, et al.
Published: (2025)
by: Shen, Zhanming, et al.
Published: (2025)
To Merge and Not to Merge: Israel's Union List of Monographs in the Context of Merging Algorithms.
by: Lazinger, Susan S.
Published: (1994)
by: Lazinger, Susan S.
Published: (1994)
OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging
by: Wei, Yongxian, et al.
Published: (2025)
by: Wei, Yongxian, et al.
Published: (2025)
EMR-Merging: Tuning-Free High-Performance Model Merging
by: Huang, Chenyu, et al.
Published: (2024)
by: Huang, Chenyu, et al.
Published: (2024)
Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs
by: Mirvakhabova, Leyla, et al.
Published: (2025)
by: Mirvakhabova, Leyla, et al.
Published: (2025)
Continual Pre-training of MoEs: How robust is your router?
by: Thérien, Benjamin, et al.
Published: (2025)
by: Thérien, Benjamin, et al.
Published: (2025)
MergeDNA: Context-aware Genome Modeling with Dynamic Tokenization through Token Merging
by: Li, Siyuan, et al.
Published: (2025)
by: Li, Siyuan, et al.
Published: (2025)
Extra-Merge: Tracing the Rank-1 Subspace of Model Merging in Language Model Pre-Training
by: Zhou, Wenjie, et al.
Published: (2026)
by: Zhou, Wenjie, et al.
Published: (2026)
Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts
by: Wu, Haoyuan, et al.
Published: (2025)
by: Wu, Haoyuan, et al.
Published: (2025)
Merge Hijacking: Backdoor Attacks to Model Merging of Large Language Models
by: Yuan, Zenghui, et al.
Published: (2025)
by: Yuan, Zenghui, et al.
Published: (2025)
SE-Merging: A Self-Enhanced Approach for Dynamic Model Merging
by: Chen, Zijun, et al.
Published: (2025)
by: Chen, Zijun, et al.
Published: (2025)
DeepMerge: Deep-Learning-Based Region-Merging for Image Segmentation
by: Lv, Xianwei, et al.
Published: (2023)
by: Lv, Xianwei, et al.
Published: (2023)
Extrapolation Merging: Keep Improving With Extrapolation and Merging
by: Lin, Yiguan, et al.
Published: (2025)
by: Lin, Yiguan, et al.
Published: (2025)
Channel Merging: Preserving Specialization for Merged Experts
by: Zhang, Mingyang, et al.
Published: (2024)
by: Zhang, Mingyang, et al.
Published: (2024)
HM3: Heterogeneous Multi-Class Model Merging
by: Hackmann, Stefan
Published: (2024)
by: Hackmann, Stefan
Published: (2024)
Similar Items
-
MergeMoE: Efficient Compression of MoE Models via Expert Output Merging
by: Miao, Ruijie, et al.
Published: (2025) -
Interactive Machine Teaching by Labeling Rules and Instances
by: Karamanolakis, Giannis, et al.
Published: (2024) -
Retraining-Free Merging of Sparse MoE via Hierarchical Clustering
by: Chen, I-Chun, et al.
Published: (2024) -
Faster, Smaller, and Smarter: Task-Aware Expert Merging for Online MoE Inference
by: Han, Ziyi, et al.
Published: (2025) -
MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning
by: Sinha, Sanchit, et al.
Published: (2024)