Saved in:
| Main Authors: | Cheng, Runxi, Xiong, Feng, Wei, Yongxian, Zhu, Wanyun, Yuan, Chun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.08099 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Learn To Learn More Precisely
by: Cheng, Runxi, et al.
Published: (2024)
by: Cheng, Runxi, et al.
Published: (2024)
Multi-Task Model Merging via Adaptive Weight Disentanglement
by: Xiong, Feng, et al.
Published: (2024)
by: Xiong, Feng, et al.
Published: (2024)
Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent
by: Wei, Yongxian, et al.
Published: (2025)
by: Wei, Yongxian, et al.
Published: (2025)
Task Groupings Regularization: Data-Free Meta-Learning with Heterogeneous Pre-trained Models
by: Wei, Yongxian, et al.
Published: (2024)
by: Wei, Yongxian, et al.
Published: (2024)
Task-Distributionally Robust Data-Free Meta-Learning
by: Hu, Zixuan, et al.
Published: (2023)
by: Hu, Zixuan, et al.
Published: (2023)
End-to-End Reaction Field Energy Modeling via Deep Learning based Voxel-to-voxel Transform
by: Wu, Yongxian, et al.
Published: (2024)
by: Wu, Yongxian, et al.
Published: (2024)
Task Singular Vectors: Reducing Task Interference in Model Merging
by: Gargiulo, Antonio Andrea, et al.
Published: (2024)
by: Gargiulo, Antonio Andrea, et al.
Published: (2024)
FREE: Faster and Better Data-Free Meta-Learning
by: Wei, Yongxian, et al.
Published: (2024)
by: Wei, Yongxian, et al.
Published: (2024)
Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications
by: Hu, Zixuan, et al.
Published: (2025)
by: Hu, Zixuan, et al.
Published: (2025)
Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
by: Hu, Zixuan, et al.
Published: (2024)
by: Hu, Zixuan, et al.
Published: (2024)
OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging
by: Wei, Yongxian, et al.
Published: (2025)
by: Wei, Yongxian, et al.
Published: (2025)
Task Vector Quantization for Memory-Efficient Model Merging
by: Kim, Youngeun, et al.
Published: (2025)
by: Kim, Youngeun, et al.
Published: (2025)
Chorus: Harmonizing Context and Sensing Signals for Data-Free Model Customization in IoT
by: Zhang, Liyu, et al.
Published: (2025)
by: Zhang, Liyu, et al.
Published: (2025)
MaD-Mix: Multi-Modal Data Mixtures via Latent Space Coupling for Vision-Language Model Training
by: Xie, Wanyun, et al.
Published: (2026)
by: Xie, Wanyun, et al.
Published: (2026)
Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling
by: Li, Xiang, et al.
Published: (2026)
by: Li, Xiang, et al.
Published: (2026)
HS-STaR: Hierarchical Sampling for Self-Taught Reasoners via Difficulty Estimation and Budget Reallocation
by: Xiong, Feng, et al.
Published: (2025)
by: Xiong, Feng, et al.
Published: (2025)
DisTaC: Conditioning Task Vectors via Distillation for Robust Model Merging
by: Yoshida, Kotaro, et al.
Published: (2025)
by: Yoshida, Kotaro, et al.
Published: (2025)
Tensorized Clustered LoRA Merging for Multi-Task Interference
by: Su, Zhan, et al.
Published: (2025)
by: Su, Zhan, et al.
Published: (2025)
Retraining-Free Merging of Sparse MoE via Hierarchical Clustering
by: Chen, I-Chun, et al.
Published: (2024)
by: Chen, I-Chun, et al.
Published: (2024)
Model Merging via Data-Free Covariance Estimation
by: Hameed, Marawan Gamal Abdel, et al.
Published: (2026)
by: Hameed, Marawan Gamal Abdel, et al.
Published: (2026)
Auto-FlexSwitch: Efficient Dynamic Model Merging via Learnable Task Vector Compression
by: Gao, Junqi, et al.
Published: (2026)
by: Gao, Junqi, et al.
Published: (2026)
Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler
by: Hu, Zixuan, et al.
Published: (2025)
by: Hu, Zixuan, et al.
Published: (2025)
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning
by: Xie, Wanyun, et al.
Published: (2025)
by: Xie, Wanyun, et al.
Published: (2025)
SyMerge: From Non-Interference to Synergistic Merging via Single-Layer Adaptation
by: Jung, Aecheon, et al.
Published: (2024)
by: Jung, Aecheon, et al.
Published: (2024)
Memory Grafting: Scaling Language Model Pre-training via Offline Conditional Memory
by: Cheng, Runxi, et al.
Published: (2026)
by: Cheng, Runxi, et al.
Published: (2026)
Stable Nonconvex-Nonconcave Training via Linear Interpolation
by: Pethick, Thomas, et al.
Published: (2023)
by: Pethick, Thomas, et al.
Published: (2023)
Guided Model Merging for Hybrid Data Learning: Leveraging Centralized Data to Refine Decentralized Models
by: Zhu, Junyi, et al.
Published: (2025)
by: Zhu, Junyi, et al.
Published: (2025)
Merge and Guide: Unifying Model Merging and Guided Decoding for Controllable Multi-Objective Generation
by: Xie, Guofu, et al.
Published: (2025)
by: Xie, Guofu, et al.
Published: (2025)
Mitigating Parameter Interference in Model Merging via Sharpness-Aware Fine-Tuning
by: Lee, Yeoreum, et al.
Published: (2025)
by: Lee, Yeoreum, et al.
Published: (2025)
Optimal Brain Iterative Merging: Mitigating Interference in LLM Merging
by: Wang, Zhixiang, et al.
Published: (2025)
by: Wang, Zhixiang, et al.
Published: (2025)
Mitigating the Backdoor Effect for Multi-Task Model Merging via Safety-Aware Subspace
by: Yang, Jinluan, et al.
Published: (2024)
by: Yang, Jinluan, et al.
Published: (2024)
Conditional Rectified Flow-based End-to-End Rapid Seismic Inversion Method
by: Xu, Haofei, et al.
Published: (2026)
by: Xu, Haofei, et al.
Published: (2026)
Extra-Merge: Tracing the Rank-1 Subspace of Model Merging in Language Model Pre-Training
by: Zhou, Wenjie, et al.
Published: (2026)
by: Zhou, Wenjie, et al.
Published: (2026)
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
by: Aakanksha, et al.
Published: (2024)
by: Aakanksha, et al.
Published: (2024)
Towards Minimizing Feature Drift in Model Merging: Layer-wise Task Vector Fusion for Adaptive Knowledge Integration
by: Sun, Wenju, et al.
Published: (2025)
by: Sun, Wenju, et al.
Published: (2025)
SAMPa: Sharpness-aware Minimization Parallelized
by: Xie, Wanyun, et al.
Published: (2024)
by: Xie, Wanyun, et al.
Published: (2024)
How Big Should a Wireless Foundation Model Be?
by: Cheng, Wei-Lun, et al.
Published: (2026)
by: Cheng, Wei-Lun, et al.
Published: (2026)
Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
by: Lu, Zhenyi, et al.
Published: (2024)
by: Lu, Zhenyi, et al.
Published: (2024)
From Task-Specific Models to Unified Systems: A Review of Model Merging Approaches
by: Ruan, Wei, et al.
Published: (2025)
by: Ruan, Wei, et al.
Published: (2025)
Computational Budget Should Be Considered in Data Selection
by: Wan, Weilin, et al.
Published: (2025)
by: Wan, Weilin, et al.
Published: (2025)
Similar Items
-
Learn To Learn More Precisely
by: Cheng, Runxi, et al.
Published: (2024) -
Multi-Task Model Merging via Adaptive Weight Disentanglement
by: Xiong, Feng, et al.
Published: (2024) -
Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent
by: Wei, Yongxian, et al.
Published: (2025) -
Task Groupings Regularization: Data-Free Meta-Learning with Heterogeneous Pre-trained Models
by: Wei, Yongxian, et al.
Published: (2024) -
Task-Distributionally Robust Data-Free Meta-Learning
by: Hu, Zixuan, et al.
Published: (2023)