:: Library Catalog

Bibliographic Details
Main Authors:	Yang, Ruofeng; Li, Yongcan; Jiang, Bo; Chen, Cheng; Li, Shuai
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2601.01475

Similar Items

Elucidating Rectified Flow with Deterministic Sampler: Polynomial Discretization Complexity for Multi and One-step Models
by: Yang, Ruofeng, et al.
Published: (2025)

ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration
by: Yang, Ruofeng, et al.
Published: (2026)

Understanding Representation Learnability of Nonlinear Self-Supervised Learning
by: Yang, Ruofeng, et al.
Published: (2024)

Dynamic Adaptive Shared Experts with Grouped Multi-Head Attention Mixture of Experts
by: Li, Cheng, et al.
Published: (2025)

Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging
by: Shen, Li, et al.
Published: (2024)

Multi-Modal Time Series Prediction via Mixture of Modulated Experts
by: Zhang, Lige, et al.
Published: (2026)

Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert Merging
by: Li, Lujun, et al.
Published: (2025)

Facet-Aware Multi-Head Mixture-of-Experts Model for Sequential Recommendation
by: Liu, Mingrui, et al.
Published: (2024)

Subspace Optimization for Large Language Models with Convergence Guarantees
by: He, Yutong, et al.
Published: (2024)

Towards Convergence Rates for Parameter Estimation in Gaussian-gated Mixture of Experts
by: Nguyen, Huy, et al.
Published: (2023)

MixTTE: Multi-Level Mixture-of-Experts for Scalable and Adaptive Travel Time Estimation
by: Jiang, Wenzhao, et al.
Published: (2026)

Merging Multi-Task Models via Weight-Ensembling Mixture of Experts
by: Tang, Anke, et al.
Published: (2024)

EEGMamba: Bidirectional State Space Model with Mixture of Experts for EEG Multi-task Classification
by: Gui, Yiyu, et al.
Published: (2024)

Grassmannian Mixture-of-Experts: Concentration-Controlled Routing on Subspace Manifolds
by: Shihab, Ibne Farabi, et al.
Published: (2026)

Dimension-Free Convergence of Diffusion Models for Approximate Gaussian Mixtures
by: Li, Gen, et al.
Published: (2025)

PhysVarMix: Physics-Informed Variational Mixture Model for Multi-Modal Trajectory Prediction
by: Li, Haichuan, et al.
Published: (2025)

Efficiently Editing Mixture-of-Experts Models with Compressed Experts
by: He, Yifei, et al.
Published: (2025)

MoE-I$^2$: Compressing Mixture of Experts Models through Inter-Expert Pruning and Intra-Expert Low-Rank Decomposition
by: Yang, Cheng, et al.
Published: (2024)

Convergence Rates for Softmax Gating Mixture of Experts
by: Nguyen, Huy, et al.
Published: (2025)

Language-Conditioned Representations and Mixture-of-Experts Policy for Robust Multi-Task Robotic Manipulation
by: Zhang, Xiucheng, et al.
Published: (2025)

Multi-Head Mixture-of-Experts
by: Wu, Xun, et al.
Published: (2024)

Towards Unified Modeling in Federated Multi-Task Learning via Subspace Decoupling
by: Wei, Yipan, et al.
Published: (2025)

Model Selection and Parameter Estimation of Multi-dimensional Gaussian Mixture Model
by: Liu, Xinyu, et al.
Published: (2026)

Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts
by: Wang, Haoxiang, et al.
Published: (2024)

Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning
by: Zhao, Ziyu, et al.
Published: (2025)

M$^3$TN: Multi-gate Mixture-of-Experts based Multi-valued Treatment Network for Uplift Modeling
by: Sun, Zexu, et al.
Published: (2024)

SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models
by: Tang, Anke, et al.
Published: (2024)

Mitigating the Backdoor Effect for Multi-Task Model Merging via Safety-Aware Subspace
by: Yang, Jinluan, et al.
Published: (2024)

FarSkip-Collective: Unhobbling Blocking Communication in Mixture of Experts Models
by: Dukler, Yonatan, et al.
Published: (2025)

Unveiling Hidden Collaboration within Mixture-of-Experts in Large Language Models
by: Tang, Yuanbo, et al.
Published: (2025)

HMoE: Heterogeneous Mixture of Experts for Language Modeling
by: Wang, An, et al.
Published: (2024)

Modality Interactive Mixture-of-Experts for Fake News Detection
by: Liu, Yifan, et al.
Published: (2025)

Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models
by: Wang, Zihan, et al.
Published: (2025)

Adaptive Shared Experts with LoRA-Based Mixture of Experts for Multi-Task Learning
by: Yang, Minghao, et al.
Published: (2025)

T-REX: Mixture-of-Rank-One-Experts with Semantic-aware Intuition for Multi-task Large Language Model Finetuning
by: Zhang, Rongyu, et al.
Published: (2024)

Mixture-of-Experts with Gradient Conflict-Driven Subspace Topology Pruning for Emergent Modularity
by: Gan, Yuxing, et al.
Published: (2025)

Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
by: Lu, Xudong, et al.
Published: (2024)

Sparse Models, Sparse Safety: Unsafe Routes in Mixture-of-Experts LLMs
by: Jiang, Yukun, et al.
Published: (2026)

Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts
by: Yun, Sukwon, et al.
Published: (2024)

Separation and Collaboration: Two-Level Routing Grouped Mixture-of-Experts for Multi-Domain Continual Learning
by: Zhou, Jialu, et al.
Published: (2025)