| Main Authors: | Li, Zhongyang; Li, Ziyue; Zhou, Tianyi |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.07419 |
Similar Items
R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts
by: Li, Zhongyang, et al.
Published: (2025)
C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing
by: Li, Zhongyang, et al.
Published: (2025)
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
by: Li, Ziyue, et al.
Published: (2024)
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs
by: Li, Ziyue, et al.
Published: (2025)
Grassmannian Mixture-of-Experts: Concentration-Controlled Routing on Subspace Manifolds
by: Shihab, Ibne Farabi, et al.
Published: (2026)
RouteHijack: Routing-Aware Attack on Mixture-of-Experts LLMs
by: Xu, Zhiyuan, et al.
Published: (2026)
Maximum Score Routing For Mixture-of-Experts
by: Dong, Bowen, et al.
Published: (2025)
MoEEdit: Efficient and Routing-Stable Knowledge Editing for Mixture-of-Experts LLMs
by: Gu, Yupu, et al.
Published: (2026)
Selective Sinkhorn Routing for Improved Sparse Mixture of Experts
by: Nguyen, Duc Anh, et al.
Published: (2025)
Improving Routing in Sparse Mixture of Experts with Graph of Tokens
by: Nguyen, Tam, et al.
Published: (2025)
Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test
by: Li, Ziyue, et al.
Published: (2025)
Neural Inhibition Improves Dynamic Routing and Mixture of Experts
by: Zou, Will Y., et al.
Published: (2025)
Sparse Models, Sparse Safety: Unsafe Routes in Mixture-of-Experts LLMs
by: Jiang, Yukun, et al.
Published: (2026)
RASA: Routing-Aware Safety Alignment for Mixture-of-Experts Models
by: Liang, Jiacheng, et al.
Published: (2026)
Multilingual Routing in Mixture-of-Experts
by: Bandarkar, Lucas, et al.
Published: (2025)
Routing-Free Mixture-of-Experts
by: Liu, Yilun, et al.
Published: (2026)
Mixture of Message Passing Experts with Routing Entropy Regularization for Node Classification
by: Chen, Xuanze, et al.
Published: (2025)
Exact Recovery of Community Detection in dependent Gaussian Mixture Models
by: Li, Zhongyang, et al.
Published: (2022)
Mixture-of-Experts Models in Vision: Routing, Optimization, and Generalization
by: Rokah, Adam, et al.
Published: (2026)
Mixture Compressor for Mixture-of-Experts LLMs Gains More
by: Huang, Wei, et al.
Published: (2024)
Stable Routing for Mixture-of-Experts in Class-Incremental Learning
by: Guo, Zirui, et al.
Published: (2026)
Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models
by: Liang, Jingcong, et al.
Published: (2025)
EMoE: Eigenbasis-Guided Routing for Mixture-of-Experts
by: Cheng, Anzhe, et al.
Published: (2026)
Generalizing GNNs with Tokenized Mixture of Experts
by: Guo, Xiaoguang, et al.
Published: (2026)
Adaptive Inverted-Index Routing for Granular Mixtures-of-Experts
by: Kladny, Klaus-Rudolf, et al.
Published: (2026)
Separation and Collaboration: Two-Level Routing Grouped Mixture-of-Experts for Multi-Domain Continual Learning
by: Zhou, Jialu, et al.
Published: (2025)
Towards Generalization-Oriented Models for Vehicle Routing Problems with Mixture-of-Experts
by: Miao, Changhao, et al.
Published: (2026)
Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers
by: Li, Albus Yizhuo, et al.
Published: (2026)
Unraveling the Localized Latents: Learning Stratified Manifold Structures in LLM Embedding Space with Sparse Mixture-of-Experts
by: Li, Xin, et al.
Published: (2025)
Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert Merging
by: Li, Lujun, et al.
Published: (2025)
MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts
by: Zhou, Jianan, et al.
Published: (2024)
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
by: Li, Ming, et al.
Published: (2025)
OrdMoE: Preference Alignment via Hierarchical Expert Group Ranking in Multimodal Mixture-of-Experts LLMs
by: Gao, Yuting, et al.
Published: (2025)
L2R: Low-Rank and Lipschitz-Controlled Routing for Mixture-of-Experts
by: Yang, Minghao, et al.
Published: (2026)
Geometric Mixture-of-Experts with Curvature-Guided Adaptive Routing for Graph Representation Learning
by: Cao, Haifang, et al.
Published: (2026)
Dynamic Expert Sharing: Decoupling Memory from Parallelism in Mixture-of-Experts Diffusion LLMs
by: Chen, Hao Mark, et al.
Published: (2026)
Guided by the Experts: Provable Feature Learning Dynamic of Soft-Routed Mixture-of-Experts
by: Liao, Fangshuo, et al.
Published: (2025)
When Are Experts Misrouted? Counterfactual Routing Analysis in Mixture-of-Experts Language Models
by: Yoon, Youngsik, et al.
Published: (2026)
MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs
by: Chen, Xiaodong, et al.
Published: (2025)
Many-Objective Multi-Solution Transport
by: Li, Ziyue, et al.
Published: (2024)