Saved in:
| Main Authors: | Su, Ye, Liu, Yong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.03577 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts
by: Hendawy, Ahmed, et al.
Published: (2023)
by: Hendawy, Ahmed, et al.
Published: (2023)
Dynamic Expert Quantization for Scalable Mixture-of-Experts Inference
by: Chu, Kexin, et al.
Published: (2025)
by: Chu, Kexin, et al.
Published: (2025)
Speculating Experts Accelerates Inference for Mixture-of-Experts
by: Madan, Vivan, et al.
Published: (2026)
by: Madan, Vivan, et al.
Published: (2026)
Sparse Orthogonal Variational Inference for Gaussian Processes
by: Shi, Jiaxin, et al.
Published: (2019)
by: Shi, Jiaxin, et al.
Published: (2019)
Accelerating Mixture-of-Expert Inference with Adaptive Expert Split Mechanism
by: Yan, Jiaming, et al.
Published: (2025)
by: Yan, Jiaming, et al.
Published: (2025)
A Unified Virtual Mixture-of-Experts Framework:Enhanced Inference and Hallucination Mitigation in Single-Model System
by: Liu, Mingyan
Published: (2025)
by: Liu, Mingyan
Published: (2025)
Prediction-powered Inference by Mixture of Experts
by: Gu, Yanwu, et al.
Published: (2026)
by: Gu, Yanwu, et al.
Published: (2026)
Variational Inference with Mixtures of Isotropic Gaussians
by: Petit-Talamon, Marguerite, et al.
Published: (2025)
by: Petit-Talamon, Marguerite, et al.
Published: (2025)
Theory of Mixture-of-Experts for Mobile Edge Computing
by: Li, Hongbo, et al.
Published: (2024)
by: Li, Hongbo, et al.
Published: (2024)
Theory on Mixture-of-Experts in Continual Learning
by: Li, Hongbo, et al.
Published: (2024)
by: Li, Hongbo, et al.
Published: (2024)
Variational Distillation of Diffusion Policies into Mixture of Experts
by: Zhou, Hongyi, et al.
Published: (2024)
by: Zhou, Hongyi, et al.
Published: (2024)
Beyond Sunk Costs: Boosting LLM Pre-training Efficiency via Orthogonal Growth of Mixture-of-Experts
by: Wang, Ruizhe, et al.
Published: (2025)
by: Wang, Ruizhe, et al.
Published: (2025)
Federated Variational Inference for Bayesian Mixture Models
by: Rao, Jackie, et al.
Published: (2025)
by: Rao, Jackie, et al.
Published: (2025)
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs
by: Liu, Enshu, et al.
Published: (2024)
by: Liu, Enshu, et al.
Published: (2024)
Efficient Mixture-of-Experts LLM Inference with Apple Silicon NPUs
by: Benazir, Afsara, et al.
Published: (2026)
by: Benazir, Afsara, et al.
Published: (2026)
Efficient Mixture Learning in Black-Box Variational Inference
by: Hotti, Alexandra, et al.
Published: (2024)
by: Hotti, Alexandra, et al.
Published: (2024)
ELBOing Stein: Variational Bayes with Stein Mixture Inference
by: Rønning, Ola, et al.
Published: (2024)
by: Rønning, Ola, et al.
Published: (2024)
PreMoE: Proactive Inference for Efficient Mixture-of-Experts
by: Pei, Zehua, et al.
Published: (2025)
by: Pei, Zehua, et al.
Published: (2025)
Toward Inference-optimal Mixture-of-Expert Large Language Models
by: Yun, Longfei, et al.
Published: (2024)
by: Yun, Longfei, et al.
Published: (2024)
Context-Aware Mixture-of-Experts Inference on CXL-Enabled GPU-NDP Systems
by: Fan, Zehao, et al.
Published: (2025)
by: Fan, Zehao, et al.
Published: (2025)
A General Theory for Softmax Gating Multinomial Logistic Mixture of Experts
by: Nguyen, Huy, et al.
Published: (2023)
by: Nguyen, Huy, et al.
Published: (2023)
Anchor-MoE: A Mean-Anchored Mixture of Experts For Probabilistic Regression
by: Su, Baozhuo, et al.
Published: (2025)
by: Su, Baozhuo, et al.
Published: (2025)
A Survey on Inference Optimization Techniques for Mixture of Experts Models
by: Liu, Jiacheng, et al.
Published: (2024)
by: Liu, Jiacheng, et al.
Published: (2024)
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
by: Mu, Siyuan, et al.
Published: (2025)
by: Mu, Siyuan, et al.
Published: (2025)
Theoretical Guarantees for Variational Inference with Fixed-Variance Mixture of Gaussians
by: Huix, Tom, et al.
Published: (2024)
by: Huix, Tom, et al.
Published: (2024)
Adaptive Heterogeneous Mixtures of Normalising Flows for Robust Variational Inference
by: Wiriyapong, Benjamin, et al.
Published: (2025)
by: Wiriyapong, Benjamin, et al.
Published: (2025)
Variational Mixture of Graph Neural Experts for Alzheimer's Disease Biomarker Recognition in EEG Brain Networks
by: Ding, Jun-En, et al.
Published: (2025)
by: Ding, Jun-En, et al.
Published: (2025)
Similarity-Aware Mixture-of-Experts for Data-Efficient Continual Learning
by: Mclaughlin, Connor, et al.
Published: (2026)
by: Mclaughlin, Connor, et al.
Published: (2026)
Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline
by: Fang, Zhiyuan, et al.
Published: (2025)
by: Fang, Zhiyuan, et al.
Published: (2025)
Optimizing Distributed Deployment of Mixture-of-Experts Model Inference in Serverless Computing
by: Liu, Mengfan, et al.
Published: (2025)
by: Liu, Mengfan, et al.
Published: (2025)
MoPEQ: Mixture of Mixed Precision Quantized Experts
by: Chitty-Venkata, Krishna Teja, et al.
Published: (2025)
by: Chitty-Venkata, Krishna Teja, et al.
Published: (2025)
Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers
by: Li, Albus Yizhuo, et al.
Published: (2026)
by: Li, Albus Yizhuo, et al.
Published: (2026)
Alloc-MoE: Budget-Aware Expert Activation Allocation for Efficient Mixture-of-Experts Inference
by: Liu, Baihui, et al.
Published: (2026)
by: Liu, Baihui, et al.
Published: (2026)
Mixture of Message Passing Experts with Routing Entropy Regularization for Node Classification
by: Chen, Xuanze, et al.
Published: (2025)
by: Chen, Xuanze, et al.
Published: (2025)
A Statistical Theory of Gated Attention through the Lens of Hierarchical Mixture of Experts
by: Nguyen, Viet, et al.
Published: (2026)
by: Nguyen, Viet, et al.
Published: (2026)
Scaling Multi-Node Mixture-of-Experts Inference Using Expert Activation Patterns
by: Bambhaniya, Abhimanyu, et al.
Published: (2026)
by: Bambhaniya, Abhimanyu, et al.
Published: (2026)
CodeQuant: Unified Clustering and Quantization for Enhanced Outlier Smoothing in Low-Precision Mixture-of-Experts
by: Yin, Xiangyang, et al.
Published: (2026)
by: Yin, Xiangyang, et al.
Published: (2026)
BuddyMoE: Exploiting Expert Redundancy to Accelerate Memory-Constrained Mixture-of-Experts Inference
by: Wang, Yun, et al.
Published: (2025)
by: Wang, Yun, et al.
Published: (2025)
Sequential Function-Space Variational Inference via Gaussian Mixture Approximation
by: Zhu, Menghao Waiyan William, et al.
Published: (2025)
by: Zhu, Menghao Waiyan William, et al.
Published: (2025)
Variational Inference on the Boolean Hypercube with the Quantum Entropy
by: Beyler, Eliot, et al.
Published: (2024)
by: Beyler, Eliot, et al.
Published: (2024)
Similar Items
-
Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts
by: Hendawy, Ahmed, et al.
Published: (2023) -
Dynamic Expert Quantization for Scalable Mixture-of-Experts Inference
by: Chu, Kexin, et al.
Published: (2025) -
Speculating Experts Accelerates Inference for Mixture-of-Experts
by: Madan, Vivan, et al.
Published: (2026) -
Sparse Orthogonal Variational Inference for Gaussian Processes
by: Shi, Jiaxin, et al.
Published: (2019) -
Accelerating Mixture-of-Expert Inference with Adaptive Expert Split Mechanism
by: Yan, Jiaming, et al.
Published: (2025)