:: Library Catalog

Image de couverture de livre

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Lin, Shuning, He, Yifan, Chen, Yitong
Format:	Preprint
Publié:	2025
Sujets:	Machine Learning Artificial Intelligence
Accès en ligne:	https://arxiv.org/abs/2511.05814
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

Documents similaires

Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models
par: Liang, Jingcong, et autres
Publié: (2025)

Efficiently Editing Mixture-of-Experts Models with Compressed Experts
par: He, Yifei, et autres
Publié: (2025)

Mixture of Cache-Conditional Experts for Efficient Mobile Device Inference
par: Skliar, Andrii, et autres
Publié: (2024)

Mixture of A Million Experts
par: He, Xu Owen
Publié: (2024)

KV Cache Offloading for Context-Intensive Tasks
par: Bocharnikov, Andrey, et autres
Publié: (2026)

MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts
par: Lin, Xi Victoria, et autres
Publié: (2024)

Theory on Mixture-of-Experts in Continual Learning
par: Li, Hongbo, et autres
Publié: (2024)

MC#: Mixture Compressor for Mixture-of-Experts Large Models
par: Huang, Wei, et autres
Publié: (2025)

Mixture of Raytraced Experts
par: Perin, Andrea, et autres
Publié: (2025)

A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models
par: Sun, Mengyang, et autres
Publié: (2025)

Mixture of Experts in a Mixture of RL settings
par: Willi, Timon, et autres
Publié: (2024)

Speculating Experts Accelerates Inference for Mixture-of-Experts
par: Madan, Vivan, et autres
Publié: (2026)

A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
par: Mu, Siyuan, et autres
Publié: (2025)

Symphony-MoE: Harmonizing Disparate Pre-trained Models into a Coherent Mixture-of-Experts
par: Wang, Qi, et autres
Publié: (2025)

HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts
par: Zhao, Hao, et autres
Publié: (2024)

PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model
par: Liu, Yilun, et autres
Publié: (2024)

Dual Mixture-of-Experts Framework for Discrete-Time Survival Analysis
par: Lee, Hyeonjun, et autres
Publié: (2025)

Aggregation Alignment for Federated Learning with Mixture-of-Experts under Data Heterogeneity
par: Fang, Zihan, et autres
Publié: (2026)

UniPool: A Globally Shared Expert Pool for Mixture-of-Experts
par: Huang, Minbin, et autres
Publié: (2026)

Dynamic Adaptive Shared Experts with Grouped Multi-Head Attention Mixture of Experts
par: Li, Cheng, et autres
Publié: (2025)

Sparsity and Superposition in Mixture of Experts
par: Chaudhari, Marmik, et autres
Publié: (2025)

Mixture of Diverse Size Experts
par: Sun, Manxi, et autres
Publié: (2024)

Mixture of Concept Bottleneck Experts
par: De Santis, Francesco, et autres
Publié: (2026)

Mixture-of-Linear-Experts for Long-term Time Series Forecasting
par: Ni, Ronghao, et autres
Publié: (2023)

Mixture-of-Experts Meets In-Context Reinforcement Learning
par: Wu, Wenhao, et autres
Publié: (2025)

Wavelet Mixture of Experts for Time Series Forecasting
par: Zhou, Zheng, et autres
Publié: (2025)

Towards Efficient Mixture of Experts: A Holistic Study of Compression Techniques
par: He, Shwai, et autres
Publié: (2024)

AnyExperts: On-Demand Expert Allocation for Multimodal Language Models with Mixture of Expert
par: Gao, Yuting, et autres
Publié: (2025)

Accelerating Mixture-of-Expert Inference with Adaptive Expert Split Mechanism
par: Yan, Jiaming, et autres
Publié: (2025)

Expert Upcycling: Shifting the Compute-Efficient Frontier of Mixture-of-Experts
par: Dwivedi, Chaitanya, et autres
Publié: (2026)

SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models
par: Tang, Anke, et autres
Publié: (2024)

FlashMoE: Reducing SSD I/O Bottlenecks via ML-Based Cache Replacement for Mixture-of-Experts Inference on Edge Devices
par: Kim, Byeongju, et autres
Publié: (2026)

Mixture of Experts in Large Language Models
par: Zhang, Danyang, et autres
Publié: (2025)

Graph Knowledge Distillation to Mixture of Experts
par: Rumiantsev, Pavel, et autres
Publié: (2024)

Mixture of Weak & Strong Experts on Graphs
par: Zeng, Hanqing, et autres
Publié: (2023)

EAC-MoE: Expert-Selection Aware Compressor for Mixture-of-Experts Large Language Models
par: Chen, Yuanteng, et autres
Publié: (2025)

Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline
par: Fang, Zhiyuan, et autres
Publié: (2025)

Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
par: Guo, Yongxin, et autres
Publié: (2024)

Dynamic Expert Quantization for Scalable Mixture-of-Experts Inference
par: Chu, Kexin, et autres
Publié: (2025)

Modeling Expert Interactions in Sparse Mixture of Experts via Graph Structures
par: Nguyen-Nhat, Minh-Khoi, et autres
Publié: (2025)