| Main Authors: | Rokah, Adam, Veress, Daniel, Caulk, Caleb, Sharan, Sourav |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.15021 |
Similar Items
EMoE: Eigenbasis-Guided Routing for Mixture-of-Experts
by: Cheng, Anzhe, et al.
Published: (2026)
Stable Routing for Mixture-of-Experts in Class-Incremental Learning
by: Guo, Zirui, et al.
Published: (2026)
Scaling Continual Learning to 300+ Tasks with Bi-Level Routing Mixture-of-Experts
by: Lou, Meng, et al.
Published: (2026)
Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts
by: Yuan, Yike, et al.
Published: (2025)
Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models
by: Luo, Jun, et al.
Published: (2024)
Routers in Vision Mixture of Experts: An Empirical Study
by: Liu, Tianlin, et al.
Published: (2024)
Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
by: Oldfield, James, et al.
Published: (2024)
Domain-Specialized Object Detection via Model-Level Mixtures of Experts
by: Pavlitska, Svetlana, et al.
Published: (2026)
Efficient Training of Diffusion Mixture-of-Experts Models: A Practical Recipe
by: Liu, Yahui, et al.
Published: (2025)
Merging Multi-Task Models via Weight-Ensembling Mixture of Experts
by: Tang, Anke, et al.
Published: (2024)
Video Relationship Detection Using Mixture of Experts
by: Shaabana, Ala, et al.
Published: (2024)
Robust Experts: the Effect of Adversarial Training on CNNs with Sparse Mixture-of-Experts Layers
by: Pavlitska, Svetlana, et al.
Published: (2025)
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging
by: Shen, Li, et al.
Published: (2024)
MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models
by: Wang, Hongyu, et al.
Published: (2025)
Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation
by: Pavlitska, Svetlana, et al.
Published: (2024)
LPT++: Efficient Training on Mixture of Long-tailed Experts
by: Dong, Bowen, et al.
Published: (2024)
Mixture of Experts in Image Classification: What's the Sweet Spot?
by: Videau, Mathurin, et al.
Published: (2024)
Extracting Uncertainty Estimates from Mixtures of Experts for Semantic Segmentation
by: Pavlitska, Svetlana, et al.
Published: (2025)
BioFact-MoE: Biologically Factorized Mixture of Experts for Vision-Language Prognostic Modeling in Hepatocellular Carcinoma
by: Yang, Junlin, et al.
Published: (2026)
Mixture of Group Experts for Learning Invariant Representations
by: Kang, Lei, et al.
Published: (2025)
Improving OOD Generalization of Pre-trained Encoders via Aligned Embedding-Space Ensembles
by: Peng, Shuman, et al.
Published: (2024)
EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing
by: Sun, Haotian, et al.
Published: (2024)
Lightweight Metadata-Aware Mixture-of-Experts Masked Autoencoder for Earth Observation
by: Albughdadi, Mohanad
Published: (2025)
From Sparse to Soft Mixtures of Experts
by: Puigcerver, Joan, et al.
Published: (2023)
MINGLE: Mixture of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging
by: Qiu, Zihuan, et al.
Published: (2025)
Design and Behavior of Sparse Mixture-of-Experts Layers in CNN-based Semantic Segmentation
by: Pavlitska, Svetlana, et al.
Published: (2026)
Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts
by: Liu, Zhili, et al.
Published: (2024)
Teacher-Guided Routing for Sparse Vision Mixture-of-Experts
by: Kada, Masahiro, et al.
Published: (2026)
Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts
by: Liu, Yuejiang, et al.
Published: (2024)
MoPD: Mixture-of-Prompts Distillation for Vision-Language Models
by: Chen, Yang, et al.
Published: (2024)
Mixture-of-Experts for Open Set Domain Adaptation: A Dual-Space Detection Approach
by: Du, Zhenbang, et al.
Published: (2023)
FLAVARS: A Multimodal Foundational Language and Vision Alignment Model for Remote Sensing
by: Corley, Isaac, et al.
Published: (2025)
Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis
by: Srinivas, Sakhinana Sagar, et al.
Published: (2024)
Sublinear Variational Optimization of Gaussian Mixture Models with Millions to Billions of Parameters
by: Salwig, Sebastian, et al.
Published: (2025)
MoRE-Brain: Routed Mixture of Experts for Interpretable and Generalizable Cross-Subject fMRI Visual Decoding
by: Wei, Yuxiang, et al.
Published: (2025)
FastMMoE: Accelerating Multimodal Large Language Models through Dynamic Expert Activation and Routing-Aware Token Pruning
by: Xia, Guoyang, et al.
Published: (2025)
IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Experts
by: Xue, Eric, et al.
Published: (2025)
MoQE: Improve Quantization Model performance via Mixture of Quantization Experts
by: Zhang, Jinhao, et al.
Published: (2025)
AMEND: A Mixture of Experts Framework for Long-tailed Trajectory Prediction
by: Mercurius, Ray Coden, et al.
Published: (2024)
Adaptive Shared Experts with LoRA-Based Mixture of Experts for Multi-Task Learning
by: Yang, Minghao, et al.
Published: (2025)