:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Rokah, Adam, Veress, Daniel, Caulk, Caleb, Sharan, Sourav
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2601.15021
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

EMoE: Eigenbasis-Guided Routing for Mixture-of-Experts
by: Cheng, Anzhe, et al.
Published: (2026)

Stable Routing for Mixture-of-Experts in Class-Incremental Learning
by: Guo, Zirui, et al.
Published: (2026)

Scaling Continual Learning to 300+ Tasks with Bi-Level Routing Mixture-of-Experts
by: Lou, Meng, et al.
Published: (2026)

Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts
by: Yuan, Yike, et al.
Published: (2025)

Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models
by: Luo, Jun, et al.
Published: (2024)

Routers in Vision Mixture of Experts: An Empirical Study
by: Liu, Tianlin, et al.
Published: (2024)

Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
by: Oldfield, James, et al.
Published: (2024)

Domain-Specialized Object Detection via Model-Level Mixtures of Experts
by: Pavlitska, Svetlana, et al.
Published: (2026)

Efficient Training of Diffusion Mixture-of-Experts Models: A Practical Recipe
by: Liu, Yahui, et al.
Published: (2025)

Merging Multi-Task Models via Weight-Ensembling Mixture of Experts
by: Tang, Anke, et al.
Published: (2024)

Video Relationship Detection Using Mixture of Experts
by: Shaabana, Ala, et al.
Published: (2024)

Robust Experts: the Effect of Adversarial Training on CNNs with Sparse Mixture-of-Experts Layers
by: Pavlitska, Svetlana, et al.
Published: (2025)

Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging
by: Shen, Li, et al.
Published: (2024)

MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models
by: Wang, Hongyu, et al.
Published: (2025)

Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation
by: Pavlitska, Svetlana, et al.
Published: (2024)

LPT++: Efficient Training on Mixture of Long-tailed Experts
by: Dong, Bowen, et al.
Published: (2024)

Mixture of Experts in Image Classification: What's the Sweet Spot?
by: Videau, Mathurin, et al.
Published: (2024)

Extracting Uncertainty Estimates from Mixtures of Experts for Semantic Segmentation
by: Pavlitska, Svetlana, et al.
Published: (2025)

BioFact-MoE: Biologically Factorized Mixture of Experts for Vision-Language Prognostic Modeling in Hepatocellular Carcinoma
by: Yang, Junlin, et al.
Published: (2026)

Mixture of Group Experts for Learning Invariant Representations
by: Kang, Lei, et al.
Published: (2025)

Improving OOD Generalization of Pre-trained Encoders via Aligned Embedding-Space Ensembles
by: Peng, Shuman, et al.
Published: (2024)

EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing
by: Sun, Haotian, et al.
Published: (2024)

Lightweight Metadata-Aware Mixture-of-Experts Masked Autoencoder for Earth Observation
by: Albughdadi, Mohanad
Published: (2025)

From Sparse to Soft Mixtures of Experts
by: Puigcerver, Joan, et al.
Published: (2023)

MINGLE: Mixture of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging
by: Qiu, Zihuan, et al.
Published: (2025)

Design and Behavior of Sparse Mixture-of-Experts Layers in CNN-based Semantic Segmentation
by: Pavlitska, Svetlana, et al.
Published: (2026)

Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts
by: Liu, Zhili, et al.
Published: (2024)

Teacher-Guided Routing for Sparse Vision Mixture-of-Experts
by: Kada, Masahiro, et al.
Published: (2026)

Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts
by: Liu, Yuejiang, et al.
Published: (2024)

MoPD: Mixture-of-Prompts Distillation for Vision-Language Models
by: Chen, Yang, et al.
Published: (2024)

Mixture-of-Experts for Open Set Domain Adaptation: A Dual-Space Detection Approach
by: Du, Zhenbang, et al.
Published: (2023)

FLAVARS: A Multimodal Foundational Language and Vision Alignment Model for Remote Sensing
by: Corley, Isaac, et al.
Published: (2025)

Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis
by: Srinivas, Sakhinana Sagar, et al.
Published: (2024)

Sublinear Variational Optimization of Gaussian Mixture Models with Millions to Billions of Parameters
by: Salwig, Sebastian, et al.
Published: (2025)

MoRE-Brain: Routed Mixture of Experts for Interpretable and Generalizable Cross-Subject fMRI Visual Decoding
by: Wei, Yuxiang, et al.
Published: (2025)

FastMMoE: Accelerating Multimodal Large Language Models through Dynamic Expert Activation and Routing-Aware Token Pruning
by: Xia, Guoyang, et al.
Published: (2025)

IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Experts
by: Xue, Eric, et al.
Published: (2025)

MoQE: Improve Quantization Model performance via Mixture of Quantization Experts
by: Zhang, Jinhao, et al.
Published: (2025)

AMEND: A Mixture of Experts Framework for Long-tailed Trajectory Prediction
by: Mercurius, Ray Coden, et al.
Published: (2024)

Adaptive Shared Experts with LoRA-Based Mixture of Experts for Multi-Task Learning
by: Yang, Minghao, et al.
Published: (2025)