| Main Author: | Luttner, Lucas |
|---|---|
| Format: | Preprint |
| Published: | 2023 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2312.08083 |
Similar Items
Node-wise Filtering in Graph Neural Networks: A Mixture of Experts Approach
by: Han, Haoyu, et al.
Published: (2024)
Optimizing Pre-Training Data Mixtures with Mixtures of Data Expert Models
by: Belenki, Lior, et al.
Published: (2025)
$ϕ$-Balancing for Mixture-of-Experts Training
by: Chen, Lizhang, et al.
Published: (2026)
Convolutional Neural Networks and Mixture of Experts for Intrusion Detection in 5G Networks and beyond
by: Ilias, Loukas, et al.
Published: (2024)
Train Separately, Merge Together: Modular Post-Training with Mixture-of-Experts
by: Morrison, Jacob, et al.
Published: (2026)
Differentially Private Training of Mixture of Experts Models
by: Tholoniat, Pierre, et al.
Published: (2024)
Graph-Conditioned Mixture of Graph Neural Network Experts for Traffic Forecasting
by: Ghaffari, Amirhossein, et al.
Published: (2026)
Multilingual Routing in Mixture-of-Experts
by: Bandarkar, Lucas, et al.
Published: (2025)
Exploring Expert Specialization through Unsupervised Training in Sparse Mixture of Experts
by: Nikolic, Strahinja, et al.
Published: (2025)
MixtureKit: A General Framework for Composing, Training, and Visualizing Mixture-of-Experts Models
by: Chamma, Ahmad, et al.
Published: (2025)
Dense Backpropagation Improves Training for Sparse Mixture-of-Experts
by: Panda, Ashwinee, et al.
Published: (2025)
MoNTA: Accelerating Mixture-of-Experts Training with Network-Traffic-Aware Parallel Optimization
by: Guo, Jingming, et al.
Published: (2024)
Fast Training of Mixture-of-Experts for Time Series Forecasting via Expert Loss Integration
by: Mahtout, Btissame El, et al.
Published: (2026)
Variational Mixture of Graph Neural Experts for Alzheimer's Disease Biomarker Recognition in EEG Brain Networks
by: Ding, Jun-En, et al.
Published: (2025)
Efficient Training of Large-Scale AI Models Through Federated Mixture-of-Experts: A System-Level Approach
by: Chen, Xiaobing, et al.
Published: (2025)
Neural Inhibition Improves Dynamic Routing and Mixture of Experts
by: Zou, Will Y., et al.
Published: (2025)
Robust Experts: the Effect of Adversarial Training on CNNs with Sparse Mixture-of-Experts Layers
by: Pavlitska, Svetlana, et al.
Published: (2025)
LPT++: Efficient Training on Mixture of Long-tailed Experts
by: Dong, Bowen, et al.
Published: (2024)
Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models
by: Wu, Yongji, et al.
Published: (2024)
FSMoE: A Flexible and Scalable Training System for Sparse Mixture-of-Experts Models
by: Pan, Xinglin, et al.
Published: (2025)
DriftMoE: A Mixture of Experts Approach to Handle Concept Drifts
by: Aspis, Miguel, et al.
Published: (2025)
FairlyUncertain: A Comprehensive Benchmark of Uncertainty in Algorithmic Fairness
by: Rosenblatt, Lucas, et al.
Published: (2024)
Learning More Generalized Experts by Merging Experts in Mixture-of-Experts
by: Park, Sejik
Published: (2024)
Mixture of Scope Experts at Test: Generalizing Deeper Graph Neural Networks with Shallow Variants
by: Deng, Gangda, et al.
Published: (2024)
Discussion: Effective and Interpretable Outcome Prediction by Training Sparse Mixtures of Linear Experts
by: Folino, Francesco, et al.
Published: (2024)
CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition
by: Pham, Quang, et al.
Published: (2024)
M3Net: A Multi-Metric Mixture of Experts Network Digital Twin with Graph Neural Networks
by: Guda, Blessed, et al.
Published: (2025)
Similarity-Aware Mixture-of-Experts for Data-Efficient Continual Learning
by: Mclaughlin, Connor, et al.
Published: (2026)
DynaMoE: Dynamic Token-Level Expert Activation with Layer-Wise Adaptive Capacity for Mixture-of-Experts Neural Networks
by: Gülmez, Gökdeniz
Published: (2026)
Mixture of Experts (MoE): A Big Data Perspective
by: Gan, Wensheng, et al.
Published: (2025)
Mixture of A Million Experts
by: He, Xu Owen
Published: (2024)
Mixture-of-Experts Operator Transformer for Large-Scale PDE Pre-Training
by: Wang, Hong, et al.
Published: (2025)
Knowledge Localization in Mixture-of-Experts LLMs Using Cross-Lingual Inconsistency
by: Bandarkar, Lucas, et al.
Published: (2026)
Efficient Training of Diffusion Mixture-of-Experts Models: A Practical Recipe
by: Liu, Yahui, et al.
Published: (2025)
Enhancing the "Immunity" of Mixture-of-Experts Networks for Adversarial Defense
by: Han, Qiao, et al.
Published: (2024)
Imitation Learning from Observations: An Autoregressive Mixture of Experts Approach
by: Wang, Renzi, et al.
Published: (2024)
Path-Constrained Mixture-of-Experts
by: Gu, Zijin, et al.
Published: (2026)
$μ$-Parametrization for Mixture of Experts
by: Małaśnicki, Jan, et al.
Published: (2025)
Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models
by: Pan, Bowen, et al.
Published: (2024)
Expert Merging in Sparse Mixture of Experts with Nash Bargaining
by: Nguyen, Dung V., et al.
Published: (2025)