Saved in:
| Main Authors: | Do, Giang, Pham, Kha, Le, Hung, Tran, Truyen |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.19402 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Eigenvectors of Experts are Training-free Non-collapsing Routers
by: Do, Giang, et al.
Published: (2026)
by: Do, Giang, et al.
Published: (2026)
Rethinking Sparse Mixture of Experts from a Unified Perspective
by: Do, Giang, et al.
Published: (2025)
by: Do, Giang, et al.
Published: (2025)
S2MoE: Robust Sparse Mixture of Experts via Stochastic Learning
by: Do, Giang, et al.
Published: (2025)
by: Do, Giang, et al.
Published: (2025)
Do Domain-specific Experts exist in MoE-based LLMs?
by: Do, Giang, et al.
Published: (2026)
by: Do, Giang, et al.
Published: (2026)
MP-PINN: A Multi-Phase Physics-Informed Neural Network for Epidemic Forecasting
by: Nguyen, Thang, et al.
Published: (2024)
by: Nguyen, Thang, et al.
Published: (2024)
SimSMoE: Solving Representational Collapse via Similarity Measure
by: Do, Giang, et al.
Published: (2024)
by: Do, Giang, et al.
Published: (2024)
Enhancing Length Extrapolation in Sequential Models with Pointer-Augmented Neural Memory
by: Le, Hung, et al.
Published: (2024)
by: Le, Hung, et al.
Published: (2024)
Learning Structural Causal Models from Ordering: Identifiable Flow Models
by: Le, Minh Khoa, et al.
Published: (2024)
by: Le, Minh Khoa, et al.
Published: (2024)
CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition
by: Pham, Quang, et al.
Published: (2024)
by: Pham, Quang, et al.
Published: (2024)
FAIREDU: A Multiple Regression-Based Method for Enhancing Fairness in Machine Learning Models for Educational Applications
by: Pham, Nga, et al.
Published: (2024)
by: Pham, Nga, et al.
Published: (2024)
Selective Sinkhorn Routing for Improved Sparse Mixture of Experts
by: Nguyen, Duc Anh, et al.
Published: (2025)
by: Nguyen, Duc Anh, et al.
Published: (2025)
Revisiting the Dataset Bias Problem from a Statistical Perspective
by: Do, Kien, et al.
Published: (2024)
by: Do, Kien, et al.
Published: (2024)
Expert Merging in Sparse Mixture of Experts with Nash Bargaining
by: Nguyen, Dung V., et al.
Published: (2025)
by: Nguyen, Dung V., et al.
Published: (2025)
Improving Routing in Sparse Mixture of Experts with Graph of Tokens
by: Nguyen, Tam, et al.
Published: (2025)
by: Nguyen, Tam, et al.
Published: (2025)
Guardrails in Logit Space: Safety Token Regularization for LLM Alignment
by: Bach, Thong, et al.
Published: (2026)
by: Bach, Thong, et al.
Published: (2026)
One-Prompt Strikes Back: Sparse Mixture of Experts for Prompt-based Continual Learning
by: Le, Minh, et al.
Published: (2025)
by: Le, Minh, et al.
Published: (2025)
Finding the Trigger: Causal Abductive Reasoning on Video Events
by: Le, Thao Minh, et al.
Published: (2025)
by: Le, Thao Minh, et al.
Published: (2025)
Modeling Expert Interactions in Sparse Mixture of Experts via Graph Structures
by: Nguyen-Nhat, Minh-Khoi, et al.
Published: (2025)
by: Nguyen-Nhat, Minh-Khoi, et al.
Published: (2025)
Progressive Multi-granular Alignments for Grounded Reasoning in Large Vision-Language Models
by: Le, Quang-Hung, et al.
Published: (2024)
by: Le, Quang-Hung, et al.
Published: (2024)
UB-SMoE: Universally Balanced Sparse Mixture-of-Experts for Resource-adaptive Federated Fine-tuning of Foundation Models
by: Tran, Van-Tuan, et al.
Published: (2026)
by: Tran, Van-Tuan, et al.
Published: (2026)
Rethinking Deep Alignment Through The Lens Of Incomplete Learning
by: Bach, Thong, et al.
Published: (2025)
by: Bach, Thong, et al.
Published: (2025)
Continual Safety Alignment via Gradient-Based Sample Selection
by: Bach, Thong, et al.
Published: (2026)
by: Bach, Thong, et al.
Published: (2026)
Robust SDE Parameter Estimation Under Missing Time Information Setting
by: Van Tran, Long, et al.
Published: (2026)
by: Van Tran, Long, et al.
Published: (2026)
Improving Minimax Estimation Rates for Contaminated Mixture of Multinomial Logistic Experts via Expert Heterogeneity
by: Yan, Fanqi, et al.
Published: (2026)
by: Yan, Fanqi, et al.
Published: (2026)
Mixture of Experts Meets Prompt-Based Continual Learning
by: Le, Minh, et al.
Published: (2024)
by: Le, Minh, et al.
Published: (2024)
Generalized Sobolev Transport for Probability Measures on a Graph
by: Le, Tam, et al.
Published: (2024)
by: Le, Tam, et al.
Published: (2024)
Optimal Transport for Measures with Noisy Tree Metric
by: Le, Tam, et al.
Published: (2023)
by: Le, Tam, et al.
Published: (2023)
Policy Learning for Off-Dynamics RL with Deficient Support
by: Van, Linh Le Pham, et al.
Published: (2024)
by: Van, Linh Le Pham, et al.
Published: (2024)
Curvature-Aware Safety Restoration In LLMs Fine-Tuning
by: Bach, Thong, et al.
Published: (2025)
by: Bach, Thong, et al.
Published: (2025)
LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models
by: Nguyen, Nam V., et al.
Published: (2024)
by: Nguyen, Nam V., et al.
Published: (2024)
Automatic Expert Discovery in LLM Upcycling via Sparse Interpolated Mixture-of-Experts
by: Chen, Shengzhuang, et al.
Published: (2025)
by: Chen, Shengzhuang, et al.
Published: (2025)
Diversifying the Expert Knowledge for Task-Agnostic Pruning in Sparse Mixture-of-Experts
by: Zhang, Zeliang, et al.
Published: (2024)
by: Zhang, Zeliang, et al.
Published: (2024)
Exploring Expert Specialization through Unsupervised Training in Sparse Mixture of Experts
by: Nikolic, Strahinja, et al.
Published: (2025)
by: Nikolic, Strahinja, et al.
Published: (2025)
Routers Learn the Geometry of Their Experts: Geometric Coupling in Sparse Mixture-of-Experts
by: Ahrac, Sagi, et al.
Published: (2026)
by: Ahrac, Sagi, et al.
Published: (2026)
Hybrid Cross-domain Robust Reinforcement Learning
by: Van, Linh Le Pham, et al.
Published: (2025)
by: Van, Linh Le Pham, et al.
Published: (2025)
MoC-System: Efficient Fault Tolerance for Sparse Mixture-of-Experts Model Training
by: Cai, Weilin, et al.
Published: (2024)
by: Cai, Weilin, et al.
Published: (2024)
From Sparse to Soft Mixtures of Experts
by: Puigcerver, Joan, et al.
Published: (2023)
by: Puigcerver, Joan, et al.
Published: (2023)
Generalization Bound for Diffusion Models using Random Features
by: Saha, Esha, et al.
Published: (2023)
by: Saha, Esha, et al.
Published: (2023)
A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts
by: Chowdhury, Mohammed Nowaz Rabbani, et al.
Published: (2024)
by: Chowdhury, Mohammed Nowaz Rabbani, et al.
Published: (2024)
Score-based Integrated Gradient for Root Cause Explanations of Outliers
by: Nguyen, Phuoc, et al.
Published: (2026)
by: Nguyen, Phuoc, et al.
Published: (2026)
Similar Items
-
Eigenvectors of Experts are Training-free Non-collapsing Routers
by: Do, Giang, et al.
Published: (2026) -
Rethinking Sparse Mixture of Experts from a Unified Perspective
by: Do, Giang, et al.
Published: (2025) -
S2MoE: Robust Sparse Mixture of Experts via Stochastic Learning
by: Do, Giang, et al.
Published: (2025) -
Do Domain-specific Experts exist in MoE-based LLMs?
by: Do, Giang, et al.
Published: (2026) -
MP-PINN: A Multi-Phase Physics-Informed Neural Network for Epidemic Forecasting
by: Nguyen, Thang, et al.
Published: (2024)