Saved in:
Bibliographic Details
Main Authors: Wang, Ye, Wu, Zixuan, Shen, Lifeng, Xie, Jiang, Wang, Xiaoling, Yu, Hong, Wang, Guoyin
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.15708
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918391725424640
author Wang, Ye
Wu, Zixuan
Shen, Lifeng
Xie, Jiang
Wang, Xiaoling
Yu, Hong
Wang, Guoyin
author_facet Wang, Ye
Wu, Zixuan
Shen, Lifeng
Xie, Jiang
Wang, Xiaoling
Yu, Hong
Wang, Guoyin
contents Imbalanced data distribution remains a critical challenge in sequential learning, leading models to easily recognize frequent categories while failing to detect minority classes adequately. The Mixture-of-Experts model offers a scalable solution, yet its application is often hindered by parameter inefficiency, poor expert specialization, and difficulty in resolving prediction conflicts. To Master the Minority classes effectively, we propose the Uncertainty-based Multi-Expert fusion network (UME) framework. UME is designed with three core innovations: First, we employ Ensemble LoRA for parameter-efficient modeling, significantly reducing the trainable parameter count. Second, we introduce Sequential Specialization guided by Dempster-Shafer Theory (DST), which ensures effective specialization on the challenging-tailed classes. Finally, an Uncertainty-Guided Fusion mechanism uses DST's certainty measures to dynamically weigh expert opinions, resolving conflicts by prioritizing the most confident expert for reliable final predictions. Extensive experiments across four public hierarchical text classification datasets demonstrate that UME achieves state-of-the-art performance. We achieve a performance gain of up to 17.97\% over the best baseline on individual categories, while reducing trainable parameters by up to 10.32\%. The findings highlight that uncertainty-guided expert coordination is a principled strategy for addressing challenging-tailed sequence learning. Our code is available at https://github.com/CQUPTWZX/Multi-experts.
format Preprint
id arxiv_https___arxiv_org_abs_2603_15708
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Mastering the Minority: An Uncertainty-guided Multi-Expert Framework for Challenging-tailed Sequence Learning
Wang, Ye
Wu, Zixuan
Shen, Lifeng
Xie, Jiang
Wang, Xiaoling
Yu, Hong
Wang, Guoyin
Machine Learning
Artificial Intelligence
Imbalanced data distribution remains a critical challenge in sequential learning, leading models to easily recognize frequent categories while failing to detect minority classes adequately. The Mixture-of-Experts model offers a scalable solution, yet its application is often hindered by parameter inefficiency, poor expert specialization, and difficulty in resolving prediction conflicts. To Master the Minority classes effectively, we propose the Uncertainty-based Multi-Expert fusion network (UME) framework. UME is designed with three core innovations: First, we employ Ensemble LoRA for parameter-efficient modeling, significantly reducing the trainable parameter count. Second, we introduce Sequential Specialization guided by Dempster-Shafer Theory (DST), which ensures effective specialization on the challenging-tailed classes. Finally, an Uncertainty-Guided Fusion mechanism uses DST's certainty measures to dynamically weigh expert opinions, resolving conflicts by prioritizing the most confident expert for reliable final predictions. Extensive experiments across four public hierarchical text classification datasets demonstrate that UME achieves state-of-the-art performance. We achieve a performance gain of up to 17.97\% over the best baseline on individual categories, while reducing trainable parameters by up to 10.32\%. The findings highlight that uncertainty-guided expert coordination is a principled strategy for addressing challenging-tailed sequence learning. Our code is available at https://github.com/CQUPTWZX/Multi-experts.
title Mastering the Minority: An Uncertainty-guided Multi-Expert Framework for Challenging-tailed Sequence Learning
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2603.15708