| Main Authors: | Tang, Tianwen, Zhu, Tong, Liu, Haodong, Bai, Yin, Cheng, Jia, Chen, Wenliang |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.08559 |
Similar Items
MoPE: Mixture of Prompt Experts for Parameter-Efficient and Scalable Multimodal Fusion
by: Jiang, Ruixiang, et al.
Published: (2024)
MoPE: A Mixture of Password Experts for Improving Password Guessing
by: Duan, Mingjian, et al.
Published: (2025)
Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
by: Zhu, Tong, et al.
Published: (2024)
DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space
by: Xiang, Jianxiang, et al.
Published: (2024)
GEM: Graph-Enhanced Mixture-of-Experts with ReAct Agents for Dialogue State Tracking
by: Zhu, Ziqi, et al.
Published: (2026)
Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation
by: Luo, Xiang, et al.
Published: (2024)
Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs
by: Bai, Jun, et al.
Published: (2025)
Controllable and Diverse Data Augmentation with Large Language Model for Low-Resource Open-Domain Dialogue Generation
by: Liu, Zhenhua, et al.
Published: (2024)
Schema Augmentation for Zero-Shot Domain Adaptation in Dialogue State Tracking
by: Richardson, Christopher, et al.
Published: (2024)
UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State Tracking
by: Li, Chuang, et al.
Published: (2023)
Diverse and Effective Synthetic Data Generation for Adaptable Zero-Shot Dialogue State Tracking
by: Finch, James D., et al.
Published: (2024)
Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts
by: Sun, Weigao, et al.
Published: (2025)
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
by: Wei, Tianwen, et al.
Published: (2024)
LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
by: Zhu, Tong, et al.
Published: (2024)
ReacTOD: Bounded Neuro-Symbolic Agentic NLU for Zero-Shot Dialogue State Tracking
by: Lin, Yanjun, et al.
Published: (2026)
MoBiLE: Efficient Mixture-of-Experts Inference on Consumer GPU with Mixture of Big Little Experts
by: Zhao, Yushu, et al.
Published: (2025)
Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity
by: Tang, Yehui, et al.
Published: (2025)
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
by: Qu, Xiaoye, et al.
Published: (2024)
MoPE-CLIP: Structured Pruning for Efficient Vision-Language Models with Module-wise Pruning Error Metric
by: Lin, Haokun, et al.
Published: (2024)
Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation
by: Niu, Cheng, et al.
Published: (2024)
DiaMoE-TTS: A Unified IPA-Based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptation
by: Chen, Ziqi, et al.
Published: (2025)
Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-Alignment
by: Liu, Zhili, et al.
Published: (2024)
EvoMoE: Expert Evolution in Mixture of Experts for Multimodal Large Language Models
by: Jing, Linglin, et al.
Published: (2025)
MoG: Mixture of Experts for Graph-based Retrieval-Augmented Generation
by: Yuan, Zheng, et al.
Published: (2026)
MoDEM: Mixture of Domain Expert Models
by: Simonds, Toby, et al.
Published: (2024)
RotMoLE: Enhancing Mixture of Low-Rank Experts through Rotational Gating Mechanism
by: Sun, Mengyang, et al.
Published: (2026)
MoMoE: Mixture of Moderation Experts Framework for AI-Assisted Online Governance
by: Goyal, Agam, et al.
Published: (2025)
$\texttt{MoE-RBench}$: Towards Building Reliable Language Models with Sparse Mixture-of-Experts
by: Chen, Guanjie, et al.
Published: (2024)
MobileMoE: Scaling On-Device Mixture of Experts
by: Chen, Yanbei, et al.
Published: (2026)
$\infty$-MoE: Generalizing Mixture of Experts to Infinite Experts
by: Takashiro, Shota, et al.
Published: (2026)
CorrectionLM: Self-Corrections with SLM for Dialogue State Tracking
by: Lee, Chia-Hsuan, et al.
Published: (2024)
Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models
by: Wang, Zihan, et al.
Published: (2025)
MH-MoE: Multi-Head Mixture-of-Experts
by: Huang, Shaohan, et al.
Published: (2024)
Zero-Shot Cross-Lingual Transfer using Prefix-Based Adaptation
by: A, Snegha, et al.
Published: (2025)
MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts
by: Su, Zhenpeng, et al.
Published: (2024)
MoFE: Mixture of Frozen Experts Architecture
by: Seo, Jean, et al.
Published: (2025)
QuantMoE-Bench: Examining Post-Training Quantization for Mixture-of-Experts
by: Li, Pingzhi, et al.
Published: (2024)
DynMoLE: Boosting Mixture of LoRA Experts Fine-Tuning with a Hybrid Routing Mechanism
by: Li, Dengchun, et al.
Published: (2025)
Router Upcycling: Leveraging Mixture-of-Routers in Mixture-of-Experts Upcycling
by: Ran, Junfeng, et al.
Published: (2025)
OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking
by: Lee, Chia-Hsuan, et al.
Published: (2023)