Saved in:
| Main Authors: | Han, Jiayi, Du, Liang, Chen, Yinda, Kang, Xiao, Ding, Weiyang, Han, Donghong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.14900 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SLIM: Let LLM Learn More and Forget Less with Soft LoRA and Identity Mixture
by: Han, Jiayi, et al.
Published: (2024)
by: Han, Jiayi, et al.
Published: (2024)
CP-Router: An Uncertainty-Aware Router Between LLM and LRM
by: Su, Jiayuan, et al.
Published: (2025)
by: Su, Jiayuan, et al.
Published: (2025)
FURINA: A Fully Customizable Role-Playing Benchmark via Scalable Multi-Agent Collaboration Pipeline
by: Wu, Haotian, et al.
Published: (2025)
by: Wu, Haotian, et al.
Published: (2025)
Rethinking Precision of Pseudo Label: Test-Time Adaptation via Complementary Learning
by: Han, Jiayi, et al.
Published: (2023)
by: Han, Jiayi, et al.
Published: (2023)
DeepSieve: Information Sieving via LLM-as-a-Knowledge-Router
by: Guo, Minghao, et al.
Published: (2025)
by: Guo, Minghao, et al.
Published: (2025)
DALR: Dual-level Alignment Learning for Multimodal Sentence Representation Learning
by: He, Kang, et al.
Published: (2025)
by: He, Kang, et al.
Published: (2025)
Enhance-then-Balance Modality Collaboration for Robust Multimodal Sentiment Analysis
by: He, Kang, et al.
Published: (2026)
by: He, Kang, et al.
Published: (2026)
PRCCF: A Persona-guided Retrieval and Causal-aware Cognitive Filtering Framework for Emotional Support Conversation
by: Luo, Yanxin, et al.
Published: (2026)
by: Luo, Yanxin, et al.
Published: (2026)
Mixture of Routers
by: Zhang, Jia-Chen, et al.
Published: (2025)
by: Zhang, Jia-Chen, et al.
Published: (2025)
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
by: Zhang, Haozhen, et al.
Published: (2025)
by: Zhang, Haozhen, et al.
Published: (2025)
WebRouter: Query-specific Router via Variational Information Bottleneck for Cost-sensitive Web Agent
by: Li, Tao, et al.
Published: (2025)
by: Li, Tao, et al.
Published: (2025)
Router Upcycling: Leveraging Mixture-of-Routers in Mixture-of-Experts Upcycling
by: Ran, Junfeng, et al.
Published: (2025)
by: Ran, Junfeng, et al.
Published: (2025)
TabDLM: Free-Form Tabular Data Generation via Joint Numerical-Language Diffusion
by: Cai, Donghong, et al.
Published: (2026)
by: Cai, Donghong, et al.
Published: (2026)
SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation
by: Le, Chenyang, et al.
Published: (2025)
by: Le, Chenyang, et al.
Published: (2025)
Improving embedding with contrastive fine-tuning on small datasets with expert-augmented scores
by: Lu, Jun, et al.
Published: (2024)
by: Lu, Jun, et al.
Published: (2024)
Embedding Inversion via Conditional Masked Diffusion Language Models
by: Xiao, Han
Published: (2026)
by: Xiao, Han
Published: (2026)
Efficient Model-agnostic Alignment via Bayesian Persuasion
by: Bai, Fengshuo, et al.
Published: (2024)
by: Bai, Fengshuo, et al.
Published: (2024)
Find Your Optimal Teacher: Personalized Data Synthesis via Router-Guided Multi-Teacher Distillation
by: Zhang, Hengyuan, et al.
Published: (2025)
by: Zhang, Hengyuan, et al.
Published: (2025)
OrcaRouter: A Production-Oriented LLM Router with Hybrid Offline-Online Learning
by: Bao, Zhenghua, et al.
Published: (2026)
by: Bao, Zhenghua, et al.
Published: (2026)
Arch-Router: Aligning LLM Routing with Human Preferences
by: Tran, Co, et al.
Published: (2025)
by: Tran, Co, et al.
Published: (2025)
AgentRouter: A Knowledge-Graph-Guided LLM Router for Collaborative Multi-Agent Question Answering
by: Zhang, Zheyuan, et al.
Published: (2025)
by: Zhang, Zheyuan, et al.
Published: (2025)
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
by: Chen, Shuhao, et al.
Published: (2024)
by: Chen, Shuhao, et al.
Published: (2024)
Layerwise Recurrent Router for Mixture-of-Experts
by: Qiu, Zihan, et al.
Published: (2024)
by: Qiu, Zihan, et al.
Published: (2024)
To Aggregate or Not to Aggregate. That is the Question: A Case Study on Annotation Subjectivity in Span Prediction
by: Kurniawan, Kemal, et al.
Published: (2024)
by: Kurniawan, Kemal, et al.
Published: (2024)
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
by: Lv, Ang, et al.
Published: (2025)
by: Lv, Ang, et al.
Published: (2025)
Toward Super Agent System with Hybrid AI Routers
by: Yao, Yuhang, et al.
Published: (2025)
by: Yao, Yuhang, et al.
Published: (2025)
BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval
by: Chen, Yinda, et al.
Published: (2024)
by: Chen, Yinda, et al.
Published: (2024)
Zero-Shot Conversational Stance Detection: Dataset and Approaches
by: Ding, Yuzhe, et al.
Published: (2025)
by: Ding, Yuzhe, et al.
Published: (2025)
POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation
by: Qiu, Zeju, et al.
Published: (2026)
by: Qiu, Zeju, et al.
Published: (2026)
S2D2: Fast Decoding for Diffusion LLMs via Training-Free Self-Speculation
by: Han, Ligong, et al.
Published: (2026)
by: Han, Ligong, et al.
Published: (2026)
GRIP: In-Parameter Graph Reasoning through Fine-Tuning Large Language Models
by: Feng, Jiarui, et al.
Published: (2025)
by: Feng, Jiarui, et al.
Published: (2025)
Glider: Global and Local Instruction-Driven Expert Router
by: Li, Pingzhi, et al.
Published: (2024)
by: Li, Pingzhi, et al.
Published: (2024)
Enhancing Confidence Expression in Large Language Models Through Learning from Past Experience
by: Han, Haixia, et al.
Published: (2024)
by: Han, Haixia, et al.
Published: (2024)
Part-Of-Speech Sensitivity of Routers in Mixture of Experts Models
by: Antoine, Elie, et al.
Published: (2024)
by: Antoine, Elie, et al.
Published: (2024)
OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning
by: Ye, Rui, et al.
Published: (2024)
by: Ye, Rui, et al.
Published: (2024)
ROMER: Expert Replacement and Router Calibration for Robust MoE LLMs on Analog Compute-in-Memory Systems
by: Zhou, Wenyong, et al.
Published: (2026)
by: Zhou, Wenyong, et al.
Published: (2026)
RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs
by: Huang, Zhongzhan, et al.
Published: (2025)
by: Huang, Zhongzhan, et al.
Published: (2025)
When to Reason: Semantic Router for vLLM
by: Wang, Chen, et al.
Published: (2025)
by: Wang, Chen, et al.
Published: (2025)
LLM Router: Rethinking Routing with Prefill Activations
by: Varshney, Tanay, et al.
Published: (2026)
by: Varshney, Tanay, et al.
Published: (2026)
ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces
by: Xu, Xin, et al.
Published: (2026)
by: Xu, Xin, et al.
Published: (2026)
Similar Items
-
SLIM: Let LLM Learn More and Forget Less with Soft LoRA and Identity Mixture
by: Han, Jiayi, et al.
Published: (2024) -
CP-Router: An Uncertainty-Aware Router Between LLM and LRM
by: Su, Jiayuan, et al.
Published: (2025) -
FURINA: A Fully Customizable Role-Playing Benchmark via Scalable Multi-Agent Collaboration Pipeline
by: Wu, Haotian, et al.
Published: (2025) -
Rethinking Precision of Pseudo Label: Test-Time Adaptation via Complementary Learning
by: Han, Jiayi, et al.
Published: (2023) -
DeepSieve: Information Sieving via LLM-as-a-Knowledge-Router
by: Guo, Minghao, et al.
Published: (2025)