:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Han, Jiayi, Du, Liang, Chen, Yinda, Kang, Xiao, Ding, Weiyang, Han, Donghong
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2509.14900
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SLIM: Let LLM Learn More and Forget Less with Soft LoRA and Identity Mixture
by: Han, Jiayi, et al.
Published: (2024)

CP-Router: An Uncertainty-Aware Router Between LLM and LRM
by: Su, Jiayuan, et al.
Published: (2025)

FURINA: A Fully Customizable Role-Playing Benchmark via Scalable Multi-Agent Collaboration Pipeline
by: Wu, Haotian, et al.
Published: (2025)

Rethinking Precision of Pseudo Label: Test-Time Adaptation via Complementary Learning
by: Han, Jiayi, et al.
Published: (2023)

DeepSieve: Information Sieving via LLM-as-a-Knowledge-Router
by: Guo, Minghao, et al.
Published: (2025)

DALR: Dual-level Alignment Learning for Multimodal Sentence Representation Learning
by: He, Kang, et al.
Published: (2025)

Enhance-then-Balance Modality Collaboration for Robust Multimodal Sentiment Analysis
by: He, Kang, et al.
Published: (2026)

PRCCF: A Persona-guided Retrieval and Causal-aware Cognitive Filtering Framework for Emotional Support Conversation
by: Luo, Yanxin, et al.
Published: (2026)

Mixture of Routers
by: Zhang, Jia-Chen, et al.
Published: (2025)

Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
by: Zhang, Haozhen, et al.
Published: (2025)

WebRouter: Query-specific Router via Variational Information Bottleneck for Cost-sensitive Web Agent
by: Li, Tao, et al.
Published: (2025)

Router Upcycling: Leveraging Mixture-of-Routers in Mixture-of-Experts Upcycling
by: Ran, Junfeng, et al.
Published: (2025)

TabDLM: Free-Form Tabular Data Generation via Joint Numerical-Language Diffusion
by: Cai, Donghong, et al.
Published: (2026)

SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation
by: Le, Chenyang, et al.
Published: (2025)

Improving embedding with contrastive fine-tuning on small datasets with expert-augmented scores
by: Lu, Jun, et al.
Published: (2024)

Embedding Inversion via Conditional Masked Diffusion Language Models
by: Xiao, Han
Published: (2026)

Efficient Model-agnostic Alignment via Bayesian Persuasion
by: Bai, Fengshuo, et al.
Published: (2024)

Find Your Optimal Teacher: Personalized Data Synthesis via Router-Guided Multi-Teacher Distillation
by: Zhang, Hengyuan, et al.
Published: (2025)

OrcaRouter: A Production-Oriented LLM Router with Hybrid Offline-Online Learning
by: Bao, Zhenghua, et al.
Published: (2026)

Arch-Router: Aligning LLM Routing with Human Preferences
by: Tran, Co, et al.
Published: (2025)

AgentRouter: A Knowledge-Graph-Guided LLM Router for Collaborative Multi-Agent Question Answering
by: Zhang, Zheyuan, et al.
Published: (2025)

RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
by: Chen, Shuhao, et al.
Published: (2024)

Layerwise Recurrent Router for Mixture-of-Experts
by: Qiu, Zihan, et al.
Published: (2024)

To Aggregate or Not to Aggregate. That is the Question: A Case Study on Annotation Subjectivity in Span Prediction
by: Kurniawan, Kemal, et al.
Published: (2024)

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
by: Lv, Ang, et al.
Published: (2025)

Toward Super Agent System with Hybrid AI Routers
by: Yao, Yuhang, et al.
Published: (2025)

BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval
by: Chen, Yinda, et al.
Published: (2024)

Zero-Shot Conversational Stance Detection: Dataset and Approaches
by: Ding, Yuzhe, et al.
Published: (2025)

POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation
by: Qiu, Zeju, et al.
Published: (2026)

S2D2: Fast Decoding for Diffusion LLMs via Training-Free Self-Speculation
by: Han, Ligong, et al.
Published: (2026)

GRIP: In-Parameter Graph Reasoning through Fine-Tuning Large Language Models
by: Feng, Jiarui, et al.
Published: (2025)

Glider: Global and Local Instruction-Driven Expert Router
by: Li, Pingzhi, et al.
Published: (2024)

Enhancing Confidence Expression in Large Language Models Through Learning from Past Experience
by: Han, Haixia, et al.
Published: (2024)

Part-Of-Speech Sensitivity of Routers in Mixture of Experts Models
by: Antoine, Elie, et al.
Published: (2024)

OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning
by: Ye, Rui, et al.
Published: (2024)

ROMER: Expert Replacement and Router Calibration for Robust MoE LLMs on Analog Compute-in-Memory Systems
by: Zhou, Wenyong, et al.
Published: (2026)

RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs
by: Huang, Zhongzhan, et al.
Published: (2025)

When to Reason: Semantic Router for vLLM
by: Wang, Chen, et al.
Published: (2025)

LLM Router: Rethinking Routing with Prefill Activations
by: Varshney, Tanay, et al.
Published: (2026)

ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces
by: Xu, Xin, et al.
Published: (2026)