| Main Authors: | Zhao, Jiuzhou, Chen, Chunrong, Qiao, Chenqi, Zheng, Lebin, Han, Minqi, Zhang, Yanchi Liu Yongzhou Xu Xiaochuan Xu Min |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2601.04544 |
Similar Items
TensorOpera Router: A Multi-Model Router for Efficient LLM Inference
by: Stripelis, Dimitris, et al.
Published: (2024)
LightRouter: Towards Efficient LLM Collaboration with Minimal Overhead
by: Zhang, Yifan, et al.
Published: (2025)
Mixture of Routers
by: Zhang, Jia-Chen, et al.
Published: (2025)
GraphRouter: A Graph-based Router for LLM Selections
by: Feng, Tao, et al.
Published: (2024)
Route-and-Reason: Scaling Large Language Model Reasoning with Reinforced Model Router
by: Shao, Chenyang, et al.
Published: (2025)
When to Reason: Semantic Router for vLLM
by: Wang, Chen, et al.
Published: (2025)
Towards Fair and Comprehensive Evaluation of Routers in Collaborative LLM Systems
by: Wu, Wanxing, et al.
Published: (2026)
ACE-Router: Generalizing History-Aware Routing from MCP Tools to the Agent Web
by: Yao, Zhiyuan, et al.
Published: (2026)
Toward Super Agent System with Hybrid AI Routers
by: Yao, Yuhang, et al.
Published: (2025)
GlimpRouter: Efficient Collaborative Inference by Glimpsing One Token of Thoughts
by: Zeng, Wenhao, et al.
Published: (2026)
Lodestar: An Online-Learning LLM Inference Router
by: Lim, Gangmuk, et al.
Published: (2026)
OrcaRouter: A Production-Oriented LLM Router with Hybrid Offline-Online Learning
by: Bao, Zhenghua, et al.
Published: (2026)
When Routing Collapses: On the Degenerate Convergence of LLM Routers
by: Lai, Guannan, et al.
Published: (2026)
IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory
by: Song, Wei, et al.
Published: (2025)
MPD$^2$-Router: Mask-aware Multi-expert Prior-regularized Dual-head Deferral Router in Glaucoma Screening and Diagnosis
by: Zhan, Wenxin
Published: (2026)
SRSA: A Cost-Efficient Strategy-Router Search Agent for Real-world Human-Machine Interactions
by: Wang, Yaqi, et al.
Published: (2024)
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
by: Chen, Shuhao, et al.
Published: (2024)
Mixture-of-Schedulers: An Adaptive Scheduling Agent as a Learned Router for Expert Policies
by: Wang, Xinbo, et al.
Published: (2025)
ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces
by: Xu, Xin, et al.
Published: (2026)
SecureRouter: Encrypted Routing for Efficient Secure Inference
by: Zhang, Yukuan, et al.
Published: (2026)
VerifiAgent: a Unified Verification Agent in Language Model Reasoning
by: Han, Jiuzhou, et al.
Published: (2025)
Optimus-3: Dual-Router Aligned Mixture-of-Experts Agent with Dual-Granularity Reasoning-Aware Policy Optimization
by: Li, Zaijing, et al.
Published: (2025)
ViTE: Virtual Graph Trajectory Expert Router for Pedestrian Trajectory Prediction
by: Li, Ruochen, et al.
Published: (2025)
Yuan 2.0-M32: Mixture of Experts with Attention Router
by: Wu, Shaohua, et al.
Published: (2024)
RouterBench: A Benchmark for Multi-LLM Routing System
by: Hu, Qitian Jason, et al.
Published: (2024)
MemRouter: Memory-as-Embedding Routing for Long-Term Conversational Agents
by: Hu, Tianyu, et al.
Published: (2026)
Olympus: A Universal Task Router for Computer Vision Tasks
by: Lin, Yuanze, et al.
Published: (2024)
Universal Multi-Domain Translation via Diffusion Routers
by: Kieu, Duc, et al.
Published: (2025)
RCR-Router: Efficient Role-Aware Context Routing for Multi-Agent LLM Systems with Structured Memory
by: Liu, Jun, et al.
Published: (2025)
Life-Cycle Routing Vulnerabilities of LLM Router
by: Lin, Qiqi, et al.
Published: (2025)
AudioRouter: Data Efficient Audio Understanding via RL based Dual Reasoning
by: Chen, Liyang, et al.
Published: (2026)
Routers in Vision Mixture of Experts: An Empirical Study
by: Liu, Tianlin, et al.
Published: (2024)
RASER: Recoverability-Aware Selective Escalation Router for Multi-Hop Question Answering
by: Li, Yuyang, et al.
Published: (2026)
ICL-Router: In-Context Learned Model Representations for LLM Routing
by: Wang, Chenxu, et al.
Published: (2025)
VideoRouter: Query-Adaptive Dual Routing for Efficient Long-Video Understanding
by: Lin, Kuanwei, et al.
Published: (2026)
Switchcraft: AI Model Router for Agentic Tool Calling
by: Agarwal, Sharad, et al.
Published: (2026)
Attending to Routers Aids Indoor Wireless Localization
by: Roy, Ayush, et al.
Published: (2026)
Performance Characterization of Expert Router for Scalable LLM Inference
by: Pichlmeier, Josef, et al.
Published: (2024)
MViewRouter: Internalizing Geometric Equivariance via Multi-view Alternating Attention for Combinatorial Routing
by: Liu, Shiyan, et al.
Published: (2026)
State of AI: An Empirical 100 Trillion Token Study with OpenRouter
by: Aubakirova, Malika, et al.
Published: (2026)