Saved in:
| Main Authors: | Qiao, Haoyu, Zhang, Hao, Mao, Shanwen, Cheng, Siyao, Liu, Jie |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.21237 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Guarded Query Routing for Large Language Models
by: Šléher, Richard, et al.
Published: (2025)
by: Šléher, Richard, et al.
Published: (2025)
Beyond Query Memorization: Large Language Model Routing with Query Decomposition and Historical Matching
by: Lv, Bo, et al.
Published: (2026)
by: Lv, Bo, et al.
Published: (2026)
RACER: Risk-Aware Calibrated Efficient Routing for Large Language Models
by: Hao, Sai, et al.
Published: (2026)
by: Hao, Sai, et al.
Published: (2026)
SynapseRoute: An Auto-Route Switching Framework on Dual-State Large Language Model
by: Zhang, Wencheng, et al.
Published: (2025)
by: Zhang, Wencheng, et al.
Published: (2025)
Learning to Route Queries to Heads for Attention-based Re-ranking with Large Language Models
by: Tian, Yuxing, et al.
Published: (2026)
by: Tian, Yuxing, et al.
Published: (2026)
CR^2: Cost-Aware Risk-Controlled Routing for Wireless Device-Edge LLM Inference
by: Xue, Nan, et al.
Published: (2026)
by: Xue, Nan, et al.
Published: (2026)
Dynamic Quality-Latency Aware Routing for LLM Inference in Wireless Edge-Device Networks
by: Bao, Rui, et al.
Published: (2025)
by: Bao, Rui, et al.
Published: (2025)
Query Routing for Homogeneous Tools: An Instantiation in the RAG Scenario
by: Mu, Feiteng, et al.
Published: (2024)
by: Mu, Feiteng, et al.
Published: (2024)
ARS: Automatic Routing Solver with Large Language Models
by: Li, Kai, et al.
Published: (2025)
by: Li, Kai, et al.
Published: (2025)
RealRoute: Dynamic Query Routing System via Retrieve-then-Verify Paradigm
by: Liu, Jiahe, et al.
Published: (2026)
by: Liu, Jiahe, et al.
Published: (2026)
SinkRouter: Sink-Aware Routing for Efficient Long-Context Decoding in Large Language and Multimodal Models
by: Liu, Junnan, et al.
Published: (2026)
by: Liu, Junnan, et al.
Published: (2026)
AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models
by: Zeng, Zihao, et al.
Published: (2024)
by: Zeng, Zihao, et al.
Published: (2024)
Robust Batch-Level Query Routing for Large Language Models under Cost and Capacity Constraints
by: Markovic-Voronov, Jelena, et al.
Published: (2026)
by: Markovic-Voronov, Jelena, et al.
Published: (2026)
Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory
by: Zhang, Haozhen, et al.
Published: (2026)
by: Zhang, Haozhen, et al.
Published: (2026)
Energy-Aware Routing to Large Reasoning Models
by: Ellis-Mohr, Austin R., et al.
Published: (2025)
by: Ellis-Mohr, Austin R., et al.
Published: (2025)
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing
by: Ding, Dujian, et al.
Published: (2024)
by: Ding, Dujian, et al.
Published: (2024)
Multi-Objective Infeasibility Diagnosis for Routing Problems Using Large Language Models
by: Li, Kai, et al.
Published: (2025)
by: Li, Kai, et al.
Published: (2025)
PickLLM: Context-Aware RL-Assisted Large Language Model Routing
by: Sikeridis, Dimitrios, et al.
Published: (2024)
by: Sikeridis, Dimitrios, et al.
Published: (2024)
VideoRouter: Query-Adaptive Dual Routing for Efficient Long-Video Understanding
by: Lin, Kuanwei, et al.
Published: (2026)
by: Lin, Kuanwei, et al.
Published: (2026)
Plan before Solving: Problem-Aware Strategy Routing for Mathematical Reasoning with LLMs
by: Qi, Shihao, et al.
Published: (2025)
by: Qi, Shihao, et al.
Published: (2025)
Model Routing as a Trust Problem: Route Receipts for Adaptive AI Systems
by: Schmalbach, Vincent
Published: (2026)
by: Schmalbach, Vincent
Published: (2026)
Learning to Solve Compositional Geometry Routing Problems
by: Fan, Mingfeng, et al.
Published: (2026)
by: Fan, Mingfeng, et al.
Published: (2026)
Evolutionary Retrosynthetic Route Planning
by: Zhang, Yan, et al.
Published: (2023)
by: Zhang, Yan, et al.
Published: (2023)
MixLLM: Dynamic Routing in Mixed Large Language Models
by: Wang, Xinyuan, et al.
Published: (2025)
by: Wang, Xinyuan, et al.
Published: (2025)
RouteHijack: Routing-Aware Attack on Mixture-of-Experts LLMs
by: Xu, Zhiyuan, et al.
Published: (2026)
by: Xu, Zhiyuan, et al.
Published: (2026)
DynamicRouteGPT: A Real-Time Multi-Vehicle Dynamic Navigation Framework Based on Large Language Models
by: Zhou, Ziai, et al.
Published: (2024)
by: Zhou, Ziai, et al.
Published: (2024)
Leveraging the Power of Large Language Models in Entity Linking via Adaptive Routing and Targeted Reasoning
by: Li, Yajie, et al.
Published: (2025)
by: Li, Yajie, et al.
Published: (2025)
SymRAG: Efficient Neuro-Symbolic Retrieval Through Adaptive Query Routing
by: Hakim, Safayat Bin, et al.
Published: (2025)
by: Hakim, Safayat Bin, et al.
Published: (2025)
Route-and-Reason: Scaling Large Language Model Reasoning with Reinforced Model Router
by: Shao, Chenyang, et al.
Published: (2025)
by: Shao, Chenyang, et al.
Published: (2025)
Learning to Route: Per-Sample Adaptive Routing for Multimodal Multitask Prediction
by: Ajirak, Marzieh, et al.
Published: (2025)
by: Ajirak, Marzieh, et al.
Published: (2025)
Trust-Aware Routing for Distributed Generative AI Inference at the Edge
by: Nguyen, Chanh, et al.
Published: (2026)
by: Nguyen, Chanh, et al.
Published: (2026)
Routing-Based Continual Learning for Multimodal Large Language Models
by: Mohta, Jay, et al.
Published: (2025)
by: Mohta, Jay, et al.
Published: (2025)
Routoo: Learning to Route to Large Language Models Effectively
by: Mohammadshahi, Alireza, et al.
Published: (2024)
by: Mohammadshahi, Alireza, et al.
Published: (2024)
ACE-Router: Generalizing History-Aware Routing from MCP Tools to the Agent Web
by: Yao, Zhiyuan, et al.
Published: (2026)
by: Yao, Zhiyuan, et al.
Published: (2026)
Input Domain Aware MoE: Decoupling Routing Decisions from Task Optimization in Mixture of Experts
by: Hua, Yongxiang, et al.
Published: (2025)
by: Hua, Yongxiang, et al.
Published: (2025)
Can Large Language Models Solve Robot Routing?
by: Huang, Zhehui, et al.
Published: (2024)
by: Huang, Zhehui, et al.
Published: (2024)
BEST-Route: Adaptive LLM Routing with Test-Time Optimal Compute
by: Ding, Dujian, et al.
Published: (2025)
by: Ding, Dujian, et al.
Published: (2025)
MTRouter: Cost-Aware Multi-Turn LLM Routing with History-Model Joint Embeddings
by: Zhang, Yiqun, et al.
Published: (2026)
by: Zhang, Yiqun, et al.
Published: (2026)
Resilient Routing: Risk-Aware Dynamic Routing in Smart Logistics via Spatiotemporal Graph Learning
by: Xue, Zhiming, et al.
Published: (2026)
by: Xue, Zhiming, et al.
Published: (2026)
RouteFinder: Towards Foundation Models for Vehicle Routing Problems
by: Berto, Federico, et al.
Published: (2024)
by: Berto, Federico, et al.
Published: (2024)
Similar Items
-
Guarded Query Routing for Large Language Models
by: Šléher, Richard, et al.
Published: (2025) -
Beyond Query Memorization: Large Language Model Routing with Query Decomposition and Historical Matching
by: Lv, Bo, et al.
Published: (2026) -
RACER: Risk-Aware Calibrated Efficient Routing for Large Language Models
by: Hao, Sai, et al.
Published: (2026) -
SynapseRoute: An Auto-Route Switching Framework on Dual-State Large Language Model
by: Zhang, Wencheng, et al.
Published: (2025) -
Learning to Route Queries to Heads for Attention-based Re-ranking with Large Language Models
by: Tian, Yuxing, et al.
Published: (2026)