:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Xu, Zhongling, Zheng, Shunan, Wang, Wei
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.25424
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Preserve Support, Not Correspondence: Dynamic Routing for Offline Reinforcement Learning
by: Mu, Zhancun, et al.
Published: (2026)

Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory
by: Zhang, Haozhen, et al.
Published: (2026)

RouteLLM: Learning to Route LLMs with Preference Data
by: Ong, Isaac, et al.
Published: (2024)

RouteHijack: Routing-Aware Attack on Mixture-of-Experts LLMs
by: Xu, Zhiyuan, et al.
Published: (2026)

Safe Offline Reinforcement Learning with Real-Time Budget Constraints
by: Lin, Qian, et al.
Published: (2023)

LoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solvers
by: Li, Jintao, et al.
Published: (2026)

Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
by: Liu, Zongkai, et al.
Published: (2024)

PR2: Predictive Routing Replay for MoE-Based LLM Reinforcement Learning
by: Dong, Daize, et al.
Published: (2026)

Adaptive Policy Selection and Fine-Tuning under Interaction Budgets for Offline-to-Online Reinforcement Learning
by: Bozkurt, Alper Kamil, et al.
Published: (2026)

Deep Reinforcement Learning for Multi-Truck Vehicle Routing Problems with Multi-Leg Demand Routes
by: Levin, Joshua, et al.
Published: (2024)

TabSeq: A Framework for Deep Learning on Tabular Data via Sequential Ordering
by: Habib, Al Zadid Sultan Bin, et al.
Published: (2024)

TGB-Seq Benchmark: Challenging Temporal GNNs with Complex Sequential Dynamics
by: Yi, Lu, et al.
Published: (2025)

Enhancing Robustness of Offline Reinforcement Learning Under Data Corruption via Sharpness-Aware Minimization
by: Xu, Le, et al.
Published: (2025)

IPD: Boosting Sequential Policy with Imaginary Planning Distillation in Offline Reinforcement Learning
by: Qin, Yihao, et al.
Published: (2026)

Deep Reinforcement Learning for Picker Routing Problem in Warehousing
by: Dunn, George, et al.
Published: (2024)

In-Context Compositional Q-Learning for Offline Reinforcement Learning
by: Xu, Qiushui, et al.
Published: (2025)

Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing
by: Ding, Dujian, et al.
Published: (2024)

Spatial-Temporal Reinforcement Learning for Network Routing with Non-Markovian Traffic
by: Wang, Molly, et al.
Published: (2025)

Beyond Hard Constraints: Budget-Conditioned Reachability For Safe Offline Reinforcement Learning
by: Brahmanage, Janaka Chathuranga, et al.
Published: (2026)

ICL-Router: In-Context Learned Model Representations for LLM Routing
by: Wang, Chenxu, et al.
Published: (2025)

More Than Routing: Joint GPS and Route Modeling for Refine Trajectory Representation Learning
by: Ma, Zhipeng, et al.
Published: (2024)

PickLLM: Context-Aware RL-Assisted Large Language Model Routing
by: Sikeridis, Dimitrios, et al.
Published: (2024)

Budgeting Counterfactual for Offline RL
by: Liu, Yao, et al.
Published: (2023)

ARROW: An Adaptive Rollout and Routing Method for Global Weather Forecasting
by: Tian, Jindong, et al.
Published: (2025)

Learning to Route: Per-Sample Adaptive Routing for Multimodal Multitask Prediction
by: Ajirak, Marzieh, et al.
Published: (2025)

Advancing Routing-Awareness in Analog ICs Floorplanning
by: Basso, Davide, et al.
Published: (2025)

BEST-Route: Adaptive LLM Routing with Test-Time Optimal Compute
by: Ding, Dujian, et al.
Published: (2025)

SAMG: Offline-to-Online Reinforcement Learning via State-Action-Conditional Offline Model Guidance
by: Zhang, Liyu, et al.
Published: (2024)

SkillOrchestra: Learning to Route Agents via Skill Transfer
by: Wang, Jiayu, et al.
Published: (2026)

Reinforcement Learning for Multi-Truck Vehicle Routing Problems
by: Levin, Joshua, et al.
Published: (2022)

Tackling Data Corruption in Offline Reinforcement Learning via Sequence Modeling
by: Xu, Jiawei, et al.
Published: (2024)

Trust by Design: Skill Profiles for Transparent, Cost-Aware LLM Routing
by: Okamoto, Mika, et al.
Published: (2026)

Deep Reinforcement Learning for Solving the Fleet Size and Mix Vehicle Routing Problem
by: Wan, Pengfu, et al.
Published: (2025)

Localized Dynamics-Aware Domain Adaption for Off-Dynamics Offline Reinforcement Learning
by: Xia, Zhangjie, et al.
Published: (2026)

Token-Level LLM Collaboration via FusionRoute
by: Xiong, Nuoya, et al.
Published: (2026)

Prompt Learning for Generalized Vehicle Routing
by: Liu, Fei, et al.
Published: (2024)

RADAR: Reasoning-Ability and Difficulty-Aware Routing for Reasoning LLMs
by: Fernandez, Nigel, et al.
Published: (2025)

Offline Reinforcement Learning for LLM Multi-Step Reasoning
by: Wang, Huaijie, et al.
Published: (2024)

Offline Trajectory Optimization for Offline Reinforcement Learning
by: Zhao, Ziqi, et al.
Published: (2024)

Permutation Equivariant Model-based Offline Reinforcement Learning for Auto-bidding
by: Mou, Zhiyu, et al.
Published: (2025)