| Main Authors: | Xu, Zhongling, Zheng, Shunan, Wang, Wei |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.25424 |
Similar Items
Preserve Support, Not Correspondence: Dynamic Routing for Offline Reinforcement Learning
by: Mu, Zhancun, et al.
Published: (2026)
Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory
by: Zhang, Haozhen, et al.
Published: (2026)
RouteLLM: Learning to Route LLMs with Preference Data
by: Ong, Isaac, et al.
Published: (2024)
RouteHijack: Routing-Aware Attack on Mixture-of-Experts LLMs
by: Xu, Zhiyuan, et al.
Published: (2026)
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
by: Lin, Qian, et al.
Published: (2023)
LoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solvers
by: Li, Jintao, et al.
Published: (2026)
Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
by: Liu, Zongkai, et al.
Published: (2024)
PR2: Predictive Routing Replay for MoE-Based LLM Reinforcement Learning
by: Dong, Daize, et al.
Published: (2026)
Adaptive Policy Selection and Fine-Tuning under Interaction Budgets for Offline-to-Online Reinforcement Learning
by: Bozkurt, Alper Kamil, et al.
Published: (2026)
Deep Reinforcement Learning for Multi-Truck Vehicle Routing Problems with Multi-Leg Demand Routes
by: Levin, Joshua, et al.
Published: (2024)
TabSeq: A Framework for Deep Learning on Tabular Data via Sequential Ordering
by: Habib, Al Zadid Sultan Bin, et al.
Published: (2024)
TGB-Seq Benchmark: Challenging Temporal GNNs with Complex Sequential Dynamics
by: Yi, Lu, et al.
Published: (2025)
Enhancing Robustness of Offline Reinforcement Learning Under Data Corruption via Sharpness-Aware Minimization
by: Xu, Le, et al.
Published: (2025)
IPD: Boosting Sequential Policy with Imaginary Planning Distillation in Offline Reinforcement Learning
by: Qin, Yihao, et al.
Published: (2026)
Deep Reinforcement Learning for Picker Routing Problem in Warehousing
by: Dunn, George, et al.
Published: (2024)
In-Context Compositional Q-Learning for Offline Reinforcement Learning
by: Xu, Qiushui, et al.
Published: (2025)
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing
by: Ding, Dujian, et al.
Published: (2024)
Spatial-Temporal Reinforcement Learning for Network Routing with Non-Markovian Traffic
by: Wang, Molly, et al.
Published: (2025)
Beyond Hard Constraints: Budget-Conditioned Reachability For Safe Offline Reinforcement Learning
by: Brahmanage, Janaka Chathuranga, et al.
Published: (2026)
ICL-Router: In-Context Learned Model Representations for LLM Routing
by: Wang, Chenxu, et al.
Published: (2025)
More Than Routing: Joint GPS and Route Modeling for Refine Trajectory Representation Learning
by: Ma, Zhipeng, et al.
Published: (2024)
PickLLM: Context-Aware RL-Assisted Large Language Model Routing
by: Sikeridis, Dimitrios, et al.
Published: (2024)
Budgeting Counterfactual for Offline RL
by: Liu, Yao, et al.
Published: (2023)
ARROW: An Adaptive Rollout and Routing Method for Global Weather Forecasting
by: Tian, Jindong, et al.
Published: (2025)
Learning to Route: Per-Sample Adaptive Routing for Multimodal Multitask Prediction
by: Ajirak, Marzieh, et al.
Published: (2025)
Advancing Routing-Awareness in Analog ICs Floorplanning
by: Basso, Davide, et al.
Published: (2025)
BEST-Route: Adaptive LLM Routing with Test-Time Optimal Compute
by: Ding, Dujian, et al.
Published: (2025)
SAMG: Offline-to-Online Reinforcement Learning via State-Action-Conditional Offline Model Guidance
by: Zhang, Liyu, et al.
Published: (2024)
SkillOrchestra: Learning to Route Agents via Skill Transfer
by: Wang, Jiayu, et al.
Published: (2026)
Reinforcement Learning for Multi-Truck Vehicle Routing Problems
by: Levin, Joshua, et al.
Published: (2022)
Tackling Data Corruption in Offline Reinforcement Learning via Sequence Modeling
by: Xu, Jiawei, et al.
Published: (2024)
Trust by Design: Skill Profiles for Transparent, Cost-Aware LLM Routing
by: Okamoto, Mika, et al.
Published: (2026)
Deep Reinforcement Learning for Solving the Fleet Size and Mix Vehicle Routing Problem
by: Wan, Pengfu, et al.
Published: (2025)
Localized Dynamics-Aware Domain Adaption for Off-Dynamics Offline Reinforcement Learning
by: Xia, Zhangjie, et al.
Published: (2026)
Token-Level LLM Collaboration via FusionRoute
by: Xiong, Nuoya, et al.
Published: (2026)
Prompt Learning for Generalized Vehicle Routing
by: Liu, Fei, et al.
Published: (2024)
RADAR: Reasoning-Ability and Difficulty-Aware Routing for Reasoning LLMs
by: Fernandez, Nigel, et al.
Published: (2025)
Offline Reinforcement Learning for LLM Multi-Step Reasoning
by: Wang, Huaijie, et al.
Published: (2024)
Offline Trajectory Optimization for Offline Reinforcement Learning
by: Zhao, Ziqi, et al.
Published: (2024)
Permutation Equivariant Model-based Offline Reinforcement Learning for Auto-bidding
by: Mou, Zhiyu, et al.
Published: (2025)