Saved in:
Bibliographic Details
Main Authors: Zhou, Shizhong, Liu, Haifeng, Zhang, Zheng, Zhang, Shiyu, Yang, Bo, Lin, Yi
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.08754
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916003159474176
author Zhou, Shizhong
Liu, Haifeng
Zhang, Zheng
Zhang, Shiyu
Yang, Bo
Lin, Yi
author_facet Zhou, Shizhong
Liu, Haifeng
Zhang, Zheng
Zhang, Shiyu
Yang, Bo
Lin, Yi
contents Taxiway routing and on-surface conflict avoidance are coupled safety-critical decision problems in airport surface operations. Existing planning and optimization methods are often limited by online computational cost, while reinforcement learning methods may struggle to represent downstream traffic conflicts and balance multiple objectives. This paper presents Conflict-aware Taxiway Routing (CaTR), a reinforcement learning framework for real-time multi-aircraft taxiway routing. CaTR constructs a grid-based airport surface environment with action masking, introduces a hierarchical foresight traffic representation to encode current and downstream conflict-related traffic conditions, and adopts a value-decomposed reinforcement learning strategy to prioritize sparse but safety-critical objectives. Experiments are conducted on a realistic environment based on Changsha Huanghua International Airport under multiple traffic density levels. Results show that CaTR achieves better safety--efficiency trade-offs than representative planning, optimization, and reinforcement learning baselines while maintaining practical runtime.
format Preprint
id arxiv_https___arxiv_org_abs_2605_08754
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Value-Decomposed Reinforcement Learning Framework for Taxiway Routing with Hierarchical Conflict-Aware Observations
Zhou, Shizhong
Liu, Haifeng
Zhang, Zheng
Zhang, Shiyu
Yang, Bo
Lin, Yi
Artificial Intelligence
Taxiway routing and on-surface conflict avoidance are coupled safety-critical decision problems in airport surface operations. Existing planning and optimization methods are often limited by online computational cost, while reinforcement learning methods may struggle to represent downstream traffic conflicts and balance multiple objectives. This paper presents Conflict-aware Taxiway Routing (CaTR), a reinforcement learning framework for real-time multi-aircraft taxiway routing. CaTR constructs a grid-based airport surface environment with action masking, introduces a hierarchical foresight traffic representation to encode current and downstream conflict-related traffic conditions, and adopts a value-decomposed reinforcement learning strategy to prioritize sparse but safety-critical objectives. Experiments are conducted on a realistic environment based on Changsha Huanghua International Airport under multiple traffic density levels. Results show that CaTR achieves better safety--efficiency trade-offs than representative planning, optimization, and reinforcement learning baselines while maintaining practical runtime.
title Value-Decomposed Reinforcement Learning Framework for Taxiway Routing with Hierarchical Conflict-Aware Observations
topic Artificial Intelligence
url https://arxiv.org/abs/2605.08754