Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Pang, Yutian, Kendall, Andrew Paul, Porcayo, Alex, Barsotti, Mariah, Jain, Anahita, Clarke, John-Paul
Format:	Preprint
Published:	2025
Subjects:	Audio and Speech Processing Sound
Online Access:	https://arxiv.org/abs/2503.04974
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909815767302144
author	Pang, Yutian Kendall, Andrew Paul Porcayo, Alex Barsotti, Mariah Jain, Anahita Clarke, John-Paul
author_facet	Pang, Yutian Kendall, Andrew Paul Porcayo, Alex Barsotti, Mariah Jain, Anahita Clarke, John-Paul
contents	This work provides a feasible solution to the existing airport surface safety monitoring capabilities (i.e., Airport Surface Surveillance Capability (ASSC)), namely language AI-based voice communication understanding for collision risk assessment. The proposed framework consists of two major parts, (a) rule-enhanced Named Entity Recognition (NER); (b) surface collision risk modeling. NER module generates information tables by processing voice communication transcripts, which serve as references for producing potential taxi plans and calculating the surface movement collision risk. We first collect and annotate our dataset based on open-sourced video recordings and safety investigation reports. Additionally, we refer to FAA Order JO 7110.65W and FAA Order JO 7340.2N to get the list of heuristic rules and phase contractions of communication between the pilot and the Air Traffic Controller (ATCo). Then, we propose the novel ATC Rule-Enhanced NER method, which integrates the heuristic rules into the model training and inference stages, resulting in a hybrid rule-based NER model. We show the effectiveness of this hybrid approach by comparing different setups with different token-level embedding models. For the risk modeling, we adopt the node-link airport layout graph from NASA FACET and model the aircraft taxi speed at each link as a log-normal distribution and derive the total taxi time distribution. Then, we propose a spatiotemporal formulation of the risk probability of two aircraft moving across potential collision nodes during ground movement. Furthermore, we propose the real-time implementation of such a method to obtain the lead time, with a comparison with a Petri-Net based method.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_04974
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	From Voice to Safety: Language AI Powered Pilot-ATC Communication Understanding for Airport Surface Movement Collision Risk Assessment Pang, Yutian Kendall, Andrew Paul Porcayo, Alex Barsotti, Mariah Jain, Anahita Clarke, John-Paul Audio and Speech Processing Sound This work provides a feasible solution to the existing airport surface safety monitoring capabilities (i.e., Airport Surface Surveillance Capability (ASSC)), namely language AI-based voice communication understanding for collision risk assessment. The proposed framework consists of two major parts, (a) rule-enhanced Named Entity Recognition (NER); (b) surface collision risk modeling. NER module generates information tables by processing voice communication transcripts, which serve as references for producing potential taxi plans and calculating the surface movement collision risk. We first collect and annotate our dataset based on open-sourced video recordings and safety investigation reports. Additionally, we refer to FAA Order JO 7110.65W and FAA Order JO 7340.2N to get the list of heuristic rules and phase contractions of communication between the pilot and the Air Traffic Controller (ATCo). Then, we propose the novel ATC Rule-Enhanced NER method, which integrates the heuristic rules into the model training and inference stages, resulting in a hybrid rule-based NER model. We show the effectiveness of this hybrid approach by comparing different setups with different token-level embedding models. For the risk modeling, we adopt the node-link airport layout graph from NASA FACET and model the aircraft taxi speed at each link as a log-normal distribution and derive the total taxi time distribution. Then, we propose a spatiotemporal formulation of the risk probability of two aircraft moving across potential collision nodes during ground movement. Furthermore, we propose the real-time implementation of such a method to obtain the lead time, with a comparison with a Petri-Net based method.
title	From Voice to Safety: Language AI Powered Pilot-ATC Communication Understanding for Airport Surface Movement Collision Risk Assessment
topic	Audio and Speech Processing Sound
url	https://arxiv.org/abs/2503.04974

Similar Items