Saved in:
Bibliographic Details
Main Authors: Tanvir, Abdullah All, Huang, Chenyu, Alahmad, Moe, Yang, Chuyang, Zhong, Xin
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2509.09752
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915491374694400
author Tanvir, Abdullah All
Huang, Chenyu
Alahmad, Moe
Yang, Chuyang
Zhong, Xin
author_facet Tanvir, Abdullah All
Huang, Chenyu
Alahmad, Moe
Yang, Chuyang
Zhong, Xin
contents Accurate estimation of aircraft operations, such as takeoffs and landings, is critical for effective airport management, yet remains challenging, especially at non-towered facilities lacking dedicated surveillance infrastructure. This paper presents a novel dual pipeline machine learning framework that classifies pilot radio communications using both textual and spectral features. Audio data collected from a non-towered U.S. airport was annotated by certified pilots with operational intent labels and preprocessed through automatic speech recognition and Mel-spectrogram extraction. We evaluate a wide range of traditional classifiers and deep learning models, including ensemble methods, LSTM, and CNN across both pipelines. To our knowledge, this is the first system to classify operational aircraft intent using a dual-pipeline ML framework on real-world air traffic audio. Our results demonstrate that spectral features combined with deep architectures consistently yield superior classification performance, with F1-scores exceeding 91%. Data augmentation further improves robustness to real-world audio variability. The proposed approach is scalable, cost-effective, and deployable without additional infrastructure, offering a practical solution for air traffic monitoring at general aviation airports.
format Preprint
id arxiv_https___arxiv_org_abs_2509_09752
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Combining Textual and Spectral Features for Robust Classification of Pilot Communications
Tanvir, Abdullah All
Huang, Chenyu
Alahmad, Moe
Yang, Chuyang
Zhong, Xin
Sound
Computers and Society
Audio and Speech Processing
Accurate estimation of aircraft operations, such as takeoffs and landings, is critical for effective airport management, yet remains challenging, especially at non-towered facilities lacking dedicated surveillance infrastructure. This paper presents a novel dual pipeline machine learning framework that classifies pilot radio communications using both textual and spectral features. Audio data collected from a non-towered U.S. airport was annotated by certified pilots with operational intent labels and preprocessed through automatic speech recognition and Mel-spectrogram extraction. We evaluate a wide range of traditional classifiers and deep learning models, including ensemble methods, LSTM, and CNN across both pipelines. To our knowledge, this is the first system to classify operational aircraft intent using a dual-pipeline ML framework on real-world air traffic audio. Our results demonstrate that spectral features combined with deep architectures consistently yield superior classification performance, with F1-scores exceeding 91%. Data augmentation further improves robustness to real-world audio variability. The proposed approach is scalable, cost-effective, and deployable without additional infrastructure, offering a practical solution for air traffic monitoring at general aviation airports.
title Combining Textual and Spectral Features for Robust Classification of Pilot Communications
topic Sound
Computers and Society
Audio and Speech Processing
url https://arxiv.org/abs/2509.09752