Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Tanvir, Abdullah All, Huang, Chenyu, Alahmad, Moe, Yang, Chuyang, Zhong, Xin
Format:	Preprint
Published:	2025
Subjects:	Sound Computers and Society Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2509.09752
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915491374694400
author	Tanvir, Abdullah All Huang, Chenyu Alahmad, Moe Yang, Chuyang Zhong, Xin
author_facet	Tanvir, Abdullah All Huang, Chenyu Alahmad, Moe Yang, Chuyang Zhong, Xin
contents	Accurate estimation of aircraft operations, such as takeoffs and landings, is critical for effective airport management, yet remains challenging, especially at non-towered facilities lacking dedicated surveillance infrastructure. This paper presents a novel dual pipeline machine learning framework that classifies pilot radio communications using both textual and spectral features. Audio data collected from a non-towered U.S. airport was annotated by certified pilots with operational intent labels and preprocessed through automatic speech recognition and Mel-spectrogram extraction. We evaluate a wide range of traditional classifiers and deep learning models, including ensemble methods, LSTM, and CNN across both pipelines. To our knowledge, this is the first system to classify operational aircraft intent using a dual-pipeline ML framework on real-world air traffic audio. Our results demonstrate that spectral features combined with deep architectures consistently yield superior classification performance, with F1-scores exceeding 91%. Data augmentation further improves robustness to real-world audio variability. The proposed approach is scalable, cost-effective, and deployable without additional infrastructure, offering a practical solution for air traffic monitoring at general aviation airports.
format	Preprint
id	arxiv_https___arxiv_org_abs_2509_09752
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Combining Textual and Spectral Features for Robust Classification of Pilot Communications Tanvir, Abdullah All Huang, Chenyu Alahmad, Moe Yang, Chuyang Zhong, Xin Sound Computers and Society Audio and Speech Processing Accurate estimation of aircraft operations, such as takeoffs and landings, is critical for effective airport management, yet remains challenging, especially at non-towered facilities lacking dedicated surveillance infrastructure. This paper presents a novel dual pipeline machine learning framework that classifies pilot radio communications using both textual and spectral features. Audio data collected from a non-towered U.S. airport was annotated by certified pilots with operational intent labels and preprocessed through automatic speech recognition and Mel-spectrogram extraction. We evaluate a wide range of traditional classifiers and deep learning models, including ensemble methods, LSTM, and CNN across both pipelines. To our knowledge, this is the first system to classify operational aircraft intent using a dual-pipeline ML framework on real-world air traffic audio. Our results demonstrate that spectral features combined with deep architectures consistently yield superior classification performance, with F1-scores exceeding 91%. Data augmentation further improves robustness to real-world audio variability. The proposed approach is scalable, cost-effective, and deployable without additional infrastructure, offering a practical solution for air traffic monitoring at general aviation airports.
title	Combining Textual and Spectral Features for Robust Classification of Pilot Communications
topic	Sound Computers and Society Audio and Speech Processing
url	https://arxiv.org/abs/2509.09752

Similar Items