Saved in:
Bibliographic Details
Main Authors: Teo, Darryl, Sam, Adharsha, Koh, Chuan Shen Marcus, Nagi, Rakesh, Ribeiro, Nuno Antunes
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.26076
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910077774987264
author Teo, Darryl
Sam, Adharsha
Koh, Chuan Shen Marcus
Nagi, Rakesh
Ribeiro, Nuno Antunes
author_facet Teo, Darryl
Sam, Adharsha
Koh, Chuan Shen Marcus
Nagi, Rakesh
Ribeiro, Nuno Antunes
contents Documentation of airport operations is inherently complex due to extensive technical terminology, rigorous regulations, proprietary regional information, and fragmented communication across multiple stakeholders. The resulting data silos and semantic inconsistencies present a significant impediment to the Total Airport Management (TAM) initiative. This paper presents a methodological framework for constructing a domain-grounded, machine-readable Knowledge Graph (KG) through a dual-stage fusion of symbolic Knowledge Engineering (KE) and generative Large Language Models (LLMs). The framework employs a scaffolded fusion strategy in which expert-curated KE structures guide LLM prompts to facilitate the discovery of semantically aligned knowledge triples. We evaluate this methodology on the Google LangExtract library and investigate the impact of context window utilization by comparing localized segment-based inference with document-level processing. Contrary to prior empirical observations of long-context degradation in LLMs, document-level processing improves the recovery of non-linear procedural dependencies. To ensure the high-fidelity provenance required in airport operations, the proposed framework fuses a probabilistic model for discovery and a deterministic algorithm for anchoring every extraction to its ground source. This ensures absolute traceability and verifiability, bridging the gap between "black-box" generative outputs and the transparency required for operational tooling. Finally, we introduce an automated framework that operationalizes this pipeline to synthesize complex operational workflows from unstructured textual corpora.
format Preprint
id arxiv_https___arxiv_org_abs_2603_26076
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Semi-Automated Knowledge Engineering and Process Mapping for Total Airport Management
Teo, Darryl
Sam, Adharsha
Koh, Chuan Shen Marcus
Nagi, Rakesh
Ribeiro, Nuno Antunes
Artificial Intelligence
Computation and Language
Information Retrieval
Documentation of airport operations is inherently complex due to extensive technical terminology, rigorous regulations, proprietary regional information, and fragmented communication across multiple stakeholders. The resulting data silos and semantic inconsistencies present a significant impediment to the Total Airport Management (TAM) initiative. This paper presents a methodological framework for constructing a domain-grounded, machine-readable Knowledge Graph (KG) through a dual-stage fusion of symbolic Knowledge Engineering (KE) and generative Large Language Models (LLMs). The framework employs a scaffolded fusion strategy in which expert-curated KE structures guide LLM prompts to facilitate the discovery of semantically aligned knowledge triples. We evaluate this methodology on the Google LangExtract library and investigate the impact of context window utilization by comparing localized segment-based inference with document-level processing. Contrary to prior empirical observations of long-context degradation in LLMs, document-level processing improves the recovery of non-linear procedural dependencies. To ensure the high-fidelity provenance required in airport operations, the proposed framework fuses a probabilistic model for discovery and a deterministic algorithm for anchoring every extraction to its ground source. This ensures absolute traceability and verifiability, bridging the gap between "black-box" generative outputs and the transparency required for operational tooling. Finally, we introduce an automated framework that operationalizes this pipeline to synthesize complex operational workflows from unstructured textual corpora.
title Semi-Automated Knowledge Engineering and Process Mapping for Total Airport Management
topic Artificial Intelligence
Computation and Language
Information Retrieval
url https://arxiv.org/abs/2603.26076