Guardado en:
Detalles Bibliográficos
Autores principales: Yuan, Ye, Wu, Haolun, Zhou, Hao, Liu, Xue, Chen, Hao, Xin, Yan, Jianzhong, Zhang
Formato: Preprint
Publicado: 2025
Materias:
Acceso en línea:https://arxiv.org/abs/2505.14906
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
_version_ 1866908372901560320
author Yuan, Ye
Wu, Haolun
Zhou, Hao
Liu, Xue
Chen, Hao
Xin, Yan
Jianzhong
Zhang
author_facet Yuan, Ye
Wu, Haolun
Zhou, Hao
Liu, Xue
Chen, Hao
Xin, Yan
Jianzhong
Zhang
contents Knowledge understanding is a foundational part of envisioned 6G networks to advance network intelligence and AI-native network architectures. In this paradigm, information extraction plays a pivotal role in transforming fragmented telecom knowledge into well-structured formats, empowering diverse AI models to better understand network terminologies. This work proposes a novel language model-based information extraction technique, aiming to extract structured entities from the telecom context. The proposed telecom structured entity extraction (TeleSEE) technique applies a token-efficient representation method to predict entity types and attribute keys, aiming to save the number of output tokens and improve prediction accuracy. Meanwhile, TeleSEE involves a hierarchical parallel decoding method, improving the standard encoder-decoder architecture by integrating additional prompting and decoding strategies into entity extraction tasks. In addition, to better evaluate the performance of the proposed technique in the telecom domain, we further designed a dataset named 6GTech, including 2390 sentences and 23747 words from more than 100 6G-related technical publications. Finally, the experiment shows that the proposed TeleSEE method achieves higher accuracy than other baseline techniques, and also presents 5 to 9 times higher sample processing speed.
format Preprint
id arxiv_https___arxiv_org_abs_2505_14906
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Understanding 6G through Language Models: A Case Study on LLM-aided Structured Entity Extraction in Telecom Domain
Yuan, Ye
Wu, Haolun
Zhou, Hao
Liu, Xue
Chen, Hao
Xin, Yan
Jianzhong
Zhang
Computation and Language
Systems and Control
Knowledge understanding is a foundational part of envisioned 6G networks to advance network intelligence and AI-native network architectures. In this paradigm, information extraction plays a pivotal role in transforming fragmented telecom knowledge into well-structured formats, empowering diverse AI models to better understand network terminologies. This work proposes a novel language model-based information extraction technique, aiming to extract structured entities from the telecom context. The proposed telecom structured entity extraction (TeleSEE) technique applies a token-efficient representation method to predict entity types and attribute keys, aiming to save the number of output tokens and improve prediction accuracy. Meanwhile, TeleSEE involves a hierarchical parallel decoding method, improving the standard encoder-decoder architecture by integrating additional prompting and decoding strategies into entity extraction tasks. In addition, to better evaluate the performance of the proposed technique in the telecom domain, we further designed a dataset named 6GTech, including 2390 sentences and 23747 words from more than 100 6G-related technical publications. Finally, the experiment shows that the proposed TeleSEE method achieves higher accuracy than other baseline techniques, and also presents 5 to 9 times higher sample processing speed.
title Understanding 6G through Language Models: A Case Study on LLM-aided Structured Entity Extraction in Telecom Domain
topic Computation and Language
Systems and Control
url https://arxiv.org/abs/2505.14906