MARC21: :: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Thi, Thuy Nguyen, Viet, Anh Nguyen, Van, Thin Dang, Thuy, Ngan Nguyen Luu
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	Software Engineering Artificial Intelligence Computation and Language
Accesso online:	https://arxiv.org/abs/2405.01575
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

_version_	1866916232937078784
author	Thi, Thuy Nguyen Viet, Anh Nguyen Van, Thin Dang Thuy, Ngan Nguyen Luu
author_facet	Thi, Thuy Nguyen Viet, Anh Nguyen Van, Thin Dang Thuy, Ngan Nguyen Luu
contents	This paper describes our systems for the sub-task I in the Software Mention Detection in Scholarly Publications shared-task. We propose three approaches leveraging different pre-trained language models (BERT, SciBERT, and XLM-R) to tackle this challenge. Our bestperforming system addresses the named entity recognition (NER) problem through a three-stage framework. (1) Entity Sentence Classification - classifies sentences containing potential software mentions; (2) Entity Extraction - detects mentions within classified sentences; (3) Entity Type Classification - categorizes detected mentions into specific software types. Experiments on the official dataset demonstrate that our three-stage framework achieves competitive performance, surpassing both other participating teams and our alternative approaches. As a result, our framework based on the XLM-R-based model achieves a weighted F1-score of 67.80%, delivering our team the 3rd rank in Sub-task I for the Software Mention Recognition task.
format	Preprint
id	arxiv_https___arxiv_org_abs_2405_01575
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Software Mention Recognition with a Three-Stage Framework Based on BERTology Models at SOMD 2024 Thi, Thuy Nguyen Viet, Anh Nguyen Van, Thin Dang Thuy, Ngan Nguyen Luu Software Engineering Artificial Intelligence Computation and Language This paper describes our systems for the sub-task I in the Software Mention Detection in Scholarly Publications shared-task. We propose three approaches leveraging different pre-trained language models (BERT, SciBERT, and XLM-R) to tackle this challenge. Our bestperforming system addresses the named entity recognition (NER) problem through a three-stage framework. (1) Entity Sentence Classification - classifies sentences containing potential software mentions; (2) Entity Extraction - detects mentions within classified sentences; (3) Entity Type Classification - categorizes detected mentions into specific software types. Experiments on the official dataset demonstrate that our three-stage framework achieves competitive performance, surpassing both other participating teams and our alternative approaches. As a result, our framework based on the XLM-R-based model achieves a weighted F1-score of 67.80%, delivering our team the 3rd rank in Sub-task I for the Software Mention Recognition task.
title	Software Mention Recognition with a Three-Stage Framework Based on BERTology Models at SOMD 2024
topic	Software Engineering Artificial Intelligence Computation and Language
url	https://arxiv.org/abs/2405.01575

Documenti analoghi