Enregistré dans:
Détails bibliographiques
Auteurs principaux: Sanosi, Abdulaziz, Abdalla, Mohamed
Format: Recurso educativo Open Access
Langue:en
Publié: 2021
Sujets:
Accès en ligne:https://eric.ed.gov/?id=EJ1334636
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1867181789701734400
author Sanosi, Abdulaziz
Abdalla, Mohamed
author_facet Sanosi, Abdulaziz
Abdalla, Mohamed
Sanosi, Abdulaziz
Abdalla, Mohamed
collection Education Resources Information Center
contents Automated Identification of Discourse Markers Using the NLP Approach: The Case of "Okay" Sanosi, Abdulaziz Abdalla, Mohamed Natural Language Processing Computational Linguistics Programming Languages Accuracy Punctuation Discourse Analysis Identification Evaluators Comparative Analysis Computer Software This study aimed to examine the potentials of the NLP approach in detecting discourse markers (DMs), namely okay, in transcribed spoken data. One hundred thirty-eight concordance lines were presented to human referees to judge the functions of okay in them as a DM or Non-DM. After that, the researchers used a Python script written according to the POS tagging scheme of the NLTK library to set rules for identifying cases where okay is used as non-DM. The output of the script was compared to the reference human-annotated data. The results showed that the script could accurately identify the function of okay as DM or non-DM in 92% of the cases. The inaccuracy of detecting the rest was found to be caused by a lack of proper and detailed punctuations. The main implications of the results are that new NLP approaches can detect DMS; however, proper punctuation is required to enable the proper identification of DMs. In accordance with the findings, the researcher recommended adopting the approach after conducting further comprehensive studies.
format Recurso educativo Open Access
id eric_EJ1334636
institution ERIC Institute of Education Sciences
language en
publishDate 2021
record_format eric
spellingShingle Automated Identification of Discourse Markers Using the NLP Approach: The Case of "Okay"
Sanosi, Abdulaziz
Abdalla, Mohamed
Natural Language Processing
Computational Linguistics
Programming Languages
Accuracy
Punctuation
Discourse Analysis
Identification
Evaluators
Comparative Analysis
Computer Software
Automated Identification of Discourse Markers Using the NLP Approach: The Case of "Okay" Sanosi, Abdulaziz Abdalla, Mohamed Natural Language Processing Computational Linguistics Programming Languages Accuracy Punctuation Discourse Analysis Identification Evaluators Comparative Analysis Computer Software This study aimed to examine the potentials of the NLP approach in detecting discourse markers (DMs), namely okay, in transcribed spoken data. One hundred thirty-eight concordance lines were presented to human referees to judge the functions of okay in them as a DM or Non-DM. After that, the researchers used a Python script written according to the POS tagging scheme of the NLTK library to set rules for identifying cases where okay is used as non-DM. The output of the script was compared to the reference human-annotated data. The results showed that the script could accurately identify the function of okay as DM or non-DM in 92% of the cases. The inaccuracy of detecting the rest was found to be caused by a lack of proper and detailed punctuations. The main implications of the results are that new NLP approaches can detect DMS; however, proper punctuation is required to enable the proper identification of DMs. In accordance with the findings, the researcher recommended adopting the approach after conducting further comprehensive studies.
title Automated Identification of Discourse Markers Using the NLP Approach: The Case of "Okay"
topic Natural Language Processing
Computational Linguistics
Programming Languages
Accuracy
Punctuation
Discourse Analysis
Identification
Evaluators
Comparative Analysis
Computer Software
url https://eric.ed.gov/?id=EJ1334636