Enregistré dans:
| Auteurs principaux: | , |
|---|---|
| Format: | Recurso educativo Open Access |
| Langue: | en |
| Publié: |
2021
|
| Sujets: | |
| Accès en ligne: | https://eric.ed.gov/?id=EJ1334636 |
| Tags: |
Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
|
| _version_ | 1867181789701734400 |
|---|---|
| author | Sanosi, Abdulaziz Abdalla, Mohamed |
| author_facet | Sanosi, Abdulaziz Abdalla, Mohamed Sanosi, Abdulaziz Abdalla, Mohamed |
| collection | Education Resources Information Center |
| contents | Automated Identification of Discourse Markers Using the NLP Approach: The Case of "Okay" Sanosi, Abdulaziz Abdalla, Mohamed Natural Language Processing Computational Linguistics Programming Languages Accuracy Punctuation Discourse Analysis Identification Evaluators Comparative Analysis Computer Software This study aimed to examine the potentials of the NLP approach in detecting discourse markers (DMs), namely okay, in transcribed spoken data. One hundred thirty-eight concordance lines were presented to human referees to judge the functions of okay in them as a DM or Non-DM. After that, the researchers used a Python script written according to the POS tagging scheme of the NLTK library to set rules for identifying cases where okay is used as non-DM. The output of the script was compared to the reference human-annotated data. The results showed that the script could accurately identify the function of okay as DM or non-DM in 92% of the cases. The inaccuracy of detecting the rest was found to be caused by a lack of proper and detailed punctuations. The main implications of the results are that new NLP approaches can detect DMS; however, proper punctuation is required to enable the proper identification of DMs. In accordance with the findings, the researcher recommended adopting the approach after conducting further comprehensive studies. |
| format | Recurso educativo Open Access |
| id | eric_EJ1334636 |
| institution | ERIC Institute of Education Sciences |
| language | en |
| publishDate | 2021 |
| record_format | eric |
| spellingShingle | Automated Identification of Discourse Markers Using the NLP Approach: The Case of "Okay" Sanosi, Abdulaziz Abdalla, Mohamed Natural Language Processing Computational Linguistics Programming Languages Accuracy Punctuation Discourse Analysis Identification Evaluators Comparative Analysis Computer Software Automated Identification of Discourse Markers Using the NLP Approach: The Case of "Okay" Sanosi, Abdulaziz Abdalla, Mohamed Natural Language Processing Computational Linguistics Programming Languages Accuracy Punctuation Discourse Analysis Identification Evaluators Comparative Analysis Computer Software This study aimed to examine the potentials of the NLP approach in detecting discourse markers (DMs), namely okay, in transcribed spoken data. One hundred thirty-eight concordance lines were presented to human referees to judge the functions of okay in them as a DM or Non-DM. After that, the researchers used a Python script written according to the POS tagging scheme of the NLTK library to set rules for identifying cases where okay is used as non-DM. The output of the script was compared to the reference human-annotated data. The results showed that the script could accurately identify the function of okay as DM or non-DM in 92% of the cases. The inaccuracy of detecting the rest was found to be caused by a lack of proper and detailed punctuations. The main implications of the results are that new NLP approaches can detect DMS; however, proper punctuation is required to enable the proper identification of DMs. In accordance with the findings, the researcher recommended adopting the approach after conducting further comprehensive studies. |
| title | Automated Identification of Discourse Markers Using the NLP Approach: The Case of "Okay" |
| topic | Natural Language Processing Computational Linguistics Programming Languages Accuracy Punctuation Discourse Analysis Identification Evaluators Comparative Analysis Computer Software |
| url | https://eric.ed.gov/?id=EJ1334636 |