Saved in:
Bibliographic Details
Main Authors: Balyan, Renu, McCarthy, Kathryn S., McNamara, Danielle S.
Format: Recurso educativo Open Access
Language:en
Published: 2020
Subjects:
Online Access:https://eric.ed.gov/?id=ED613587
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867181876143194112
author Balyan, Renu
McCarthy, Kathryn S.
McNamara, Danielle S.
author_facet Balyan, Renu
McCarthy, Kathryn S.
McNamara, Danielle S.
Balyan, Renu
McCarthy, Kathryn S.
McNamara, Danielle S.
collection Education Resources Information Center
contents Applying Natural Language Processing and Hierarchical Machine Learning Approaches to Text Difficulty Classification Balyan, Renu McCarthy, Kathryn S. McNamara, Danielle S. Natural Language Processing Artificial Intelligence Man Machine Systems Classification Readability Difficulty Level Reading Comprehension Intelligent Tutoring Systems Readability Formulas For decades, educators have relied on readability metrics that tend to oversimplify dimensions of text difficulty. This study examines the potential of applying advanced artificial intelligence methods to the educational problem of assessing text difficulty. The combination of hierarchical machine learning and natural language processing (NLP) is leveraged to predict the difficulty of practice texts used in a reading comprehension intelligent tutoring system, iSTART. Human raters estimated the text difficulty level of 262 texts across two text sets (Set A and Set B) in the iSTART library. NLP tools were used to identify linguistic features predictive of text difficulty and these indices were submitted to both flat and hierarchical machine learning algorithms. Results indicated that including NLP indices and machine learning increased accuracy by more than 10% as compared to classic readability metrics (e.g., Flesch-Kincaid Grade Level). Further, hierarchical outperformed non-hierarchical (flat) machine learning classification for Set B (72%) and the combined set A + B (65%), whereas the nonhierarchical approach performed slightly better than the hierarchical approach for Set A (79%). These findings demonstrate the importance of considering deeper features of language related to text difficulty as well as the potential utility of hierarchical machine learning approaches in the development of meaningful text difficulty classification. [This article was published in "International Journal of Artificial Intelligence in Education" (EJ1271922).]
format Recurso educativo Open Access
id eric_ED613587
institution ERIC Institute of Education Sciences
language en
publishDate 2020
record_format eric
spellingShingle Applying Natural Language Processing and Hierarchical Machine Learning Approaches to Text Difficulty Classification
Balyan, Renu
McCarthy, Kathryn S.
McNamara, Danielle S.
Natural Language Processing
Artificial Intelligence
Man Machine Systems
Classification
Readability
Difficulty Level
Reading Comprehension
Intelligent Tutoring Systems
Readability Formulas
Applying Natural Language Processing and Hierarchical Machine Learning Approaches to Text Difficulty Classification Balyan, Renu McCarthy, Kathryn S. McNamara, Danielle S. Natural Language Processing Artificial Intelligence Man Machine Systems Classification Readability Difficulty Level Reading Comprehension Intelligent Tutoring Systems Readability Formulas For decades, educators have relied on readability metrics that tend to oversimplify dimensions of text difficulty. This study examines the potential of applying advanced artificial intelligence methods to the educational problem of assessing text difficulty. The combination of hierarchical machine learning and natural language processing (NLP) is leveraged to predict the difficulty of practice texts used in a reading comprehension intelligent tutoring system, iSTART. Human raters estimated the text difficulty level of 262 texts across two text sets (Set A and Set B) in the iSTART library. NLP tools were used to identify linguistic features predictive of text difficulty and these indices were submitted to both flat and hierarchical machine learning algorithms. Results indicated that including NLP indices and machine learning increased accuracy by more than 10% as compared to classic readability metrics (e.g., Flesch-Kincaid Grade Level). Further, hierarchical outperformed non-hierarchical (flat) machine learning classification for Set B (72%) and the combined set A + B (65%), whereas the nonhierarchical approach performed slightly better than the hierarchical approach for Set A (79%). These findings demonstrate the importance of considering deeper features of language related to text difficulty as well as the potential utility of hierarchical machine learning approaches in the development of meaningful text difficulty classification. [This article was published in "International Journal of Artificial Intelligence in Education" (EJ1271922).]
title Applying Natural Language Processing and Hierarchical Machine Learning Approaches to Text Difficulty Classification
topic Natural Language Processing
Artificial Intelligence
Man Machine Systems
Classification
Readability
Difficulty Level
Reading Comprehension
Intelligent Tutoring Systems
Readability Formulas
url https://eric.ed.gov/?id=ED613587