Saved in:
Bibliographic Details
Main Authors: Zhang, Yigeng, González, Fabio A., Solorio, Thamar
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.05250
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910402401533952
author Zhang, Yigeng
González, Fabio A.
Solorio, Thamar
author_facet Zhang, Yigeng
González, Fabio A.
Solorio, Thamar
contents Reading comprehension continues to be a crucial research focus in the NLP community. Recent advances in Machine Reading Comprehension (MRC) have mostly centered on literal comprehension, referring to the surface-level understanding of content. In this work, we focus on the next level - interpretive comprehension, with a particular emphasis on inferring the themes of a narrative text. We introduce the first dataset specifically designed for interpretive comprehension of educational narratives, providing corresponding well-edited theme texts. The dataset spans a variety of genres and cultural origins and includes human-annotated theme keywords with varying levels of granularity. We further formulate NLP tasks under different abstractions of interpretive comprehension toward the main idea of a story. After conducting extensive experiments with state-of-the-art methods, we found the task to be both challenging and significant for NLP research. The dataset and source code have been made publicly available to the research community at https://github.com/RiTUAL-UH/EduStory.
format Preprint
id arxiv_https___arxiv_org_abs_2404_05250
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Interpreting Themes from Educational Stories
Zhang, Yigeng
González, Fabio A.
Solorio, Thamar
Computation and Language
Reading comprehension continues to be a crucial research focus in the NLP community. Recent advances in Machine Reading Comprehension (MRC) have mostly centered on literal comprehension, referring to the surface-level understanding of content. In this work, we focus on the next level - interpretive comprehension, with a particular emphasis on inferring the themes of a narrative text. We introduce the first dataset specifically designed for interpretive comprehension of educational narratives, providing corresponding well-edited theme texts. The dataset spans a variety of genres and cultural origins and includes human-annotated theme keywords with varying levels of granularity. We further formulate NLP tasks under different abstractions of interpretive comprehension toward the main idea of a story. After conducting extensive experiments with state-of-the-art methods, we found the task to be both challenging and significant for NLP research. The dataset and source code have been made publicly available to the research community at https://github.com/RiTUAL-UH/EduStory.
title Interpreting Themes from Educational Stories
topic Computation and Language
url https://arxiv.org/abs/2404.05250