Bibliographic Details
Main Authors: Jakobi, Deborah N., Reich, David R., Prasse, Paul, Hofmann, Jana M., Bolliger, Lena S., Jäger, Lena A.
Format: Preprint
Published: 2026
Subjects:
Online Access: https://arxiv.org/abs/2602.19598
_version_ 1866914343904346112
author Jakobi, Deborah N.
Reich, David R.
Prasse, Paul
Hofmann, Jana M.
Bolliger, Lena S.
Jäger, Lena A.
contents Eye-tracking-while-reading corpora are a valuable resource for many different disciplines and use cases. Use cases range from studying the cognitive processes underlying reading to machine-learning-based applications, such as gaze-based assessments of reading comprehension. The past decades have seen an increase in the number and size of eye-tracking-while-reading datasets as well as increasing diversity with regard to the stimulus languages covered, the linguistic background of the participants, and the accompanying psychometric or demographic data. Because the data are spread across different disciplines and the communities lack shared data-sharing standards, many existing datasets cannot be easily reused due to a lack of interoperability. In this work, we aim to create more transparency and clarity with regard to existing datasets and their features across different disciplines by i) presenting an extensive overview of existing datasets, ii) simplifying the sharing of newly created datasets by publishing a living overview online, https://dili-lab.github.io/datasets.html, presenting over 45 features for each dataset, and iii) integrating all publicly available datasets into the Python package pymovements, which offers an eye-tracking datasets library. By doing so, we aim to strengthen the FAIR principles in eye-tracking-while-reading research and promote good scientific practices, such as reproducing and replicating studies.
format Preprint
id arxiv_https___arxiv_org_abs_2602_19598
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Eye-Tracking-while-Reading: A Living Survey of Datasets with Open Library Support
Computation and Language
title Eye-Tracking-while-Reading: A Living Survey of Datasets with Open Library Support
topic Computation and Language
url https://arxiv.org/abs/2602.19598