Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Nikolaev, Dmitry, Grotenhuis, Jorke, Harel, Haleli, Goldwasser, Orly
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2407.00475
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916307548504064
author	Nikolaev, Dmitry Grotenhuis, Jorke Harel, Haleli Goldwasser, Orly
author_facet	Nikolaev, Dmitry Grotenhuis, Jorke Harel, Haleli Goldwasser, Orly
contents	The complex Ancient Egyptian (AE) writing system was characterised by widespread use of graphemic classifiers (determinatives): silent (unpronounced) hieroglyphic signs clarifying the meaning or indicating the pronunciation of the host word. The study of classifiers has intensified in recent years with the launch and quick growth of the iClassifier project, a web-based platform for annotation and analysis of classifiers in ancient and modern languages. Thanks to the data contributed by the project participants, it is now possible to formulate the identification of classifiers in AE texts as an NLP task. In this paper, we make first steps towards solving this task by implementing a series of sequence-labelling neural models, which achieve promising performance despite the modest amount of training data. We discuss tokenisation and operationalisation issues arising from tackling AE texts and contrast our approach with frequency-based baselines.
format	Preprint
id	arxiv_https___arxiv_org_abs_2407_00475
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Classifier identification in Ancient Egyptian as a low-resource sequence-labelling task Nikolaev, Dmitry Grotenhuis, Jorke Harel, Haleli Goldwasser, Orly Computation and Language The complex Ancient Egyptian (AE) writing system was characterised by widespread use of graphemic classifiers (determinatives): silent (unpronounced) hieroglyphic signs clarifying the meaning or indicating the pronunciation of the host word. The study of classifiers has intensified in recent years with the launch and quick growth of the iClassifier project, a web-based platform for annotation and analysis of classifiers in ancient and modern languages. Thanks to the data contributed by the project participants, it is now possible to formulate the identification of classifiers in AE texts as an NLP task. In this paper, we make first steps towards solving this task by implementing a series of sequence-labelling neural models, which achieve promising performance despite the modest amount of training data. We discuss tokenisation and operationalisation issues arising from tackling AE texts and contrast our approach with frequency-based baselines.
title	Classifier identification in Ancient Egyptian as a low-resource sequence-labelling task
topic	Computation and Language
url	https://arxiv.org/abs/2407.00475

Similar Items