MARC21: :: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Tanzer, Garrett, Shengelia, Maximus, Harrenstien, Ken, Uthus, David
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	Computation and Language
Accesso online:	https://arxiv.org/abs/2406.11049
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

_version_	1866913393309384704
author	Tanzer, Garrett Shengelia, Maximus Harrenstien, Ken Uthus, David
author_facet	Tanzer, Garrett Shengelia, Maximus Harrenstien, Ken Uthus, David
contents	Historically, sign language machine translation has been posed as a sentence-level task: datasets consisting of continuous narratives are chopped up and presented to the model as isolated clips. In this work, we explore the limitations of this task framing. First, we survey a number of linguistic phenomena in sign languages that depend on discourse-level context. Then as a case study, we perform the first human baseline for sign language translation that actually substitutes a human into the machine learning task framing, rather than provide the human with the entire document as context. This human baseline -- for ASL to English translation on the How2Sign dataset -- shows that for 33% of sentences in our sample, our fluent Deaf signer annotators were only able to understand key parts of the clip in light of additional discourse-level context. These results underscore the importance of understanding and sanity checking examples when adapting machine learning to new domains.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_11049
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Reconsidering Sentence-Level Sign Language Translation Tanzer, Garrett Shengelia, Maximus Harrenstien, Ken Uthus, David Computation and Language Historically, sign language machine translation has been posed as a sentence-level task: datasets consisting of continuous narratives are chopped up and presented to the model as isolated clips. In this work, we explore the limitations of this task framing. First, we survey a number of linguistic phenomena in sign languages that depend on discourse-level context. Then as a case study, we perform the first human baseline for sign language translation that actually substitutes a human into the machine learning task framing, rather than provide the human with the entire document as context. This human baseline -- for ASL to English translation on the How2Sign dataset -- shows that for 33% of sentences in our sample, our fluent Deaf signer annotators were only able to understand key parts of the clip in light of additional discourse-level context. These results underscore the importance of understanding and sanity checking examples when adapting machine learning to new domains.
title	Reconsidering Sentence-Level Sign Language Translation
topic	Computation and Language
url	https://arxiv.org/abs/2406.11049

Documenti analoghi