Saved in:
Bibliographic Details
Main Authors: Chaoui, Nasma, Khoury, Richard
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2508.10683
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909038131806208
author Chaoui, Nasma
Khoury, Richard
author_facet Chaoui, Nasma
Khoury, Richard
contents This paper presents the first systematic study of strategies for translating Coptic into French. Our comprehensive pipeline systematically evaluates: pivot versus direct translation, the impact of pre-training, the benefits of multi-version fine-tuning, and model robustness to noise. Utilizing aligned biblical corpora, we demonstrate that fine-tuning with a stylistically-varied and noise-aware training corpus significantly enhances translation quality. Our findings provide crucial practical insights for developing translation tools for historical languages in general.
format Preprint
id arxiv_https___arxiv_org_abs_2508_10683
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Neural Machine Translation for Coptic-French: Strategies for Low-Resource Ancient Languages
Chaoui, Nasma
Khoury, Richard
Computation and Language
This paper presents the first systematic study of strategies for translating Coptic into French. Our comprehensive pipeline systematically evaluates: pivot versus direct translation, the impact of pre-training, the benefits of multi-version fine-tuning, and model robustness to noise. Utilizing aligned biblical corpora, we demonstrate that fine-tuning with a stylistically-varied and noise-aware training corpus significantly enhances translation quality. Our findings provide crucial practical insights for developing translation tools for historical languages in general.
title Neural Machine Translation for Coptic-French: Strategies for Low-Resource Ancient Languages
topic Computation and Language
url https://arxiv.org/abs/2508.10683