Bibliographic Details
Main Authors:	Kim, Sojung Lucia, Jang, Taehong, Ahn, Joonmo
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2407.11368

Table of Contents:

This study compares three methods for translating ancient texts with sparse corpora: (1) the traditional statistical translation method of phrase alignment, (2) in-context learning with a large language model (LLM), and (3) a proposed inter-methodological approach, namely statistical machine translation over sentence-piece tokens derived from a unified source-target corpus. The proposed approach achieves a BLEU score of 36.71, surpassing both SOLAR-10.7B in-context learning and the best existing Seq2Seq model. Further analysis and discussion are presented.
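
For readers unfamiliar with the metric, the BLEU figure quoted in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation script: it assumes pre-tokenized input, a single reference per hypothesis, uniform weights over 1- to 4-grams, and no smoothing. The function names `ngrams` and `corpus_bleu` and the toy sentences are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(hypotheses, references, max_n=4):
    """Corpus-level BLEU on a 0-100 scale, one reference per hypothesis.

    Assumptions (not taken from the paper): inputs are lists of token
    lists, n-gram weights are uniform, and no smoothing is applied.
    """
    matches = [0] * max_n
    totals = [0] * max_n
    hyp_len = ref_len = 0
    for hyp, ref in zip(hypotheses, references):
        hyp_len += len(hyp)
        ref_len += len(ref)
        for n in range(1, max_n + 1):
            hyp_counts = ngrams(hyp, n)
            ref_counts = ngrams(ref, n)
            # Clip each hypothesis n-gram count by its count in the reference.
            matches[n - 1] += sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
            totals[n - 1] += sum(hyp_counts.values())
    # Without smoothing, any n-gram order with zero matches gives BLEU = 0.
    if hyp_len == 0 or min(matches) == 0:
        return 0.0
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    # The brevity penalty discourages hypotheses shorter than the references.
    brevity_penalty = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return 100.0 * brevity_penalty * math.exp(log_precision)


if __name__ == "__main__":
    # Toy data, purely illustrative.
    hyp = [["the", "cat", "sat", "on", "the", "mat"]]
    ref = [["the", "cat", "sat", "on", "a", "mat"]]
    print(f"BLEU = {corpus_bleu(hyp, ref):.2f}")
```

Reported scores are normally computed with an established evaluation tool, and tokenization choices can shift the result noticeably, so a sketch like this is useful for understanding the metric rather than for reproducing the 36.71 figure.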

Similar Items