Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Merad, Ibrahim, Wolf, Amos, Mazzawi, Ziad, Léo, Yannick
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2412.13924
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866929637799493632
author Merad, Ibrahim
Wolf, Amos
Mazzawi, Ziad
Léo, Yannick
author_facet Merad, Ibrahim
Wolf, Amos
Mazzawi, Ziad
Léo, Yannick
contents In the quest to overcome language barriers, encoder-decoder models like NLLB have expanded machine translation to rare languages, with some models (e.g., NLLB 1.3B) even trainable on a single GPU. While general-purpose LLMs perform well in translation, open LLMs prove highly competitive when fine-tuned for specific tasks involving unknown corpora. We introduce LYRA (Language verY Rare for All), a novel approach that combines open LLM fine-tuning, retrieval-augmented generation (RAG), and transfer learning from related high-resource languages. This study is exclusively focused on single-GPU training to facilitate ease of adoption. Our study focuses on two-way translation between French and Monégasque, a rare language unsupported by existing translation tools due to limited corpus availability. Our results demonstrate LYRA's effectiveness, frequently surpassing and consistently matching state-of-the-art encoder-decoder models in rare language translation.
format Preprint
id arxiv_https___arxiv_org_abs_2412_13924
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Language verY Rare for All
Merad, Ibrahim
Wolf, Amos
Mazzawi, Ziad
Léo, Yannick
Computation and Language
Machine Learning
In the quest to overcome language barriers, encoder-decoder models like NLLB have expanded machine translation to rare languages, with some models (e.g., NLLB 1.3B) even trainable on a single GPU. While general-purpose LLMs perform well in translation, open LLMs prove highly competitive when fine-tuned for specific tasks involving unknown corpora. We introduce LYRA (Language verY Rare for All), a novel approach that combines open LLM fine-tuning, retrieval-augmented generation (RAG), and transfer learning from related high-resource languages. This study is exclusively focused on single-GPU training to facilitate ease of adoption. Our study focuses on two-way translation between French and Monégasque, a rare language unsupported by existing translation tools due to limited corpus availability. Our results demonstrate LYRA's effectiveness, frequently surpassing and consistently matching state-of-the-art encoder-decoder models in rare language translation.
title Language verY Rare for All
topic Computation and Language
Machine Learning
url https://arxiv.org/abs/2412.13924