Saved in:
| Main Authors: | Nguefack, Idriss Nguepi, Finkelstein, Mara, Sakayo, Toadoum Sari |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.25116 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Introducing the NewsPaLM MBR and QE Dataset: LLM-Generated High-Quality Parallel Data Outperforms Traditional Web-Crawled Data
by: Finkelstein, Mara, et al.
Published: (2024)
by: Finkelstein, Mara, et al.
Published: (2024)
Improving Retrieval-Augmented Neural Machine Translation with Monolingual Data
by: Bouthors, Maxime, et al.
Published: (2025)
by: Bouthors, Maxime, et al.
Published: (2025)
Context-Aware Monolingual Human Evaluation of Machine Translation
by: Picinini, Silvio, et al.
Published: (2025)
by: Picinini, Silvio, et al.
Published: (2025)
TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation
by: Zebaze, Armel, et al.
Published: (2025)
by: Zebaze, Armel, et al.
Published: (2025)
Cross-lingual Transfer or Machine Translation? On Data Augmentation for Monolingual Semantic Textual Similarity
by: Hoshino, Sho, et al.
Published: (2024)
by: Hoshino, Sho, et al.
Published: (2024)
MQM Re-Annotation: A Technique for Collaborative Evaluation of Machine Translation
by: Riley, Parker, et al.
Published: (2025)
by: Riley, Parker, et al.
Published: (2025)
Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms
by: Trabelsi, Firas, et al.
Published: (2024)
by: Trabelsi, Firas, et al.
Published: (2024)
Monolingual and Multilingual Misinformation Detection for Low-Resource Languages: A Comprehensive Survey
by: Wang, Xinyu, et al.
Published: (2024)
by: Wang, Xinyu, et al.
Published: (2024)
Compensating for Data with Reasoning: Low-Resource Machine Translation with LLMs
by: Frontull, Samuel, et al.
Published: (2025)
by: Frontull, Samuel, et al.
Published: (2025)
Translatotron 3: Speech to Speech Translation with Monolingual Data
by: Nachmani, Eliya, et al.
Published: (2023)
by: Nachmani, Eliya, et al.
Published: (2023)
Mitigating Stylistic Biases of Machine Translation Systems via Monolingual Corpora Only
by: Gao, Xuanqi, et al.
Published: (2025)
by: Gao, Xuanqi, et al.
Published: (2025)
Neural Machine Translation for Coptic-French: Strategies for Low-Resource Ancient Languages
by: Chaoui, Nasma, et al.
Published: (2025)
by: Chaoui, Nasma, et al.
Published: (2025)
When Does Monolingual Data Help Multilingual Translation: The Role of Domain and Model Scale
by: Baziotis, Christos, et al.
Published: (2023)
by: Baziotis, Christos, et al.
Published: (2023)
Dialectal and Low-Resource Machine Translation for Aromanian
by: Jerpelea, Alexandru-Iulius, et al.
Published: (2024)
by: Jerpelea, Alexandru-Iulius, et al.
Published: (2024)
ACADATA: Parallel Dataset of Academic Data for Machine Translation
by: Lacunza, Iñaki, et al.
Published: (2025)
by: Lacunza, Iñaki, et al.
Published: (2025)
Open Language Data Initiative: Advancing Low-Resource Machine Translation for Karakalpak
by: Mamasaidov, Mukhammadsaid, et al.
Published: (2024)
by: Mamasaidov, Mukhammadsaid, et al.
Published: (2024)
Generative-Adversarial Networks for Low-Resource Language Data Augmentation in Machine Translation
by: Zeng, Linda
Published: (2024)
by: Zeng, Linda
Published: (2024)
Generating Difficult-to-Translate Texts
by: Zouhar, Vilém, et al.
Published: (2025)
by: Zouhar, Vilém, et al.
Published: (2025)
Paramanu: Compact and Competitive Monolingual Language Models for Low-Resource Morphologically Rich Indian Languages
by: Niyogi, Mitodru, et al.
Published: (2024)
by: Niyogi, Mitodru, et al.
Published: (2024)
BhashaSetu: A Data-Centric Approach to Low-Resource Machine Translation
by: Thakkar, Param, et al.
Published: (2026)
by: Thakkar, Param, et al.
Published: (2026)
Quantity vs. Quality of Monolingual Source Data in Automatic Text Translation: Can It Be Too Little If It Is Too Good?
by: Abdulmumin, Idris, et al.
Published: (2024)
by: Abdulmumin, Idris, et al.
Published: (2024)
Many-to-English Machine Translation Tools, Data, and Pretrained Models
by: Gowda, Thamme, et al.
Published: (2021)
by: Gowda, Thamme, et al.
Published: (2021)
Pointer-Generator Networks for Low-Resource Machine Translation: Don't Copy That!
by: Bafna, Niyati, et al.
Published: (2024)
by: Bafna, Niyati, et al.
Published: (2024)
LLM-Assisted Rule Based Machine Translation for Low/No-Resource Languages
by: Coleman, Jared, et al.
Published: (2024)
by: Coleman, Jared, et al.
Published: (2024)
UrduLM: A Resource-Efficient Monolingual Urdu Language Model
by: Ali, Syed Muhammad, et al.
Published: (2026)
by: Ali, Syed Muhammad, et al.
Published: (2026)
Translation or Recitation? Calibrating Evaluation Scores for Machine Translation of Extremely Low-Resource Languages
by: Chen, Danlu, et al.
Published: (2026)
by: Chen, Danlu, et al.
Published: (2026)
Pivot Language for Low-Resource Machine Translation
by: Talwar, Abhimanyu, et al.
Published: (2025)
by: Talwar, Abhimanyu, et al.
Published: (2025)
Is Small Language Model the Silver Bullet to Low-Resource Languages Machine Translation?
by: Song, Yewei, et al.
Published: (2025)
by: Song, Yewei, et al.
Published: (2025)
Misgendering and Assuming Gender in Machine Translation when Working with Low-Resource Languages
by: Ghosh, Sourojit, et al.
Published: (2024)
by: Ghosh, Sourojit, et al.
Published: (2024)
Multilingual Language Model Pretraining using Machine-translated Data
by: Wang, Jiayi, et al.
Published: (2025)
by: Wang, Jiayi, et al.
Published: (2025)
Reflective Translation: Improving Low-Resource Machine Translation via Structured Self-Reflection
by: Cheng, Nicholas
Published: (2026)
by: Cheng, Nicholas
Published: (2026)
Parallel Corpora for Machine Translation in Low-resource Indic Languages: A Comprehensive Review
by: Raja, Rahul, et al.
Published: (2025)
by: Raja, Rahul, et al.
Published: (2025)
OpenWHO: A Document-Level Parallel Corpus for Health Translation in Low-Resource Languages
by: Merx, Raphaël, et al.
Published: (2025)
by: Merx, Raphaël, et al.
Published: (2025)
The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation
by: Sälevä, Jonne, et al.
Published: (2021)
by: Sälevä, Jonne, et al.
Published: (2021)
From LLM to NMT: Advancing Low-Resource Machine Translation with Claude
by: Enis, Maxim, et al.
Published: (2024)
by: Enis, Maxim, et al.
Published: (2024)
Beyond MLE: Investigating SEARNN for Low-Resourced Neural Machine Translation
by: Emezue, Chris
Published: (2024)
by: Emezue, Chris
Published: (2024)
Machine Translation Advancements of Low-Resource Indian Languages by Transfer Learning
by: Wei, Bin, et al.
Published: (2024)
by: Wei, Bin, et al.
Published: (2024)
Low-Resource Machine Translation through the Lens of Personalized Federated Learning
by: Moskvoretskii, Viktor, et al.
Published: (2024)
by: Moskvoretskii, Viktor, et al.
Published: (2024)
ViDia2Std: A Parallel Corpus and Methods for Low-Resource Vietnamese Dialect-to-Standard Translation
by: Ta, Khoa Anh, et al.
Published: (2026)
by: Ta, Khoa Anh, et al.
Published: (2026)
A Tulu Resource for Machine Translation
by: Narayanan, Manu, et al.
Published: (2024)
by: Narayanan, Manu, et al.
Published: (2024)
Similar Items
-
Introducing the NewsPaLM MBR and QE Dataset: LLM-Generated High-Quality Parallel Data Outperforms Traditional Web-Crawled Data
by: Finkelstein, Mara, et al.
Published: (2024) -
Improving Retrieval-Augmented Neural Machine Translation with Monolingual Data
by: Bouthors, Maxime, et al.
Published: (2025) -
Context-Aware Monolingual Human Evaluation of Machine Translation
by: Picinini, Silvio, et al.
Published: (2025) -
TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation
by: Zebaze, Armel, et al.
Published: (2025) -
Cross-lingual Transfer or Machine Translation? On Data Augmentation for Monolingual Semantic Textual Similarity
by: Hoshino, Sho, et al.
Published: (2024)