Saved in:
Bibliographic Details
Main Authors: Mutal, Jonathan, Almaoui, Perla Al, Hengchen, Simon, Bouillon, Pierrette
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.16290
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914336518176768
author Mutal, Jonathan
Almaoui, Perla Al
Hengchen, Simon
Bouillon, Pierrette
author_facet Mutal, Jonathan
Almaoui, Perla Al
Hengchen, Simon
Bouillon, Pierrette
contents Arabic dialects have long been under-represented in Natural Language Processing (NLP) research due to their non-standardization and high variability, which pose challenges for computational modeling. Recent advances in the field, such as Large Language Models (LLMs), offer promising avenues to address this gap by enabling Arabic to be modeled as a pluricentric language rather than a monolithic system. This paper presents Aladdin-FTI, our submission to the AMIYA shared task. The proposed system is designed to both generate and translate dialectal Arabic (DA). Specifically, the model supports text generation in Moroccan, Egyptian, Palestinian, Syrian, and Saudi dialects, as well as bidirectional translation between these dialects, Modern Standard Arabic (MSA), and English. The code and trained model are publicly available.
format Preprint
id arxiv_https___arxiv_org_abs_2602_16290
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Aladdin-FTI @ AMIYA Three Wishes for Arabic NLP: Fidelity, Diglossia, and Multidialectal Generation
Mutal, Jonathan
Almaoui, Perla Al
Hengchen, Simon
Bouillon, Pierrette
Computation and Language
Arabic dialects have long been under-represented in Natural Language Processing (NLP) research due to their non-standardization and high variability, which pose challenges for computational modeling. Recent advances in the field, such as Large Language Models (LLMs), offer promising avenues to address this gap by enabling Arabic to be modeled as a pluricentric language rather than a monolithic system. This paper presents Aladdin-FTI, our submission to the AMIYA shared task. The proposed system is designed to both generate and translate dialectal Arabic (DA). Specifically, the model supports text generation in Moroccan, Egyptian, Palestinian, Syrian, and Saudi dialects, as well as bidirectional translation between these dialects, Modern Standard Arabic (MSA), and English. The code and trained model are publicly available.
title Aladdin-FTI @ AMIYA Three Wishes for Arabic NLP: Fidelity, Diglossia, and Multidialectal Generation
topic Computation and Language
url https://arxiv.org/abs/2602.16290