Saved in:
| Main Authors: | Mutal, Jonathan, Almaoui, Perla Al, Hengchen, Simon, Bouillon, Pierrette |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.16290 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Arabizi vs LLMs: Can the Genie Understand the Language of Aladdin?
by: Almaoui, Perla Al, et al.
Published: (2025)
by: Almaoui, Perla Al, et al.
Published: (2025)
Casablanca: Data and Models for Multidialectal Arabic Speech Recognition
by: Talafha, Bashar, et al.
Published: (2024)
by: Talafha, Bashar, et al.
Published: (2024)
Jawaher: A Multidialectal Dataset of Arabic Proverbs for LLM Benchmarking
by: Magdy, Samar M., et al.
Published: (2025)
by: Magdy, Samar M., et al.
Published: (2025)
ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus
by: Hamed, Injy, et al.
Published: (2024)
by: Hamed, Injy, et al.
Published: (2024)
NADI 2025: The First Multidialectal Arabic Speech Processing Shared Task
by: Talafha, Bashar, et al.
Published: (2025)
by: Talafha, Bashar, et al.
Published: (2025)
Maastricht University at AMIYA: Adapting LLMs for Dialectal Arabic using Fine-tuning and MBR Decoding
by: Alali, Abdulhai, et al.
Published: (2026)
by: Alali, Abdulhai, et al.
Published: (2026)
Detection of Non-recorded Word Senses in English and Swedish
by: Lautenschlager, Jonathan, et al.
Published: (2024)
by: Lautenschlager, Jonathan, et al.
Published: (2024)
LoFTI: Localization and Factuality Transfer to Indian Locales
by: Simon, Sona Elza, et al.
Published: (2024)
by: Simon, Sona Elza, et al.
Published: (2024)
Munsit at NADI 2025 Shared Task 2: Pushing the Boundaries of Multidialectal Arabic ASR with Weakly Supervised Pretraining and Continual Supervised Fine-tuning
by: Salhab, Mahmoud, et al.
Published: (2025)
by: Salhab, Mahmoud, et al.
Published: (2025)
AraFinNLP 2024: The First Arabic Financial NLP Shared Task
by: Malaysha, Sanad, et al.
Published: (2024)
by: Malaysha, Sanad, et al.
Published: (2024)
Automated Question Generation for Science Tests in Arabic Language Using NLP Techniques
by: Tami, Mohammad, et al.
Published: (2024)
by: Tami, Mohammad, et al.
Published: (2024)
Revisiting Common Assumptions about Arabic Dialects in NLP
by: Keleg, Amr, et al.
Published: (2025)
by: Keleg, Amr, et al.
Published: (2025)
DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages
by: Schlechtweg, Dominik, et al.
Published: (2021)
by: Schlechtweg, Dominik, et al.
Published: (2021)
AraReasoner: Evaluating Reasoning-Based LLMs for Arabic NLP
by: Hasanaath, Ahmed, et al.
Published: (2025)
by: Hasanaath, Ahmed, et al.
Published: (2025)
Enhancing Semantic Similarity Understanding in Arabic NLP with Nested Embedding Learning
by: Nacar, Omer, et al.
Published: (2024)
by: Nacar, Omer, et al.
Published: (2024)
Pandora's Box or Aladdin's Lamp: A Comprehensive Analysis Revealing the Role of RAG Noise in Large Language Models
by: Wu, Jinyang, et al.
Published: (2024)
by: Wu, Jinyang, et al.
Published: (2024)
Your Students Don't Use LLMs Like You Wish They Did
by: Kobler, Sebastian, et al.
Published: (2026)
by: Kobler, Sebastian, et al.
Published: (2026)
A Survey of Code-switched Arabic NLP: Progress, Challenges, and Future Directions
by: Hamed, Injy, et al.
Published: (2025)
by: Hamed, Injy, et al.
Published: (2025)
Data Augmentation for Maltese NLP using Transliterated and Machine Translated Arabic Data
by: Micallef, Kurt, et al.
Published: (2025)
by: Micallef, Kurt, et al.
Published: (2025)
The Landscape of Arabic Large Language Models (ALLMs): A New Era for Arabic Language Technology
by: Al-Khalifa, Shahad, et al.
Published: (2025)
by: Al-Khalifa, Shahad, et al.
Published: (2025)
Noise Steering for Controlled Text Generation: Improving Diversity and Reading-Level Fidelity in Arabic Educational Story Generation
by: Khalid, Haziq Mohammad, et al.
Published: (2026)
by: Khalid, Haziq Mohammad, et al.
Published: (2026)
Tokenization and Morphological Fidelity in Uralic NLP: A Cross-Lingual Evaluation
by: Xu, Nuo, et al.
Published: (2026)
by: Xu, Nuo, et al.
Published: (2026)
Building Arabic NLP from the Ground Up: Twenty Years of Lessons, Failures, and Open Problems
by: Zaghouani, Wajdi
Published: (2026)
by: Zaghouani, Wajdi
Published: (2026)
Strategies for Arabic Readability Modeling
by: Liberato, Juan Piñeros, et al.
Published: (2024)
by: Liberato, Juan Piñeros, et al.
Published: (2024)
Du Laboratoire de Langues a la Bibliotheque Sonore: l'Individualisation de l'Apprentissage en Langues Vivantes (From the Language Laboratory to the Tape Library: Individualized Modern Language Instruction). Melanges Pedagogiques, 1971.
by: Bouillon, C.
Published: (1971)
by: Bouillon, C.
Published: (1971)
QU-NLP at QIAS 2026: Multi-Stage QLoRA Fine-Tuning for Arabic Islamic Inheritance Reasoning
by: AL-Smadi, Mohammad
Published: (2026)
by: AL-Smadi, Mohammad
Published: (2026)
Arabic Little STT: Arabic Children Speech Recognition Dataset
by: Alkadri, Mouhand, et al.
Published: (2025)
by: Alkadri, Mouhand, et al.
Published: (2025)
PEACH: A sentence-aligned Parallel English-Arabic Corpus for Healthcare
by: Al-Sabbagh, Rania
Published: (2025)
by: Al-Sabbagh, Rania
Published: (2025)
SHAMI-MT: A Syrian Arabic Dialect to Modern Standard Arabic Bidirectional Machine Translation System
by: Sibaee, Serry, et al.
Published: (2025)
by: Sibaee, Serry, et al.
Published: (2025)
The Arabic Noun System Generation
by: Soudi, Abdelhadi, et al.
Published: (2024)
by: Soudi, Abdelhadi, et al.
Published: (2024)
The SAMER Arabic Text Simplification Corpus
by: Alhafni, Bashar, et al.
Published: (2024)
by: Alhafni, Bashar, et al.
Published: (2024)
The Qiyas Benchmark: Measuring ChatGPT Mathematical and Language Understanding in Arabic
by: Al-Khalifa, Shahad, et al.
Published: (2024)
by: Al-Khalifa, Shahad, et al.
Published: (2024)
GLARE: Google Apps Arabic Reviews Dataset
by: AlGhamdi, Fatima, et al.
Published: (2024)
by: AlGhamdi, Fatima, et al.
Published: (2024)
Ramsa: A Large Sociolinguistically Rich Emirati Arabic Speech Corpus for ASR and TTS
by: Al-Sabbagh, Rania
Published: (2026)
by: Al-Sabbagh, Rania
Published: (2026)
The Arabic Generality Score: Another Dimension of Modeling Arabic Dialectness
by: Shaban, Sanad, et al.
Published: (2025)
by: Shaban, Sanad, et al.
Published: (2025)
MultiProSE: A Multi-label Arabic Dataset for Propaganda, Sentiment, and Emotion Detection
by: Al-Henaki, Lubna, et al.
Published: (2025)
by: Al-Henaki, Lubna, et al.
Published: (2025)
From Code-Centric to Concept-Centric: Teaching NLP with LLM-Assisted "Vibe Coding"
by: Al-Khalifa, Hend
Published: (2026)
by: Al-Khalifa, Hend
Published: (2026)
A Survey of Large Language Models for Arabic Language and its Dialects
by: Mashaabi, Malak, et al.
Published: (2024)
by: Mashaabi, Malak, et al.
Published: (2024)
Ar-Spider: Text-to-SQL in Arabic
by: Almohaimeed, Saleh, et al.
Published: (2024)
by: Almohaimeed, Saleh, et al.
Published: (2024)
Evaluation of Semantic Search and its Role in Retrieved-Augmented-Generation (RAG) for Arabic Language
by: Mahboub, Ali, et al.
Published: (2024)
by: Mahboub, Ali, et al.
Published: (2024)
Similar Items
-
Arabizi vs LLMs: Can the Genie Understand the Language of Aladdin?
by: Almaoui, Perla Al, et al.
Published: (2025) -
Casablanca: Data and Models for Multidialectal Arabic Speech Recognition
by: Talafha, Bashar, et al.
Published: (2024) -
Jawaher: A Multidialectal Dataset of Arabic Proverbs for LLM Benchmarking
by: Magdy, Samar M., et al.
Published: (2025) -
ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus
by: Hamed, Injy, et al.
Published: (2024) -
NADI 2025: The First Multidialectal Arabic Speech Processing Shared Task
by: Talafha, Bashar, et al.
Published: (2025)