| Main Authors: | Alshahrani, Saied, Haroon, Hesham, Elfilali, Ali, Njie, Mariama, Matthews, Jeanna |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.00565 |
Similar Items
Arabic Synonym BERT-based Adversarial Examples for Text Classification
by: Alshahrani, Norah, et al.
Published: (2024)
Mind the Gap: A Review of Arabic Post-Training Datasets and Their Limitations
by: Alkhowaiter, Mohammed, et al.
Published: (2025)
EgMM-Corpus: A Multimodal Vision-Language Dataset for Egyptian Culture
by: Gamil, Mohamed, et al.
Published: (2025)
CIDAR: Culturally Relevant Instruction Dataset For Arabic
by: Alyafeai, Zaid, et al.
Published: (2024)
ASCAT: An Arabic Scientific Corpus and Benchmark for Advanced Translation Evaluation
by: Sibaee, Serry, et al.
Published: (2026)
ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs
by: Heakl, Ahmed, et al.
Published: (2024)
Detecting Corpus-Level Knowledge Inconsistencies in Wikipedia with Large Language Models
by: Semnani, Sina J., et al.
Published: (2025)
Proper Noun Diacritization for Arabic Wikipedia: A Benchmark Dataset
by: Bondok, Rawan, et al.
Published: (2025)
SyriSign: A Parallel Corpus for Arabic Text to Syrian Arabic Sign Language Translation
by: Khalil, Mohammad Amer, et al.
Published: (2026)
The SAMER Arabic Text Simplification Corpus
by: Alhafni, Bashar, et al.
Published: (2024)
Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts
by: Shang, Guokan, et al.
Published: (2025)
TEDxTN: A Three-way Speech Translation Corpus for Code-Switched Tunisian Arabic - English
by: Bougares, Fethi, et al.
Published: (2025)
Automatic Construction of a Large-Scale Corpus for Geoparsing Using Wikipedia Hyperlinks
by: Ohno, Keyaki, et al.
Published: (2024)
Language Models Learn Metadata: Political Stance Detection Case Study
by: Cao, Stanley, et al.
Published: (2024)
An Annotated Corpus of Arabic Tweets for Hate Speech Analysis
by: Zaghouani, Wajdi, et al.
Published: (2025)
Advancing Dialectal Arabic to Modern Standard Arabic Machine Translation
by: Alabdullah, Abdullah, et al.
Published: (2025)
Leveraging the Cross-Domain & Cross-Linguistic Corpus for Low Resource NMT: A Case Study On Bhili-Hindi-English Parallel Corpus
by: Singh, Pooja, et al.
Published: (2025)
Beyond Training for Cultural Awareness: The Role of Dataset Linguistic Structure in Large Language Models
by: Masoud, Reem I., et al.
Published: (2026)
Cyber Risks of Machine Translation Critical Errors : Arabic Mental Health Tweets as a Case Study
by: Saadany, Hadeel, et al.
Published: (2024)
Event-Arguments Extraction Corpus and Modeling using BERT for Arabic
by: Aljabari, Alaa, et al.
Published: (2024)
Tarab: A Multi-Dialect Corpus of Arabic Lyrics and Poetry
by: El-Haj, Mo
Published: (2026)
ArabJobs: A Multinational Corpus of Arabic Job Ads
by: El-Haj, Mo
Published: (2025)
A Large and Balanced Corpus for Fine-grained Arabic Readability Assessment
by: Elmadani, Khalid N., et al.
Published: (2025)
ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus
by: Hamed, Injy, et al.
Published: (2024)
PEACH: A sentence-aligned Parallel English-Arabic Corpus for Healthcare
by: Al-Sabbagh, Rania
Published: (2025)
MentalQA: An Annotated Arabic Corpus for Questions and Answers of Mental Healthcare
by: Alhuzali, Hassan, et al.
Published: (2024)
Tibyan Corpus: Balanced and Comprehensive Error Coverage Corpus Using ChatGPT for Arabic Grammatical Error Correction
by: Alrehili, Ahlam, et al.
Published: (2024)
JobArabi: An Arabic Corpus and Analysis of Job Announcements from Social Media
by: Zaghouani, Wajdi, et al.
Published: (2026)
ArabDiscrim: A Decade-Long Arabic Facebook Corpus on Racism and Discrimination
by: Zaghouani, Wajdi, et al.
Published: (2026)
Audience Engagement with Arabic Women's Social Empowerment and Wellbeing: A Decadal Corpus
by: Zaghouani, Wajdi, et al.
Published: (2026)
The Translation of Circumlocution in Arabic Short Stories into English
by: Shehab, Dalal Waadallah
Published: (2024)
LLM-to-Speech: A Synthetic Data Pipeline for Training Dialectal Text-to-Speech Models
by: Khamis, Ahmed Khaled, et al.
Published: (2026)
Translate, then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification
by: Bell, Samuel J., et al.
Published: (2025)
ArzEn-MultiGenre: An aligned parallel dataset of Egyptian Arabic song lyrics, novels, and subtitles, with English translations
by: Al-Sabbagh, Rania
Published: (2025)
Ramsa: A Large Sociolinguistically Rich Emirati Arabic Speech Corpus for ASR and TTS
by: Al-Sabbagh, Rania
Published: (2026)
A Fisheries Co-management Case Study from The Gambia
by: Njie, M., et al.
Published: (2001)
Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models
by: Alzahrani, Abdurahmman, et al.
Published: (2024)
Biomedical Entity Linking for Dutch: Fine-tuning a Self-alignment BERT Model on an Automatically Generated Wikipedia Corpus
by: Hartendorp, Fons, et al.
Published: (2024)
TACOMORE: Leveraging the Potential of LLMs in Corpus-based Discourse Analysis with Prompt Engineering
by: Li, Bingru, et al.
Published: (2024)
FFSTC: Fongbe to French Speech Translation Corpus
by: Kponou, D. Fortune, et al.
Published: (2024)