:: Library Catalog

Bibliographic Details
Main Authors:	Khamis, Ahmed Khaled; Ali, Hesham
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2602.15675

Similar Items

GATech at AbjadGenEval Shared Task: Multilingual Embeddings for Arabic Machine-Generated Text Classification
by: Khamis, Ahmed Khaled
Published: (2026)

SpeechWeave: Diverse Multilingual Synthetic Text & Audio Data Generation Pipeline for Training Text to Speech Models
by: Dua, Karan, et al.
Published: (2025)

GATech at AbjadMed: Bidirectional Encoders vs. Causal Decoders: Insights from 82-Class Arabic Medical Classification
by: Khamis, Ahmed Khaled
Published: (2026)

Phonetic Modeling of Dialectal Variation in Vietnamese Speech
by: Hoang, Quan Ngoc, et al.
Published: (2026)

Towards Zero-Shot Text-To-Speech for Arabic Dialects
by: Doan, Khai Duy, et al.
Published: (2024)

RegSpeech12: A Regional Corpus of Bengali Spontaneous Speech Across Dialects
by: Hassan, Md. Rezuwan, et al.
Published: (2025)

Leveraging LLM and Self-Supervised Training Models for Speech Recognition in Chinese Dialects: A Comparative Analysis
by: Xu, Tianyi, et al.
Published: (2025)

Computational Linguistics Meets Libyan Dialect: A Study on Dialect Identification
by: Essgaer, Mansour, et al.
Published: (2025)

On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition
by: Rossenbach, Nick, et al.
Published: (2024)

Navigating Dialectal Bias and Ethical Complexities in Levantine Arabic Hate Speech Detection
by: Ahmed, Ahmed Haj, et al.
Published: (2024)

Scaling Arabic Medical Chatbots Using Synthetic Data: Enhancing Generative AI with Synthetic Patient Records
by: Allam, Abdulrahman, et al.
Published: (2025)

A Dialectic Pipeline for Improving LLM Robustness
by: Candussio, Sara
Published: (2026)

A Multi-Dialectal Dataset for German Dialect ASR and Dialect-to-Standard Speech Translation
by: Blaschke, Verena, et al.
Published: (2025)

Standard-to-Dialect Transfer Trends Differ across Text and Speech: A Case Study on Intent and Topic Classification in German Dialects
by: Blaschke, Verena, et al.
Published: (2025)

Cross-Dialect Text-To-Speech in Pitch-Accent Language Incorporating Multi-Dialect Phoneme-Level BERT
by: Yamauchi, Kazuki, et al.
Published: (2024)

Evaluating Speech-to-Text x LLM x Text-to-Speech Combinations for AI Interview Systems
by: Allbert, Rumi, et al.
Published: (2025)

ArFake: A Multi-Dialect Benchmark and Baselines for Arabic Spoof-Speech Detection
by: Maged, Mohamed, et al.
Published: (2025)

Scaling Speech-Text Pre-training with Synthetic Interleaved Data
by: Zeng, Aohan, et al.
Published: (2024)

Scheduled Interleaved Speech-Text Training for Speech-to-Speech Translation with LLMs
by: Futami, Hayato, et al.
Published: (2025)

Towards Comprehensive Semantic Speech Embeddings for Chinese Dialects
by: Chang, Kalvin, et al.
Published: (2026)

TMD-TTS: A Unified Tibetan Multi-Dialect Text-to-Speech Framework for Ü-Tsang, Amdo and Kham Speech Dataset Generation
by: Liu, Yutong, et al.
Published: (2025)

PolyNorm: Few-Shot LLM-Based Text Normalization for Text-to-Speech
by: Wong, Michel, et al.
Published: (2025)

Speech-to-Speech Translation Pipelines for Conversations in Low-Resource Languages
by: Popescu-Belis, Andrei, et al.
Published: (2025)

WenetSpeech-Chuan: A Large-Scale Sichuanese Corpus with Rich Annotation for Dialectal Speech Processing
by: Dai, Yuhang, et al.
Published: (2025)

Harmful Speech Detection by Language Models Exhibits Gender-Queer Dialect Bias
by: Dorn, Rebecca, et al.
Published: (2024)

Dialectal Coverage And Generalization in Arabic Speech Recognition
by: Djanibekov, Amirbek, et al.
Published: (2024)

Doing More with Less: Data Augmentation for Sudanese Dialect Automatic Speech Recognition
by: Mansour, Ayman
Published: (2026)

Arab Voices: Mapping Standard and Dialectal Arabic Speech Technology
by: Sullivan, Peter, et al.
Published: (2026)

Streaming Speech-to-Text Translation with a SpeechLLM
by: Parcollet, Titouan, et al.
Published: (2026)

Saar-Voice: A Multi-Speaker Saarbrücken Dialect Speech Corpus
by: Oberkircher, Lena S., et al.
Published: (2026)

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
by: Zhao, Xingjian, et al.
Published: (2025)

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
by: Shikhar, Sambal, et al.
Published: (2025)

Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition
by: Cornell, Samuele, et al.
Published: (2024)

VoxHakka: A Dialectally Diverse Multi-speaker Text-to-Speech System for Taiwanese Hakka
by: Chen, Li-Wei, et al.
Published: (2024)

Adaptive Inner Speech-Text Alignment for LLM-based Speech Translation
by: Liu, Henglyu, et al.
Published: (2025)

Named Entity Recognition for Address Extraction in Speech-to-Text Transcriptions Using Synthetic Data
by: Lajčinová, Bibiána, et al.
Published: (2024)

LinTO Audio and Textual Datasets to Train and Evaluate Automatic Speech Recognition in Tunisian Arabic Dialect
by: Naouara, Hedi, et al.
Published: (2025)

Multilingual Extraction and Recognition of Implicit Discourse Relations in Speech and Text
by: Ruby, Ahmed, et al.
Published: (2026)

SpeechDialogueFactory: Generating High-Quality Speech Dialogue Data to Accelerate Your Speech-LLM Development
by: Wang, Minghan, et al.
Published: (2025)

PolySpeech-100: A Large-Scale Benchmark for Speech Understanding Across 100+ Languages and Dialects
by: Yang, Sicheng, et al.
Published: (2026)