Saved in:
| Main Author: | Rahman, Hanif |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.04598 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Fine-tuning Whisper for Pashto ASR: strategies and scale
by: Rahman, Hanif
Published: (2026)
by: Rahman, Hanif
Published: (2026)
PashtoTTS-Bench: automated screening for low-resource non-Latin-script text-to-speech
by: Rahman, Hanif
Published: (2026)
by: Rahman, Hanif
Published: (2026)
Pashto Common Voice: Building the First Open Speech Corpus for a 60-Million-Speaker Low-Resource Language
by: Rahman, Hanif, et al.
Published: (2026)
by: Rahman, Hanif, et al.
Published: (2026)
PashtoCorp: A 1.25-Billion-Word Corpus, Evaluation Suite, and Reproducible Pipeline for Low-Resource Language Development
by: Rahman, Hanif
Published: (2026)
by: Rahman, Hanif
Published: (2026)
Linear Script Representations in Speech Foundation Models Enable Zero-Shot Transliteration
by: Shim, Ryan Soh-Eun, et al.
Published: (2026)
by: Shim, Ryan Soh-Eun, et al.
Published: (2026)
SN-WER: Script-Normalized WER for Multi-Script Indic ASR Evaluation
by: Pattnayak, Priyaranjan
Published: (2026)
by: Pattnayak, Priyaranjan
Published: (2026)
XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model
by: Casanova, Edresson, et al.
Published: (2024)
by: Casanova, Edresson, et al.
Published: (2024)
Classification of Spontaneous and Scripted Speech for Multilingual Audio
by: Elisha, Shahar, et al.
Published: (2024)
by: Elisha, Shahar, et al.
Published: (2024)
Script collapse in multilingual ASR: A reference-free metric and 100-pair benchmark
by: Rahman, Hanif
Published: (2026)
by: Rahman, Hanif
Published: (2026)
Configurable Multilingual ASR with Speech Summary Representations
by: Zhu, Harrison, et al.
Published: (2024)
by: Zhu, Harrison, et al.
Published: (2024)
Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages
by: Omnilingual ASR team, et al.
Published: (2025)
by: Omnilingual ASR team, et al.
Published: (2025)
Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition
by: Yen, Hao, et al.
Published: (2024)
by: Yen, Hao, et al.
Published: (2024)
Evaluating Zero-Shot Multilingual Aspect-Based Sentiment Analysis with Large Language Models
by: Wu, Chengyan, et al.
Published: (2024)
by: Wu, Chengyan, et al.
Published: (2024)
Zero-Shot Context-Aware ASR for Diverse Arabic Varieties
by: Talafha, Bashar, et al.
Published: (2025)
by: Talafha, Bashar, et al.
Published: (2025)
Linguistically Informed Evaluation of Multilingual ASR for African Languages
by: Chen, Fei-Yueh, et al.
Published: (2026)
by: Chen, Fei-Yueh, et al.
Published: (2026)
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
by: Yang, Bang, et al.
Published: (2023)
by: Yang, Bang, et al.
Published: (2023)
Building a Few-Shot Cross-Domain Multilingual NLU Model for Customer Care
by: Kumar, Saurabh, et al.
Published: (2025)
by: Kumar, Saurabh, et al.
Published: (2025)
IITR-CIOL@NLU of Devanagari Script Languages 2025: Multilingual Hate Speech Detection and Target Identification in Devanagari-Scripted Languages
by: Gupta, Siddhant, et al.
Published: (2024)
by: Gupta, Siddhant, et al.
Published: (2024)
Speak in Context: Multilingual ASR with Speech Context Alignment via Contrastive Learning
by: Zhang, Yuchen, et al.
Published: (2026)
by: Zhang, Yuchen, et al.
Published: (2026)
Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages
by: Abdullah, Badr M., et al.
Published: (2026)
by: Abdullah, Badr M., et al.
Published: (2026)
Transliterated Zero-Shot Domain Adaptation for Automatic Speech Recognition
by: Zhu, Han, et al.
Published: (2024)
by: Zhu, Han, et al.
Published: (2024)
Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR
by: Peng, Yizhou, et al.
Published: (2025)
by: Peng, Yizhou, et al.
Published: (2025)
Low-Resource Safety Failures Are Action Failures, Not Representation Failures
by: Aziz, Rashad, et al.
Published: (2026)
by: Aziz, Rashad, et al.
Published: (2026)
Language-Aware Distillation for Multilingual Instruction-Following Speech LLMs with ASR-Only Supervision
by: Gopal, Shreyas, et al.
Published: (2026)
by: Gopal, Shreyas, et al.
Published: (2026)
MultiScript30k: Leveraging Multilingual Embeddings to Extend Cross Script Parallel Data
by: Driggers-Ellis, Christopher, et al.
Published: (2025)
by: Driggers-Ellis, Christopher, et al.
Published: (2025)
Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs
by: Doddapaneni, Sumanth, et al.
Published: (2024)
by: Doddapaneni, Sumanth, et al.
Published: (2024)
Breaking the Script Barrier: Enabling Automatic Alignment for PoS-based ASR Error Analysis in Non-Latin Scripts
by: Mudi, Prasenjit K, et al.
Published: (2026)
by: Mudi, Prasenjit K, et al.
Published: (2026)
Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation
by: Srivastav, Vaibhav, et al.
Published: (2025)
by: Srivastav, Vaibhav, et al.
Published: (2025)
Zero-Shot Cross-Domain Code Search without Fine-Tuning
by: Liang, Keyu, et al.
Published: (2025)
by: Liang, Keyu, et al.
Published: (2025)
MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models
by: Nguyen, Thai-Binh, et al.
Published: (2024)
by: Nguyen, Thai-Binh, et al.
Published: (2024)
Advocating Character Error Rate for Multilingual ASR Evaluation
by: K, Thennal D, et al.
Published: (2024)
by: K, Thennal D, et al.
Published: (2024)
Benchmarking Children's ASR with Supervised and Self-supervised Speech Foundation Models
by: Fan, Ruchao, et al.
Published: (2024)
by: Fan, Ruchao, et al.
Published: (2024)
From Speech to Subtitles: Evaluating ASR Models in Subtitling Italian Television Programs
by: Lucca, Alessandro, et al.
Published: (2025)
by: Lucca, Alessandro, et al.
Published: (2025)
Language Representation Favored Zero-Shot Cross-Domain Cognitive Diagnosis
by: Liu, Shuo, et al.
Published: (2025)
by: Liu, Shuo, et al.
Published: (2025)
MEDAL: A Framework for Benchmarking LLMs as Multilingual Open-Domain Dialogue Evaluators
by: Mendonça, John, et al.
Published: (2025)
by: Mendonça, John, et al.
Published: (2025)
Unknown Script: Impact of Script on Cross-Lingual Transfer
by: Tufa, Wondimagegnhue Tsegaye, et al.
Published: (2024)
by: Tufa, Wondimagegnhue Tsegaye, et al.
Published: (2024)
ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction
by: Wei, Victor Junqiu, et al.
Published: (2024)
by: Wei, Victor Junqiu, et al.
Published: (2024)
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages
by: Hu, Jinyi, et al.
Published: (2023)
by: Hu, Jinyi, et al.
Published: (2023)
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts
by: Adilazuarda, Muhammad Farid, et al.
Published: (2025)
by: Adilazuarda, Muhammad Farid, et al.
Published: (2025)
Multilingual Language Models Encode Script Over Linguistic Structure
by: Verma, Aastha A K, et al.
Published: (2026)
by: Verma, Aastha A K, et al.
Published: (2026)
Similar Items
-
Fine-tuning Whisper for Pashto ASR: strategies and scale
by: Rahman, Hanif
Published: (2026) -
PashtoTTS-Bench: automated screening for low-resource non-Latin-script text-to-speech
by: Rahman, Hanif
Published: (2026) -
Pashto Common Voice: Building the First Open Speech Corpus for a 60-Million-Speaker Low-Resource Language
by: Rahman, Hanif, et al.
Published: (2026) -
PashtoCorp: A 1.25-Billion-Word Corpus, Evaluation Suite, and Reproducible Pipeline for Low-Resource Language Development
by: Rahman, Hanif
Published: (2026) -
Linear Script Representations in Speech Foundation Models Enable Zero-Shot Transliteration
by: Shim, Ryan Soh-Eun, et al.
Published: (2026)