:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Tipaksorn, Pattara, Thatphithakkul, Sumonmas, Chunwijitra, Vataya, Thangthai, Kwanchiva
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Sound
Online Access:	https://arxiv.org/abs/2509.18722
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

HypR: A comprehensive study for ASR hypothesis revising with a reference corpus
by: Wang, Yi-Wei, et al.
Published: (2023)

A corpus-based investigation of pitch contours of monosyllabic words in conversational Taiwan Mandarin
by: Jin, Xiaoyun, et al.
Published: (2024)

Bridging the gap: A comparative exploration of Speech-LLM and end-to-end architecture for multilingual conversational ASR
by: Mei, Yuxiang, et al.
Published: (2026)

CantoASR: Prosody-Aware ASR-LALM Collaboration for Low-Resource Cantonese
by: Chen, Dazhong, et al.
Published: (2025)

Improving endpoint detection in end-to-end streaming ASR for conversational speech
by: C, Anandh, et al.
Published: (2025)

PromptASR for contextualized ASR with controllable style
by: Yang, Xiaoyu, et al.
Published: (2023)

MLMA: Towards Multilingual ASR With Mamba-based Architectures
by: Ali, Mohamed Nabih, et al.
Published: (2025)

A Calculus-Based Framework for Determining Vocabulary Size in End-to-End ASR
by: Kopparapu, Sunil Kumar
Published: (2026)

DARS: Dysarthria-Aware Rhythm-Style Synthesis for ASR Enhancement
by: Wu, Minghui, et al.
Published: (2026)

PROFASR-BENCH: A Benchmark for Context-Conditioned ASR in High-Stakes Professional Speech
by: Piskala, Deepak Babu
Published: (2025)

Elderly-Contextual Data Augmentation via Speech Synthesis for Elderly ASR
by: Lee, Minsik, et al.
Published: (2026)

MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models
by: Nguyen, Thai-Binh, et al.
Published: (2024)

M-CIF: Multi-Scale Alignment For CIF-Based Non-Autoregressive ASR
by: Mao, Ruixiang, et al.
Published: (2025)

AutoMode-ASR: Learning to Select ASR Systems for Better Quality and Cost
by: Gündüz, Ahmet, et al.
Published: (2024)

On the Role of Encoder Depth: Pruning Whisper and LoRA Fine-Tuning in SLAM-ASR
by: Kolluri, Ganesh Pavan Kartikeya Bharadwaj, et al.
Published: (2026)

ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction
by: Wei, Victor Junqiu, et al.
Published: (2024)

Uni-ASR: Unified LLM-Based Architecture for Non-Streaming and Streaming Automatic Speech Recognition
by: Xia, Yinfeng, et al.
Published: (2026)

Romanization Encoding For Multilingual ASR
by: Ding, Wen, et al.
Published: (2024)

CLiFT-ASR: A Cross-Lingual Fine-Tuning Framework for Low-Resource Taiwanese Hokkien Speech Recognition
by: Sung, Hung-Yang, et al.
Published: (2025)

NIM4-ASR: Towards Efficient, Robust, and Customizable Real-Time LLM-Based ASR
by: Xie, Yuan, et al.
Published: (2026)

Revealing the Role of Audio Channels in ASR Performance Degradation
by: Huang, Kuan-Tang, et al.
Published: (2025)

Promptformer: Prompted Conformer Transducer for ASR
by: Duarte-Torres, Sergio, et al.
Published: (2024)

Qwen3-ASR Technical Report
by: Shi, Xian, et al.
Published: (2026)

Revisiting Acoustic Features for Robust ASR
by: Shah, Muhammad A., et al.
Published: (2024)

Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices
by: King, Evan, et al.
Published: (2025)

Vedavani: A Benchmark Corpus for ASR on Vedic Sanskrit Poetry
by: Kumar, Sujeet, et al.
Published: (2025)

Semi-Autoregressive Streaming ASR With Label Context
by: Arora, Siddhant, et al.
Published: (2023)

Exploring SSL Discrete Tokens for Multilingual ASR
by: Cui, Mingyu, et al.
Published: (2024)

Configurable Multilingual ASR with Speech Summary Representations
by: Zhu, Harrison, et al.
Published: (2024)

ManWav: The First Manchu ASR Model
by: Seo, Jean, et al.
Published: (2024)

Mamba for Streaming ASR Combined with Unimodal Aggregation
by: Fang, Ying, et al.
Published: (2024)

The TTS-STT Flywheel: Synthetic Entity-Dense Audio Closes the Indic ASR Gap Where Commercial and Open-Source Systems Fail
by: Menta, Venkata Pushpak Teja
Published: (2026)

A new kid on the block: Distributional semantics predicts the word-specific tone signatures of monosyllabic words in conversational Taiwan Mandarin
by: Jin, Xiaoyun, et al.
Published: (2025)

ContextASR-Bench: A Massive Contextual Speech Recognition Benchmark
by: Wang, He, et al.
Published: (2025)

Nwāchā Munā: A Devanagari Speech Corpus and Proximal Transfer Benchmark for Nepal Bhasha ASR
by: Sharma, Rishikesh Kumar, et al.
Published: (2026)

Scalable Offline ASR for Command-Style Dictation in Courtrooms
by: Nethil, Kumarmanas, et al.
Published: (2025)

Unifying Diarization, Separation, and ASR with Multi-Speaker Encoder
by: Shakeel, Muhammad, et al.
Published: (2025)

Causal Structure Discovery for Error Diagnostics of Children's ASR
by: Singh, Vishwanath Pratap, et al.
Published: (2025)

Performant ASR Models for Medical Entities in Accented Speech
by: Afonja, Tejumade, et al.
Published: (2024)

Reverb: Open-Source ASR and Diarization from Rev
by: Bhandari, Nishchal, et al.
Published: (2024)