| Main Authors: | Taguchi, Chihiro, Saransig, Jefferson, Velásquez, Dayana, Chiang, David |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.15501 |
Similar Items
Automatic Speech Recognition for Documenting Endangered Languages: Case Study of Ikema Miyakoan
by: Taguchi, Chihiro, et al.
Published: (2026)
Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't
by: Taguchi, Chihiro, et al.
Published: (2024)
Languages Still Left Behind: Toward a Better Multilingual Machine Translation Benchmark
by: Taguchi, Chihiro, et al.
Published: (2025)
Efficient Context Selection for Long-Context QA: No Tuning, No Iteration, Just Adaptive-$k$
by: Taguchi, Chihiro, et al.
Published: (2025)
Automatic Speech Recognition for Sanskrit with Transfer Learning
by: Sadhukhan, Bidit, et al.
Published: (2025)
Automatic Speech Recognition for Greek Medical Dictation
by: Georgilas, Vardis, et al.
Published: (2025)
Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
by: Jain, Yash, et al.
Published: (2024)
Augmenting Automatic Speech Recognition Models with Disfluency Detection
by: Amann, Robin, et al.
Published: (2024)
VietMed: A Dataset and Benchmark for Automatic Speech Recognition of Vietnamese in the Medical Domain
by: Le-Duc, Khai
Published: (2024)
Error-preserving Automatic Speech Recognition of Young English Learners' Language
by: Michot, Janick, et al.
Published: (2024)
Fairness of Automatic Speech Recognition: Looking Through a Philosophical Lens
by: Choi, Anna Seo Gyeong, et al.
Published: (2025)
Speech Retrieval-Augmented Generation without Automatic Speech Recognition
by: Min, Do June, et al.
Published: (2024)
Handling Numeric Expressions in Automatic Speech Recognition
by: Huber, Christian, et al.
Published: (2024)
A New Benchmark for Evaluating Automatic Speech Recognition in the Arabic Call Domain
by: Obaidah, Qusai Abo, et al.
Published: (2024)
Doing More with Less: Data Augmentation for Sudanese Dialect Automatic Speech Recognition
by: Mansour, Ayman
Published: (2026)
SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research
by: Attia, Ahmed Adel, et al.
Published: (2025)
SENS-ASR: Semantic Embedding injection in Neural-transducer for Streaming Automatic Speech Recognition
by: Dkhissi, Youness, et al.
Published: (2026)
Benchmarking Rotary Position Embeddings for Automatic Speech Recognition
by: Zhang, Shucong, et al.
Published: (2025)
LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families
by: Chen, Jianan, et al.
Published: (2026)
Semantically Corrected Amharic Automatic Speech Recognition
by: Adnew, Samuael, et al.
Published: (2024)
Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring
by: Sudarshan, Ankitha, et al.
Published: (2023)
ASKD-Whisper: Adaptive Self-knowledge Distillation for Efficient and Low-Latency Automatic Speech Recognition
by: Lee, Junseok, et al.
Published: (2026)
Benchmarking Automatic Speech Recognition for Indian Languages in Agricultural Contexts
by: S, Chandrashekar M, et al.
Published: (2026)
Towards Unsupervised Speech Recognition at the Syllable-Level
by: Wang, Liming, et al.
Published: (2025)
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
by: Chen, Chen, et al.
Published: (2024)
A Comparative Analysis of Bilingual and Trilingual Wav2Vec Models for Automatic Speech Recognition in Multilingual Oral History Archives
by: Lehečka, Jan, et al.
Published: (2024)
LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
by: Ghosh, Sreyan, et al.
Published: (2024)
Multistage Fine-tuning Strategies for Automatic Speech Recognition in Low-resource Languages
by: Pillai, Leena G, et al.
Published: (2024)
Survey of End-to-End Multi-Speaker Automatic Speech Recognition for Monaural Audio
by: He, Xinlu, et al.
Published: (2025)
Arabic Little STT: Arabic Children Speech Recognition Dataset
by: Alkadri, Mouhand, et al.
Published: (2025)
Bias Vector: Mitigating Biases in Language Models with Task Arithmetic Approach
by: Shirafuji, Daiki, et al.
Published: (2024)
Improving Speech Recognition Error Prediction for Modern and Off-the-shelf Speech Recognizers
by: Serai, Prashant, et al.
Published: (2024)
Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition
by: Zhang, Wei, et al.
Published: (2025)
Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices
by: Kim, Gwantae, et al.
Published: (2024)
How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena
by: Gaido, Marco, et al.
Published: (2024)
Semantic Differentiation in Speech Emotion Recognition: Insights from Descriptive and Expressive Speech Roles
by: Guo, Rongchen, et al.
Published: (2025)
Towards End-to-End Training of Automatic Speech Recognition for Nigerian Pidgin
by: Rufai, Amina Mardiyyah, et al.
Published: (2020)
Language Bias in Self-Supervised Learning For Automatic Speech Recognition
by: Storey, Edward, et al.
Published: (2025)
Empirical Evaluation of Public HateSpeech Datasets
by: Jaf, Sadar, et al.
Published: (2024)
In-context Language Learning for Endangered Languages in Speech Recognition
by: Li, Zhaolin, et al.
Published: (2025)