Library Catalog

Bibliographic Details
Main Authors:	Lee, Seonwoo; Mun, Jihyun; Kim, Sunhee; Chung, Minhwa
Format:	Preprint
Published:	2024
Subjects:	Audio and Speech Processing; Computation and Language
Online Access:	https://arxiv.org/abs/2402.15539

Similar Items

Developing an End-to-End Framework for Predicting the Social Communication Severity Scores of Children with Autism Spectrum Disorder
by: Mun, Jihyun, et al.
Published: (2024)

Evaluating Automatic Speech Recognition Systems for Korean Meteorological Experts
by: Park, ChaeHun, et al.
Published: (2024)

Non-native Children's Automatic Speech Assessment Challenge (NOCASA)
by: Getman, Yaroslav, et al.
Published: (2025)

Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean children
by: Ahn, Taekyung, et al.
Published: (2024)

Automatic Screening for Children with Speech Disorder using Automatic Speech Recognition: Opportunities and Challenges
by: Liu, Dancheng, et al.
Published: (2024)

HiKE: Hierarchical Evaluation Framework for Korean-English Code-Switching Speech Recognition
by: Paik, Gio, et al.
Published: (2025)

Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults
by: Attia, Ahmed Adel, et al.
Published: (2023)

Exploring Speech Pattern Disorders in Autism using Machine Learning
by: Hu, Chuanbo, et al.
Published: (2024)

Lightweight Audio Segmentation for Long-form Speech Translation
by: Lee, Jaesong, et al.
Published: (2024)

Optimizing Automatic Speech Assessment: W-RankSim Regularization and Hybrid Feature Fusion Strategies
by: Wu, Chung-Wen, et al.
Published: (2024)

ART: The Alternating Reading Task Corpus for Speech Entrainment and Imitation
by: Yuan, Zheng, et al.
Published: (2024)

FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech Data
by: Liu, Dancheng, et al.
Published: (2024)

ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus
by: Ogunremi, Tolulope, et al.
Published: (2023)

Dub-S2ST: Textless Speech-to-Speech Translation for Seamless Dubbing
by: Choi, Jeongsoo, et al.
Published: (2025)

SMILE: Speech Meta In-Context Learning for Low-Resource Language Automatic Speech Recognition
by: Hsu, Ming-Hao, et al.
Published: (2024)

GSA-TTS : Toward Zero-Shot Speech Synthesis based on Gradual Style Adaptor
by: Lee, Seokgi, et al.
Published: (2025)

Advancing Speech Translation: A Corpus of Mandarin-English Conversational Telephone Speech
by: Wotherspoon, Shannon, et al.
Published: (2024)

Swedish Whispers; Leveraging a Massive Speech Corpus for Swedish Speech Recognition
by: Vesterbacka, Leonora, et al.
Published: (2025)

PhoWhisper: Automatic Speech Recognition for Vietnamese
by: Le, Thanh-Thien, et al.
Published: (2024)

SpeechColab Leaderboard: An Open-Source Platform for Automatic Speech Recognition Evaluation
by: Du, Jiayu, et al.
Published: (2024)

Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices
by: Kim, Gwantae, et al.
Published: (2024)

TeluguST-46: A Benchmark Corpus and Comprehensive Evaluation for Telugu-English Speech Translation
by: Akkiraju, Bhavana, et al.
Published: (2025)

Automatic Speech Recognition for Hindi
by: Saha, Anish, et al.
Published: (2024)

Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition
by: Lin, Yi-Cheng, et al.
Published: (2025)

Leveraging the Interplay Between Syntactic and Acoustic Cues for Optimizing Korean TTS Pause Formation
by: Jeon, Yejin, et al.
Published: (2024)

Automatic Speech Recognition System-Independent Word Error Rate Estimation
by: Park, Chanho, et al.
Published: (2024)

A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions
by: Wang, Chung-Chun, et al.
Published: (2025)

TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality
by: Feng, Tiantian, et al.
Published: (2024)

Automatic Speech Recognition with BERT and CTC Transformers: A Review
by: Djeffal, Noussaiba, et al.
Published: (2024)

Speech-Based Depression Prediction Using Encoder-Weight-Only Transfer Learning and a Large Corpus
by: Harati, Amir, et al.
Published: (2024)

AdaptVC: High Quality Voice Conversion with Adaptive Learning
by: Kim, Jaehun, et al.
Published: (2025)

A Large Dataset of Spontaneous Speech with the Accent Spoken in São Paulo for Automatic Speech Recognition Evaluation
by: Lima, Rodrigo, et al.
Published: (2024)

ELF: Encoding Speaker-Specific Latent Speech Feature for Speech Synthesis
by: Kong, Jungil, et al.
Published: (2023)

Multi-Level Embedding Conformer Framework for Bengali Automatic Speech Recognition
by: Sakib, Md. Nazmus, et al.
Published: (2025)

CAMÕES: A Comprehensive Automatic Speech Recognition Benchmark for European Portuguese
by: Carvalho, Carlos, et al.
Published: (2025)

Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition
by: Noroozi, Vahid, et al.
Published: (2023)

Lost in Transcription: Identifying and Quantifying the Accuracy Biases of Automatic Speech Recognition Systems Against Disfluent Speech
by: Mujtaba, Dena, et al.
Published: (2024)

Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding
by: Hu, Jiliang, et al.
Published: (2025)

A Preliminary Analysis of Automatic Word and Syllable Prominence Detection in Non-Native Speech With Text-to-Speech Prosody Embeddings
by: Mondal, Anindita, et al.
Published: (2024)

PI-Whisper: Designing an Adaptive and Incremental Automatic Speech Recognition System for Edge Devices
by: Nassereldine, Amir, et al.
Published: (2024)