:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Udagawa, Takuma, Suzuki, Masayuki, Muraoka, Masayasu, Kurata, Gakuto
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2407.13300
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Contextual Biasing for ASR in Speech LLM with Common Word Cues and Bias Word Position Prediction
by: Novitasari, Sashi, et al.
Published: (2026)

Large Language Models based ASR Error Correction for Child Conversations
by: Xu, Anfeng, et al.
Published: (2025)

ASR Error Correction using Large Language Models
by: Ma, Rao, et al.
Published: (2024)

Crossmodal ASR Error Correction with Discrete Speech Units
by: Li, Yuanchao, et al.
Published: (2024)

ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction
by: Wei, Victor Junqiu, et al.
Published: (2024)

Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation
by: Ghosh, Sreyan, et al.
Published: (2024)

Evolutionary Prompt Design for LLM-Based Post-ASR Error Correction
by: Sachdev, Rithik, et al.
Published: (2024)

Fewer Hallucinations, More Verification: A Three-Stage LLM-Based Framework for ASR Error Correction
by: Fang, Yangui, et al.
Published: (2025)

Large Language Model Should Understand Pinyin for Chinese ASR Error Correction
by: Li, Yuang, et al.
Published: (2024)

Revisiting ASR Error Correction with Specialized Models
by: Gu, Zijin, et al.
Published: (2024)

Better Pseudo-labeling with Multi-ASR Fusion and Error Correction by SpeechLLM
by: Prakash, Jeena, et al.
Published: (2025)

Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction
by: Ko, Yuka, et al.
Published: (2024)

Advocating Character Error Rate for Multilingual ASR Evaluation
by: K, Thennal D, et al.
Published: (2024)

Causal Structure Discovery for Error Diagnostics of Children's ASR
by: Singh, Vishwanath Pratap, et al.
Published: (2025)

Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering
by: Carofilis, Andres, et al.
Published: (2025)

Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering
by: Rangappa, Pradeep, et al.
Published: (2025)

FlanEC: Exploring Flan-T5 for Post-ASR Error Correction
by: La Quatra, Moreno, et al.
Published: (2025)

Revisiting Acoustic Features for Robust ASR
by: Shah, Muhammad A., et al.
Published: (2024)

Analyzing Error Propagation in Korean Spoken QA with ASR-LLM Cascades
by: Jung, Donghyuk, et al.
Published: (2026)

NIM4-ASR: Towards Efficient, Robust, and Customizable Real-Time LLM-Based ASR
by: Xie, Yuan, et al.
Published: (2026)

Spelling Correction through Rewriting of Non-Autoregressive ASR Lattices
by: Velikovich, Leonid, et al.
Published: (2024)

An Exhaustive Evaluation of TTS- and VC-based Data Augmentation for ASR
by: Ogun, Sewade, et al.
Published: (2025)

Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator
by: Sun, Guangzhi, et al.
Published: (2022)

PMF-CEC: Phoneme-augmented Multimodal Fusion for Context-aware ASR Error Correction with Error-specific Selective Decoding
by: He, Jiajun, et al.
Published: (2025)

The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders?
by: Adedeji, Ayo, et al.
Published: (2025)

Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
by: Everson, Kevin, et al.
Published: (2024)

Exploring Generative Error Correction for Dysarthric Speech Recognition
by: La Quatra, Moreno, et al.
Published: (2025)

Efficient ASR for Low-Resource Languages: Leveraging Cross-Lingual Unlabeled Data
by: Bandarupalli, Srihari, et al.
Published: (2025)

PromptASR for contextualized ASR with controllable style
by: Yang, Xiaoyu, et al.
Published: (2023)

LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic Context
by: Yamashita, Natsuo, et al.
Published: (2025)

Improving ASR Contextual Biasing with Guided Attention
by: Tang, Jiyang, et al.
Published: (2024)

Full-text Error Correction for Chinese Speech Recognition with Large Language Model
by: Tang, Zhiyuan, et al.
Published: (2024)

MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models
by: Nguyen, Thai-Binh, et al.
Published: (2024)

A Comprehensive Study on the Effectiveness of ASR Representations for Noise-Robust Speech Emotion Recognition
by: Shi, Xiaohan, et al.
Published: (2023)

Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages
by: Cheng, Yao-Fei, et al.
Published: (2024)

Towards scalable efficient on-device ASR with transfer learning
by: Pandey, Laxmi, et al.
Published: (2024)

Optimizing Byte-level Representation for End-to-end ASR
by: Hsiao, Roger, et al.
Published: (2024)

Building English ASR model with regional language support
by: Agrawal, Purvi, et al.
Published: (2025)

Retrieval Augmented Generation based context discovery for ASR
by: Siskos, Dimitrios, et al.
Published: (2025)

AutoMode-ASR: Learning to Select ASR Systems for Better Quality and Cost
by: Gündüz, Ahmet, et al.
Published: (2024)