Saved in:
| Main Authors: | Udagawa, Takuma, Suzuki, Masayuki, Muraoka, Masayasu, Kurata, Gakuto |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.13300 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Contextual Biasing for ASR in Speech LLM with Common Word Cues and Bias Word Position Prediction
by: Novitasari, Sashi, et al.
Published: (2026)
by: Novitasari, Sashi, et al.
Published: (2026)
Large Language Models based ASR Error Correction for Child Conversations
by: Xu, Anfeng, et al.
Published: (2025)
by: Xu, Anfeng, et al.
Published: (2025)
ASR Error Correction using Large Language Models
by: Ma, Rao, et al.
Published: (2024)
by: Ma, Rao, et al.
Published: (2024)
Crossmodal ASR Error Correction with Discrete Speech Units
by: Li, Yuanchao, et al.
Published: (2024)
by: Li, Yuanchao, et al.
Published: (2024)
ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction
by: Wei, Victor Junqiu, et al.
Published: (2024)
by: Wei, Victor Junqiu, et al.
Published: (2024)
Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation
by: Ghosh, Sreyan, et al.
Published: (2024)
by: Ghosh, Sreyan, et al.
Published: (2024)
Evolutionary Prompt Design for LLM-Based Post-ASR Error Correction
by: Sachdev, Rithik, et al.
Published: (2024)
by: Sachdev, Rithik, et al.
Published: (2024)
Fewer Hallucinations, More Verification: A Three-Stage LLM-Based Framework for ASR Error Correction
by: Fang, Yangui, et al.
Published: (2025)
by: Fang, Yangui, et al.
Published: (2025)
Large Language Model Should Understand Pinyin for Chinese ASR Error Correction
by: Li, Yuang, et al.
Published: (2024)
by: Li, Yuang, et al.
Published: (2024)
Revisiting ASR Error Correction with Specialized Models
by: Gu, Zijin, et al.
Published: (2024)
by: Gu, Zijin, et al.
Published: (2024)
Better Pseudo-labeling with Multi-ASR Fusion and Error Correction by SpeechLLM
by: Prakash, Jeena, et al.
Published: (2025)
by: Prakash, Jeena, et al.
Published: (2025)
Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction
by: Ko, Yuka, et al.
Published: (2024)
by: Ko, Yuka, et al.
Published: (2024)
Advocating Character Error Rate for Multilingual ASR Evaluation
by: K, Thennal D, et al.
Published: (2024)
by: K, Thennal D, et al.
Published: (2024)
Causal Structure Discovery for Error Diagnostics of Children's ASR
by: Singh, Vishwanath Pratap, et al.
Published: (2025)
by: Singh, Vishwanath Pratap, et al.
Published: (2025)
Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering
by: Carofilis, Andres, et al.
Published: (2025)
by: Carofilis, Andres, et al.
Published: (2025)
Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering
by: Rangappa, Pradeep, et al.
Published: (2025)
by: Rangappa, Pradeep, et al.
Published: (2025)
FlanEC: Exploring Flan-T5 for Post-ASR Error Correction
by: La Quatra, Moreno, et al.
Published: (2025)
by: La Quatra, Moreno, et al.
Published: (2025)
Revisiting Acoustic Features for Robust ASR
by: Shah, Muhammad A., et al.
Published: (2024)
by: Shah, Muhammad A., et al.
Published: (2024)
Analyzing Error Propagation in Korean Spoken QA with ASR-LLM Cascades
by: Jung, Donghyuk, et al.
Published: (2026)
by: Jung, Donghyuk, et al.
Published: (2026)
NIM4-ASR: Towards Efficient, Robust, and Customizable Real-Time LLM-Based ASR
by: Xie, Yuan, et al.
Published: (2026)
by: Xie, Yuan, et al.
Published: (2026)
Spelling Correction through Rewriting of Non-Autoregressive ASR Lattices
by: Velikovich, Leonid, et al.
Published: (2024)
by: Velikovich, Leonid, et al.
Published: (2024)
An Exhaustive Evaluation of TTS- and VC-based Data Augmentation for ASR
by: Ogun, Sewade, et al.
Published: (2025)
by: Ogun, Sewade, et al.
Published: (2025)
Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator
by: Sun, Guangzhi, et al.
Published: (2022)
by: Sun, Guangzhi, et al.
Published: (2022)
PMF-CEC: Phoneme-augmented Multimodal Fusion for Context-aware ASR Error Correction with Error-specific Selective Decoding
by: He, Jiajun, et al.
Published: (2025)
by: He, Jiajun, et al.
Published: (2025)
The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders?
by: Adedeji, Ayo, et al.
Published: (2025)
by: Adedeji, Ayo, et al.
Published: (2025)
Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
by: Everson, Kevin, et al.
Published: (2024)
by: Everson, Kevin, et al.
Published: (2024)
Exploring Generative Error Correction for Dysarthric Speech Recognition
by: La Quatra, Moreno, et al.
Published: (2025)
by: La Quatra, Moreno, et al.
Published: (2025)
Efficient ASR for Low-Resource Languages: Leveraging Cross-Lingual Unlabeled Data
by: Bandarupalli, Srihari, et al.
Published: (2025)
by: Bandarupalli, Srihari, et al.
Published: (2025)
PromptASR for contextualized ASR with controllable style
by: Yang, Xiaoyu, et al.
Published: (2023)
by: Yang, Xiaoyu, et al.
Published: (2023)
LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic Context
by: Yamashita, Natsuo, et al.
Published: (2025)
by: Yamashita, Natsuo, et al.
Published: (2025)
Improving ASR Contextual Biasing with Guided Attention
by: Tang, Jiyang, et al.
Published: (2024)
by: Tang, Jiyang, et al.
Published: (2024)
Full-text Error Correction for Chinese Speech Recognition with Large Language Model
by: Tang, Zhiyuan, et al.
Published: (2024)
by: Tang, Zhiyuan, et al.
Published: (2024)
MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models
by: Nguyen, Thai-Binh, et al.
Published: (2024)
by: Nguyen, Thai-Binh, et al.
Published: (2024)
A Comprehensive Study on the Effectiveness of ASR Representations for Noise-Robust Speech Emotion Recognition
by: Shi, Xiaohan, et al.
Published: (2023)
by: Shi, Xiaohan, et al.
Published: (2023)
Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages
by: Cheng, Yao-Fei, et al.
Published: (2024)
by: Cheng, Yao-Fei, et al.
Published: (2024)
Towards scalable efficient on-device ASR with transfer learning
by: Pandey, Laxmi, et al.
Published: (2024)
by: Pandey, Laxmi, et al.
Published: (2024)
Optimizing Byte-level Representation for End-to-end ASR
by: Hsiao, Roger, et al.
Published: (2024)
by: Hsiao, Roger, et al.
Published: (2024)
Building English ASR model with regional language support
by: Agrawal, Purvi, et al.
Published: (2025)
by: Agrawal, Purvi, et al.
Published: (2025)
Retrieval Augmented Generation based context discovery for ASR
by: Siskos, Dimitrios, et al.
Published: (2025)
by: Siskos, Dimitrios, et al.
Published: (2025)
AutoMode-ASR: Learning to Select ASR Systems for Better Quality and Cost
by: Gündüz, Ahmet, et al.
Published: (2024)
by: Gündüz, Ahmet, et al.
Published: (2024)
Similar Items
-
Contextual Biasing for ASR in Speech LLM with Common Word Cues and Bias Word Position Prediction
by: Novitasari, Sashi, et al.
Published: (2026) -
Large Language Models based ASR Error Correction for Child Conversations
by: Xu, Anfeng, et al.
Published: (2025) -
ASR Error Correction using Large Language Models
by: Ma, Rao, et al.
Published: (2024) -
Crossmodal ASR Error Correction with Discrete Speech Units
by: Li, Yuanchao, et al.
Published: (2024) -
ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction
by: Wei, Victor Junqiu, et al.
Published: (2024)