:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Robatian, Amin, Hajipour, Mohammad, Peyghan, Mohammad Reza, Rajabi, Fatemeh, Amini, Sajjad, Ghaemmaghami, Shahrokh, Gholampour, Iman
Format:	Preprint
Published:	2025
Subjects:	Audio and Speech Processing Artificial Intelligence Sound
Online Access:	https://arxiv.org/abs/2501.10734
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Non-Intrusive Automatic Speech Recognition Refinement: A Survey
by: Peyghan, Mohammad Reza, et al.
Published: (2025)

Speech Retrieval-Augmented Generation without Automatic Speech Recognition
by: Min, Do June, et al.
Published: (2024)

Retrieval Augmented Correction of Named Entity Speech Recognition Errors
by: Pusateri, Ernest, et al.
Published: (2024)

RAG-Boost: Retrieval-Augmented Generation Enhanced LLM-based Speech Recognition
by: Wang, Pengcheng, et al.
Published: (2025)

Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation
by: Ghosh, Sreyan, et al.
Published: (2024)

LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
by: Ghosh, Sreyan, et al.
Published: (2024)

Exploring Generative Error Correction for Dysarthric Speech Recognition
by: La Quatra, Moreno, et al.
Published: (2025)

Improving Automatic Speech Recognition for Speakers Treated for Oral Cancer using Data Augmentation and LLM Error Correction
by: Folkertsma, Hidde, et al.
Published: (2026)

UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction
by: Guo, Jiaxin, et al.
Published: (2024)

SpeechT-RAG: Reliable Depression Detection in LLMs with Retrieval-Augmented Generation Using Speech Timing Information
by: Zhang, Xiangyu, et al.
Published: (2025)

Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction
by: Ko, Yuka, et al.
Published: (2024)

Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition
by: Shu, Yuchun, et al.
Published: (2024)

Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition
by: Liu, Rui, et al.
Published: (2025)

MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition
by: Mu, Bingshen, et al.
Published: (2024)

Augmenting Polish Automatic Speech Recognition System With Synthetic Data
by: Bondaruk, Łukasz, et al.
Published: (2024)

Retrieving Effective Acoustic Impedance and Refractive Index for Size Mismatch Samples
by: Khodaei, Mohammad Javad, et al.
Published: (2021)

Accent-Invariant Automatic Speech Recognition via Saliency-Driven Spectrogram Masking
by: Sameti, Mohammad Hossein, et al.
Published: (2025)

Training Data Augmentation for Dysarthric Automatic Speech Recognition by Text-to-Dysarthric-Speech Synthesis
by: Leung, Wing-Zin, et al.
Published: (2024)

Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora
by: Nespoli, Francesco, et al.
Published: (2024)

Semantically Corrected Amharic Automatic Speech Recognition
by: Adnew, Samuael, et al.
Published: (2024)

Automatic Speech Recognition System-Independent Word Error Rate Estimation
by: Park, Chanho, et al.
Published: (2024)

Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models
by: Frieske, Rita, et al.
Published: (2024)

Word Level Timestamp Generation for Automatic Speech Recognition and Translation
by: Hu, Ke, et al.
Published: (2025)

Mixture of LoRA Experts with Multi-Modal and Multi-Granularity LLM Generative Error Correction for Accented Speech Recognition
by: Mu, Bingshen, et al.
Published: (2025)

Automatic Speech Recognition Biases in Newcastle English: an Error Analysis
by: Serditova, Dana, et al.
Published: (2025)

Retrieval-Augmented Speech Recognition Approach for Domain Challenges
by: Shen, Peng, et al.
Published: (2025)

Fusion of Discrete Representations and Self-Augmented Representations for Multilingual Automatic Speech Recognition
by: Wang, Shih-heng, et al.
Published: (2024)

Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting
by: Yang, Chao-Han Huck, et al.
Published: (2023)

XEmoRAG: Cross-Lingual Emotion Transfer with Controllable Intensity Using Retrieval-Augmented Generation
by: Zuo, Tianlun, et al.
Published: (2025)

Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition
by: Ravenscroft, William, et al.
Published: (2024)

CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASR
by: Shankar, Natarajan Balaji, et al.
Published: (2025)

Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
by: Radhakrishnan, Srijith, et al.
Published: (2023)

WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models
by: Chen, Yifu, et al.
Published: (2025)

LA-RAG:Enhancing LLM-based ASR Accuracy with Retrieval-Augmented Generation
by: Li, Shaojun, et al.
Published: (2024)

Fairness of Automatic Speech Recognition in Cleft Lip and Palate Speech
by: Bhattacharjee, Susmita, et al.
Published: (2025)

Full-text Error Correction for Chinese Speech Recognition with Large Language Model
by: Tang, Zhiyuan, et al.
Published: (2024)

The RoyalFlush Automatic Speech Diarization and Recognition System for In-Car Multi-Channel Automatic Speech Recognition Challenge
by: Tian, Jingguang, et al.
Published: (2024)

Automatic Speech Recognition for Hindi
by: Saha, Anish, et al.
Published: (2024)

On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition
by: Rossenbach, Nick, et al.
Published: (2024)

Speaker Attributed Automatic Speech Recognition Using Speech Aware LLMS
by: Aronowitz, Hagai, et al.
Published: (2026)