Saved in:
Bibliographic Details
Main Author: Singh, Karamvir
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.08973
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914190913961984
author Singh, Karamvir
author_facet Singh, Karamvir
contents This research presents a novel approach to enhancing automatic speech recognition systems by integrating noise detection capabilities directly into the recognition architecture. Building upon the wav2vec2 framework, the proposed method incorporates a dedicated noise identification module that operates concurrently with speech transcription. Experimental validation using publicly available speech and environmental audio datasets demonstrates substantial improvements in transcription quality and noise discrimination. The enhanced system achieves superior performance in word error rate, character error rate, and noise detection accuracy compared to conventional architectures. Results indicate that joint optimization of transcription and noise classification objectives yields more reliable speech recognition in challenging acoustic conditions.
format Preprint
id arxiv_https___arxiv_org_abs_2512_08973
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Enhancing Automatic Speech Recognition Through Integrated Noise Detection Architecture
Singh, Karamvir
Sound
Artificial Intelligence
Machine Learning
Audio and Speech Processing
This research presents a novel approach to enhancing automatic speech recognition systems by integrating noise detection capabilities directly into the recognition architecture. Building upon the wav2vec2 framework, the proposed method incorporates a dedicated noise identification module that operates concurrently with speech transcription. Experimental validation using publicly available speech and environmental audio datasets demonstrates substantial improvements in transcription quality and noise discrimination. The enhanced system achieves superior performance in word error rate, character error rate, and noise detection accuracy compared to conventional architectures. Results indicate that joint optimization of transcription and noise classification objectives yields more reliable speech recognition in challenging acoustic conditions.
title Enhancing Automatic Speech Recognition Through Integrated Noise Detection Architecture
topic Sound
Artificial Intelligence
Machine Learning
Audio and Speech Processing
url https://arxiv.org/abs/2512.08973