Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Singh, Karamvir
Format:	Preprint
Published:	2025
Subjects:	Sound Artificial Intelligence Machine Learning Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2512.08973
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914190913961984
author	Singh, Karamvir
author_facet	Singh, Karamvir
contents	This research presents a novel approach to enhancing automatic speech recognition systems by integrating noise detection capabilities directly into the recognition architecture. Building upon the wav2vec2 framework, the proposed method incorporates a dedicated noise identification module that operates concurrently with speech transcription. Experimental validation using publicly available speech and environmental audio datasets demonstrates substantial improvements in transcription quality and noise discrimination. The enhanced system achieves superior performance in word error rate, character error rate, and noise detection accuracy compared to conventional architectures. Results indicate that joint optimization of transcription and noise classification objectives yields more reliable speech recognition in challenging acoustic conditions.
format	Preprint
id	arxiv_https___arxiv_org_abs_2512_08973
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Enhancing Automatic Speech Recognition Through Integrated Noise Detection Architecture Singh, Karamvir Sound Artificial Intelligence Machine Learning Audio and Speech Processing This research presents a novel approach to enhancing automatic speech recognition systems by integrating noise detection capabilities directly into the recognition architecture. Building upon the wav2vec2 framework, the proposed method incorporates a dedicated noise identification module that operates concurrently with speech transcription. Experimental validation using publicly available speech and environmental audio datasets demonstrates substantial improvements in transcription quality and noise discrimination. The enhanced system achieves superior performance in word error rate, character error rate, and noise detection accuracy compared to conventional architectures. Results indicate that joint optimization of transcription and noise classification objectives yields more reliable speech recognition in challenging acoustic conditions.
title	Enhancing Automatic Speech Recognition Through Integrated Noise Detection Architecture
topic	Sound Artificial Intelligence Machine Learning Audio and Speech Processing
url	https://arxiv.org/abs/2512.08973

Similar Items