Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Bhardwaj, Saurabh, Srivastava, Smriti, Bhandari, Abhishek, Gupta, Krit, Bahl, Hitesh, Gupta, J. R. P.
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Sound
Online-Zugang:	https://arxiv.org/abs/2512.18902
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866914217864462336
author	Bhardwaj, Saurabh Srivastava, Smriti Bhandari, Abhishek Gupta, Krit Bahl, Hitesh Gupta, J. R. P.
author_facet	Bhardwaj, Saurabh Srivastava, Smriti Bhandari, Abhishek Gupta, Krit Bahl, Hitesh Gupta, J. R. P.
contents	This paper proposes a novel Wavelet Packet based feature extraction approach for the task of text independent speaker recognition. The features are extracted by using the combination of Mel Frequency Cepstral Coefficient (MFCC) and Wavelet Packet Transform (WPT).Hybrid Features technique uses the advantage of human ear simulation offered by MFCC combining it with multi-resolution property and noise robustness of WPT. To check the validity of the proposed approach for the text independent speaker identification and verification we have used the Gaussian Mixture Model (GMM) and Hidden Markov Model (HMM) respectively as the classifiers. The proposed paradigm is tested on voxforge speech corpus and CSTR US KED Timit database. The paradigm is also evaluated after adding standard noise signal at different level of SNRs for evaluating the noise robustness. Experimental results show that better results are achieved for the tasks of both speaker identification as well as speaker verification.
format	Preprint
id	arxiv_https___arxiv_org_abs_2512_18902
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Speaker Recognition -- Wavelet Packet Based Multiresolution Feature Extraction Approach Bhardwaj, Saurabh Srivastava, Smriti Bhandari, Abhishek Gupta, Krit Bahl, Hitesh Gupta, J. R. P. Sound This paper proposes a novel Wavelet Packet based feature extraction approach for the task of text independent speaker recognition. The features are extracted by using the combination of Mel Frequency Cepstral Coefficient (MFCC) and Wavelet Packet Transform (WPT).Hybrid Features technique uses the advantage of human ear simulation offered by MFCC combining it with multi-resolution property and noise robustness of WPT. To check the validity of the proposed approach for the text independent speaker identification and verification we have used the Gaussian Mixture Model (GMM) and Hidden Markov Model (HMM) respectively as the classifiers. The proposed paradigm is tested on voxforge speech corpus and CSTR US KED Timit database. The paradigm is also evaluated after adding standard noise signal at different level of SNRs for evaluating the noise robustness. Experimental results show that better results are achieved for the tasks of both speaker identification as well as speaker verification.
title	Speaker Recognition -- Wavelet Packet Based Multiresolution Feature Extraction Approach
topic	Sound
url	https://arxiv.org/abs/2512.18902

Ähnliche Einträge