Bibliographic Details
Main Authors: Bocaccio, Hernan, Iglesias-Pérez, Sergio, Romance, Miguel, Criado, Regino, Mindlin, Gabriel B.
Format: Preprint
Published: 2025
Subjects: Sound; Audio and Speech Processing
Online Access: https://arxiv.org/abs/2502.14110
_version_ 1866909502390927360
author Bocaccio, Hernan
Iglesias-Pérez, Sergio
Romance, Miguel
Criado, Regino
Mindlin, Gabriel B.
author_facet Bocaccio, Hernan
Iglesias-Pérez, Sergio
Romance, Miguel
Criado, Regino
Mindlin, Gabriel B.
contents In this study, we explore the potential of visibility graphs in the spectral domain for speaker recognition. Adult participants were instructed to record vocalizations of the five Spanish vowels. For each vocalization, we computed the frequency spectrum considering the source-filter model of speech production, where formants are shaped by the vocal tract acting as a passive filter with resonant frequencies. Spectral profiles exhibited consistent intra-speaker characteristics, reflecting individual vocal tract anatomies, while showing variation between speakers. We then constructed visibility graphs from these spectral profiles and extracted various graph-theoretic metrics to capture their topological features. These metrics were assembled into feature vectors representing the five vowels for each speaker. Using an ensemble of decision trees trained on these features, we achieved high accuracy in speaker identification. Our analysis identified key topological features that were critical in distinguishing between speakers. This study demonstrates the effectiveness of visibility graphs for spectral analysis and their potential in speaker recognition. We also discuss the robustness of this approach, offering insights into its applicability for real-world speaker recognition systems. This research contributes to expanding the feature extraction toolbox for speaker recognition by leveraging the topological properties of speech signals in the spectral domain.
format Preprint
id arxiv_https___arxiv_org_abs_2502_14110
institution arXiv
publishDate 2025
record_format arxiv
title On the application of Visibility Graphs in the Spectral Domain for Speaker Recognition
topic Sound
Audio and Speech Processing
url https://arxiv.org/abs/2502.14110
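The abstract describes building visibility graphs from spectral profiles and summarizing them with graph-theoretic metrics. The record does not say which visibility criterion or which metrics the authors used, so the following is only a minimal sketch under stated assumptions: it uses the natural visibility criterion, a few simple degree-based metrics chosen for illustration, and a made-up toy spectrum. The function names are hypothetical and none of this is taken from the paper.

```python
from itertools import combinations


def natural_visibility_edges(profile):
    """Return the edges (i, j), i < j, of a natural visibility graph.

    Two samples are linked when the straight line joining them passes
    strictly above every sample that lies between them.
    """
    edges = []
    for i, j in combinations(range(len(profile)), 2):
        yi, yj = profile[i], profile[j]
        visible = all(
            profile[k] < yi + (yj - yi) * (k - i) / (j - i)
            for k in range(i + 1, j)
        )
        if visible:
            edges.append((i, j))
    return edges


def degree_features(edges, n_nodes):
    """Summarize the graph with a few illustrative degree-based metrics."""
    degrees = [0] * n_nodes
    for i, j in edges:
        degrees[i] += 1
        degrees[j] += 1
    max_edges = n_nodes * (n_nodes - 1) / 2
    return {
        "mean_degree": sum(degrees) / n_nodes,
        "max_degree": max(degrees),
        "density": len(edges) / max_edges,
    }


# Hypothetical magnitude spectrum (arbitrary units), not data from the paper.
toy_spectrum = [1.0, 3.0, 2.0, 5.0, 1.0, 4.0]
edges = natural_visibility_edges(toy_spectrum)
features = degree_features(edges, len(toy_spectrum))
print(edges)
print(features)
```

In a pipeline like the one the abstract outlines, one such feature dictionary per vowel could be concatenated into a single vector per speaker and passed to a tree-ensemble classifier. The pairwise check here is cubic in the worst case, which is acceptable for short profiles but would need a faster construction for long spectra.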