Enregistré dans:
Détails bibliographiques
Auteurs principaux: Richard, Gael, Chouteau, Pierre, Torres, Bernardo
Format: Preprint
Publié: 2024
Sujets:
Accès en ligne:https://arxiv.org/abs/2401.16837
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866913215359746048
author Richard, Gael
Chouteau, Pierre
Torres, Bernardo
author_facet Richard, Gael
Chouteau, Pierre
Torres, Bernardo
contents A novel model was recently proposed by Schulze-Forster et al. in [1] for unsupervised music source separation. This model allows to tackle some of the major shortcomings of existing source separation frameworks. Specifically, it eliminates the need for isolated sources during training, performs efficiently with limited data, and can handle homogeneous sources (such as singing voice). But, this model relies on an external multipitch estimator and incorporates an Ad hoc voice assignment procedure. In this paper, we propose to extend this framework and to build a fully differentiable model by integrating a multipitch estimator and a novel differentiable assignment module within the core model. We show the merits of our approach through a set of experiments, and we highlight in particular its potential for processing diverse and unseen data.
format Preprint
id arxiv_https___arxiv_org_abs_2401_16837
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle A fully differentiable model for unsupervised singing voice separation
Richard, Gael
Chouteau, Pierre
Torres, Bernardo
Signal Processing
A novel model was recently proposed by Schulze-Forster et al. in [1] for unsupervised music source separation. This model allows to tackle some of the major shortcomings of existing source separation frameworks. Specifically, it eliminates the need for isolated sources during training, performs efficiently with limited data, and can handle homogeneous sources (such as singing voice). But, this model relies on an external multipitch estimator and incorporates an Ad hoc voice assignment procedure. In this paper, we propose to extend this framework and to build a fully differentiable model by integrating a multipitch estimator and a novel differentiable assignment module within the core model. We show the merits of our approach through a set of experiments, and we highlight in particular its potential for processing diverse and unseen data.
title A fully differentiable model for unsupervised singing voice separation
topic Signal Processing
url https://arxiv.org/abs/2401.16837