Saved in:
Bibliographic Details
Main Authors: Sechet, Dylan, Bugiotti, Francesca, Kowalski, Matthieu, d'Hérouville, Edouard, Langiewicz, Filip
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2506.21167
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908423518420992
author Sechet, Dylan
Bugiotti, Francesca
Kowalski, Matthieu
d'Hérouville, Edouard
Langiewicz, Filip
author_facet Sechet, Dylan
Bugiotti, Francesca
Kowalski, Matthieu
d'Hérouville, Edouard
Langiewicz, Filip
contents Identifying instrument activities within audio excerpts is vital in music information retrieval, with significant implications for music cataloging and discovery. Prior deep learning endeavors in musical instrument recognition have predominantly emphasized instrument classes with ample data availability. Recent studies have demonstrated the applicability of hierarchical classification in detecting instrument activities in orchestral music, even with limited fine-grained annotations at the instrument level. Based on the Hornbostel-Sachs classification, such a hierarchical classification system is evaluated using the MedleyDB dataset, renowned for its diversity and richness concerning various instruments and music genres. This work presents various strategies to integrate hierarchical structures into models and tests a new class of models for hierarchical music prediction. This study showcases more reliable coarse-level instrument detection by bridging the gap between detailed instrument identification and group-level recognition, paving the way for further advancements in this domain.
format Preprint
id arxiv_https___arxiv_org_abs_2506_21167
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle A Hierarchical Deep Learning Approach for Minority Instrument Detection
Sechet, Dylan
Bugiotti, Francesca
Kowalski, Matthieu
d'Hérouville, Edouard
Langiewicz, Filip
Sound
Artificial Intelligence
Audio and Speech Processing
Identifying instrument activities within audio excerpts is vital in music information retrieval, with significant implications for music cataloging and discovery. Prior deep learning endeavors in musical instrument recognition have predominantly emphasized instrument classes with ample data availability. Recent studies have demonstrated the applicability of hierarchical classification in detecting instrument activities in orchestral music, even with limited fine-grained annotations at the instrument level. Based on the Hornbostel-Sachs classification, such a hierarchical classification system is evaluated using the MedleyDB dataset, renowned for its diversity and richness concerning various instruments and music genres. This work presents various strategies to integrate hierarchical structures into models and tests a new class of models for hierarchical music prediction. This study showcases more reliable coarse-level instrument detection by bridging the gap between detailed instrument identification and group-level recognition, paving the way for further advancements in this domain.
title A Hierarchical Deep Learning Approach for Minority Instrument Detection
topic Sound
Artificial Intelligence
Audio and Speech Processing
url https://arxiv.org/abs/2506.21167