Affichage MARC: :: Library Catalog

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Nfissi, Alaa, Bouachir, Wassim, Bouguila, Nizar, Mishara, Brian
Format:	Preprint
Publié:	2024
Sujets:	Sound Artificial Intelligence Computation and Language Machine Learning Audio and Speech Processing I.2.7; I.2.6; I.2.1; I.2.8
Accès en ligne:	https://arxiv.org/abs/2405.20172
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

_version_	1866910474251010048
author	Nfissi, Alaa Bouachir, Wassim Bouguila, Nizar Mishara, Brian
author_facet	Nfissi, Alaa Bouachir, Wassim Bouguila, Nizar Mishara, Brian
contents	In speech emotion recognition (SER), using predefined features without considering their practical importance may lead to high dimensional datasets, including redundant and irrelevant information. Consequently, high-dimensional learning often results in decreasing model accuracy while increasing computational complexity. Our work underlines the importance of carefully considering and analyzing features in order to build efficient SER systems. We present a new supervised SER method based on an efficient feature engineering approach. We pay particular attention to the explainability of results to evaluate feature relevance and refine feature sets. This is performed iteratively through feature evaluation loop, using Shapley values to boost feature selection and improve overall framework performance. Our approach allows thus to balance the benefits between model performance and transparency. The proposed method outperforms human-level performance (HLP) and state-of-the-art machine learning methods in emotion recognition on the TESS dataset. The source code of this paper is publicly available at https://github.com/alaaNfissi/Iterative-Feature-Boosting-for-Explainable-Speech-Emotion-Recognition.
format	Preprint
id	arxiv_https___arxiv_org_abs_2405_20172
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Iterative Feature Boosting for Explainable Speech Emotion Recognition Nfissi, Alaa Bouachir, Wassim Bouguila, Nizar Mishara, Brian Sound Artificial Intelligence Computation and Language Machine Learning Audio and Speech Processing I.2.7; I.2.6; I.2.1; I.2.8 In speech emotion recognition (SER), using predefined features without considering their practical importance may lead to high dimensional datasets, including redundant and irrelevant information. Consequently, high-dimensional learning often results in decreasing model accuracy while increasing computational complexity. Our work underlines the importance of carefully considering and analyzing features in order to build efficient SER systems. We present a new supervised SER method based on an efficient feature engineering approach. We pay particular attention to the explainability of results to evaluate feature relevance and refine feature sets. This is performed iteratively through feature evaluation loop, using Shapley values to boost feature selection and improve overall framework performance. Our approach allows thus to balance the benefits between model performance and transparency. The proposed method outperforms human-level performance (HLP) and state-of-the-art machine learning methods in emotion recognition on the TESS dataset. The source code of this paper is publicly available at https://github.com/alaaNfissi/Iterative-Feature-Boosting-for-Explainable-Speech-Emotion-Recognition.
title	Iterative Feature Boosting for Explainable Speech Emotion Recognition
topic	Sound Artificial Intelligence Computation and Language Machine Learning Audio and Speech Processing I.2.7; I.2.6; I.2.1; I.2.8
url	https://arxiv.org/abs/2405.20172

Documents similaires