Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Hartoyo, Agus, Argasiński, Jan, Trenk, Aleksandra, Przybylska, Kinga, Błasiak, Anna, Crimi, Alessandro
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2402.09281
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929532063186944
author	Hartoyo, Agus Argasiński, Jan Trenk, Aleksandra Przybylska, Kinga Błasiak, Anna Crimi, Alessandro
author_facet	Hartoyo, Agus Argasiński, Jan Trenk, Aleksandra Przybylska, Kinga Błasiak, Anna Crimi, Alessandro
contents	Covariance and Hessian matrices have been analyzed separately in the literature for classification problems. However, integrating these matrices has the potential to enhance their combined power in improving classification performance. We present a novel approach that combines the eigenanalysis of a covariance matrix evaluated on a training set with a Hessian matrix evaluated on a deep learning model to achieve optimal class separability in binary classification tasks. Our approach is substantiated by formal proofs that establish its capability to maximize between-class mean distance (the concept of \textit{separation}) and minimize within-class variances (the concept of \textit{compactness}), which together define the two linear discriminant analysis (LDA) criteria, particularly under ideal data conditions such as isotropy around class means and dominant leading eigenvalues. By projecting data into the combined space of the most relevant eigendirections from both matrices, we achieve optimal class separability as per these LDA criteria. Empirical validation across neural and health datasets consistently supports our theoretical framework and demonstrates that our method outperforms established methods. Our method stands out by addressing both separation and compactness criteria, unlike PCA and the Hessian method, which predominantly emphasize one criterion each. This comprehensive approach captures intricate patterns and relationships, enhancing classification performance. Furthermore, through the utilization of both LDA criteria, our method outperforms LDA itself by leveraging higher-dimensional feature spaces, in accordance with Cover's theorem, which favors linear separability in higher dimensions. Additionally, our approach sheds light on complex DNN decision-making, rendering them comprehensible within a 2D space.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_09281
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Synergistic eigenanalysis of covariance and Hessian matrices for enhanced binary classification Hartoyo, Agus Argasiński, Jan Trenk, Aleksandra Przybylska, Kinga Błasiak, Anna Crimi, Alessandro Machine Learning Artificial Intelligence Covariance and Hessian matrices have been analyzed separately in the literature for classification problems. However, integrating these matrices has the potential to enhance their combined power in improving classification performance. We present a novel approach that combines the eigenanalysis of a covariance matrix evaluated on a training set with a Hessian matrix evaluated on a deep learning model to achieve optimal class separability in binary classification tasks. Our approach is substantiated by formal proofs that establish its capability to maximize between-class mean distance (the concept of \textit{separation}) and minimize within-class variances (the concept of \textit{compactness}), which together define the two linear discriminant analysis (LDA) criteria, particularly under ideal data conditions such as isotropy around class means and dominant leading eigenvalues. By projecting data into the combined space of the most relevant eigendirections from both matrices, we achieve optimal class separability as per these LDA criteria. Empirical validation across neural and health datasets consistently supports our theoretical framework and demonstrates that our method outperforms established methods. Our method stands out by addressing both separation and compactness criteria, unlike PCA and the Hessian method, which predominantly emphasize one criterion each. This comprehensive approach captures intricate patterns and relationships, enhancing classification performance. Furthermore, through the utilization of both LDA criteria, our method outperforms LDA itself by leveraging higher-dimensional feature spaces, in accordance with Cover's theorem, which favors linear separability in higher dimensions. Additionally, our approach sheds light on complex DNN decision-making, rendering them comprehensible within a 2D space.
title	Synergistic eigenanalysis of covariance and Hessian matrices for enhanced binary classification
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2402.09281

Similar Items