Saved in:
Bibliographic Details
Main Authors: Pinheiro, Diogo, Oliveira, M. Rosário, Kravchenko, Igor, Oliveira, Lina
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.11945
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912761450070016
author Pinheiro, Diogo
Oliveira, M. Rosário
Kravchenko, Igor
Oliveira, Lina
author_facet Pinheiro, Diogo
Oliveira, M. Rosário
Kravchenko, Igor
Oliveira, Lina
contents In Data Science, entities are typically represented by single valued measurements. Symbolic Data Analysis extends this framework to more complex structures, such as intervals and histograms, that express internal variability. We propose an extension of multiclass Fisher's Discriminant Analysis to interval-valued data, using Moore's interval arithmetic and the Mallows' distance. Fisher's objective function is generalised to consider simultaneously the contributions of the centres and the ranges of intervals and is numerically maximised. The resulting discriminant directions are then used to classify interval-valued observations.To support visual assessment, we adapt the class map, originally introduced for conventional data, to classifiers that assign labels through minimum distance rules. We also extend the silhouette plot to this setting and use stacked mosaic plots to complement the visual display of class assignments. Together, these graphical tools provide insight into classifier performance and the strength of class membership. Applications to real datasets illustrate the proposed methodology and demonstrate its value in interpreting classification results for interval-valued data.
format Preprint
id arxiv_https___arxiv_org_abs_2512_11945
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Interval Fisher's Discriminant Analysis and Visualisation
Pinheiro, Diogo
Oliveira, M. Rosário
Kravchenko, Igor
Oliveira, Lina
Machine Learning
Statistics Theory
In Data Science, entities are typically represented by single valued measurements. Symbolic Data Analysis extends this framework to more complex structures, such as intervals and histograms, that express internal variability. We propose an extension of multiclass Fisher's Discriminant Analysis to interval-valued data, using Moore's interval arithmetic and the Mallows' distance. Fisher's objective function is generalised to consider simultaneously the contributions of the centres and the ranges of intervals and is numerically maximised. The resulting discriminant directions are then used to classify interval-valued observations.To support visual assessment, we adapt the class map, originally introduced for conventional data, to classifiers that assign labels through minimum distance rules. We also extend the silhouette plot to this setting and use stacked mosaic plots to complement the visual display of class assignments. Together, these graphical tools provide insight into classifier performance and the strength of class membership. Applications to real datasets illustrate the proposed methodology and demonstrate its value in interpreting classification results for interval-valued data.
title Interval Fisher's Discriminant Analysis and Visualisation
topic Machine Learning
Statistics Theory
url https://arxiv.org/abs/2512.11945