Saved in:
Bibliographic Details
Main Authors: Levy, Loup-Noe, Guerard, Guillaume, Djebali, Sonia, Amor, Soufian Ben
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.03071
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • This article presents a novel pretopology-based algorithm designed to address the challenges of clustering mixed data without the need for dimensionality reduction. Leveraging Disjunctive Normal Form, our approach formulates customizable logical rules and adjustable hyperparameters that allow for user-defined hierarchical cluster construction and facilitate tailored solutions for heterogeneous datasets. Through hierarchical dendrogram analysis and comparative clustering metrics, our method demonstrates superior performance by accurately and interpretably delineating clusters directly from raw data, thus preserving data integrity. Empirical findings highlight the algorithm's robustness in constructing meaningful clusters and reveal its potential in overcoming issues related to clustered data explainability. The novelty of this work lies in its departure from traditional dimensionality reduction techniques and its innovative use of logical rules that enhance both cluster formation and clarity, thereby contributing a significant advancement to the discourse on clustering mixed data.