Enregistré dans:
Détails bibliographiques
Auteurs principaux: Cao, Chunxu, Zhang, Qiang
Format: Preprint
Publié: 2024
Sujets:
Accès en ligne:https://arxiv.org/abs/2401.07540
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866911758336131072
author Cao, Chunxu
Zhang, Qiang
author_facet Cao, Chunxu
Zhang, Qiang
contents In this paper, we present a novel framework for data redundancy measurement based on probabilistic modeling of datasets, and a new criterion for redundancy detection that is resilient to noise. We also develop new methods for data redundancy reduction using both deterministic and stochastic optimization techniques. Our framework is flexible and can handle different types of features, and our experiments on benchmark datasets demonstrate the effectiveness of our methods. We provide a new perspective on feature selection, and propose effective and robust approaches for both supervised and unsupervised learning problems.
format Preprint
id arxiv_https___arxiv_org_abs_2401_07540
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Study Features via Exploring Distribution Structure
Cao, Chunxu
Zhang, Qiang
Machine Learning
In this paper, we present a novel framework for data redundancy measurement based on probabilistic modeling of datasets, and a new criterion for redundancy detection that is resilient to noise. We also develop new methods for data redundancy reduction using both deterministic and stochastic optimization techniques. Our framework is flexible and can handle different types of features, and our experiments on benchmark datasets demonstrate the effectiveness of our methods. We provide a new perspective on feature selection, and propose effective and robust approaches for both supervised and unsupervised learning problems.
title Study Features via Exploring Distribution Structure
topic Machine Learning
url https://arxiv.org/abs/2401.07540