Saved in:
Bibliographic Details
Main Authors: Neyra, Jorge, Siramshetty, Vishal B., Ashqar, Huthaifa I.
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2411.05937
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910692196483072
author Neyra, Jorge
Siramshetty, Vishal B.
Ashqar, Huthaifa I.
author_facet Neyra, Jorge
Siramshetty, Vishal B.
Ashqar, Huthaifa I.
contents This study examines the effect that different feature selection methods have on models created with XGBoost, a popular machine learning algorithm with superb regularization methods. It shows that three different ways for reducing the dimensionality of features produces no statistically significant change in the prediction accuracy of the model. This suggests that the traditional idea of removing the noisy training data to make sure models do not overfit may not apply to XGBoost. But it may still be viable in order to reduce computational complexity.
format Preprint
id arxiv_https___arxiv_org_abs_2411_05937
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle The effect of different feature selection methods on models created with XGBoost
Neyra, Jorge
Siramshetty, Vishal B.
Ashqar, Huthaifa I.
Machine Learning
Information Retrieval
This study examines the effect that different feature selection methods have on models created with XGBoost, a popular machine learning algorithm with superb regularization methods. It shows that three different ways for reducing the dimensionality of features produces no statistically significant change in the prediction accuracy of the model. This suggests that the traditional idea of removing the noisy training data to make sure models do not overfit may not apply to XGBoost. But it may still be viable in order to reduce computational complexity.
title The effect of different feature selection methods on models created with XGBoost
topic Machine Learning
Information Retrieval
url https://arxiv.org/abs/2411.05937