Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Sauer, Christina, Boulesteix, Anne-Laure, Hanßum, Luzia, Hodiamont, Farina, Bausewein, Claudia, Ullmann, Theresa
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Methodology
Online Access:	https://arxiv.org/abs/2412.03491
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915447050338304
author	Sauer, Christina Boulesteix, Anne-Laure Hanßum, Luzia Hodiamont, Farina Bausewein, Claudia Ullmann, Theresa
author_facet	Sauer, Christina Boulesteix, Anne-Laure Hanßum, Luzia Hodiamont, Farina Bausewein, Claudia Ullmann, Theresa
contents	Adequately generating and evaluating prediction models based on supervised machine learning (ML) is often challenging, especially for less experienced users in applied research areas. Special attention is required in settings where the model generation process involves hyperparameter tuning, i.e. data-driven optimization of different types of hyperparameters to improve the predictive performance of the resulting model. Discussions about tuning typically focus on the hyperparameters of the ML algorithm (e.g., the minimum number of observations in each terminal node for a tree-based algorithm). In this context, it is often neglected that hyperparameters also exist for the preprocessing steps that are applied to the data before it is provided to the algorithm (e.g., how to handle missing feature values in the data). As a consequence, users experimenting with different preprocessing options to improve model performance may be unaware that this constitutes a form of hyperparameter tuning, albeit informal and unsystematic, and thus may fail to report or account for this optimization. To illuminate this issue, this paper reviews and empirically illustrates different procedures for generating and evaluating prediction models, explicitly addressing the different ways algorithm and preprocessing hyperparameters are typically handled by applied ML users. By highlighting potential pitfalls, especially those that may lead to exaggerated performance claims, this review aims to further improve the quality of predictive modeling in ML applications.
format	Preprint
id	arxiv_https___arxiv_org_abs_2412_03491
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Beyond algorithm hyperparameters: on preprocessing hyperparameters and associated pitfalls in machine learning applications Sauer, Christina Boulesteix, Anne-Laure Hanßum, Luzia Hodiamont, Farina Bausewein, Claudia Ullmann, Theresa Machine Learning Methodology Adequately generating and evaluating prediction models based on supervised machine learning (ML) is often challenging, especially for less experienced users in applied research areas. Special attention is required in settings where the model generation process involves hyperparameter tuning, i.e. data-driven optimization of different types of hyperparameters to improve the predictive performance of the resulting model. Discussions about tuning typically focus on the hyperparameters of the ML algorithm (e.g., the minimum number of observations in each terminal node for a tree-based algorithm). In this context, it is often neglected that hyperparameters also exist for the preprocessing steps that are applied to the data before it is provided to the algorithm (e.g., how to handle missing feature values in the data). As a consequence, users experimenting with different preprocessing options to improve model performance may be unaware that this constitutes a form of hyperparameter tuning, albeit informal and unsystematic, and thus may fail to report or account for this optimization. To illuminate this issue, this paper reviews and empirically illustrates different procedures for generating and evaluating prediction models, explicitly addressing the different ways algorithm and preprocessing hyperparameters are typically handled by applied ML users. By highlighting potential pitfalls, especially those that may lead to exaggerated performance claims, this review aims to further improve the quality of predictive modeling in ML applications.
title	Beyond algorithm hyperparameters: on preprocessing hyperparameters and associated pitfalls in machine learning applications
topic	Machine Learning Methodology
url	https://arxiv.org/abs/2412.03491

Similar Items