Saved in:
Bibliographic Details
Main Authors: Khurshid, Shakir, Loganathan, Bharath Kumar, Duvinage, Matthieu
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2411.00920
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917826977071104
author Khurshid, Shakir
Loganathan, Bharath Kumar
Duvinage, Matthieu
author_facet Khurshid, Shakir
Loganathan, Bharath Kumar
Duvinage, Matthieu
contents The applicability domain refers to the range of data for which the prediction of the predictive model is expected to be reliable and accurate and using a model outside its applicability domain can lead to incorrect results. The ability to define the regions in data space where a predictive model can be safely used is a necessary condition for having safer and more reliable predictions to assure the reliability of new predictions. However, defining the applicability domain of a model is a challenging problem, as there is no clear and universal definition or metric for it. This work aims to make the applicability domain more quantifiable and pragmatic. Eight applicability domain detection techniques were applied to seven regression models, trained on five different datasets, and their performance was benchmarked using a validation framework. We also propose a novel approach based on non-deterministic Bayesian neural networks to define the applicability domain of the model. Our method exhibited superior accuracy in defining the Applicability Domain compared to previous methods, highlighting its potential in this regard.
format Preprint
id arxiv_https___arxiv_org_abs_2411_00920
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Comparative Evaluation of Applicability Domain Definition Methods for Regression Models
Khurshid, Shakir
Loganathan, Bharath Kumar
Duvinage, Matthieu
Machine Learning
Artificial Intelligence
The applicability domain refers to the range of data for which the prediction of the predictive model is expected to be reliable and accurate and using a model outside its applicability domain can lead to incorrect results. The ability to define the regions in data space where a predictive model can be safely used is a necessary condition for having safer and more reliable predictions to assure the reliability of new predictions. However, defining the applicability domain of a model is a challenging problem, as there is no clear and universal definition or metric for it. This work aims to make the applicability domain more quantifiable and pragmatic. Eight applicability domain detection techniques were applied to seven regression models, trained on five different datasets, and their performance was benchmarked using a validation framework. We also propose a novel approach based on non-deterministic Bayesian neural networks to define the applicability domain of the model. Our method exhibited superior accuracy in defining the Applicability Domain compared to previous methods, highlighting its potential in this regard.
title Comparative Evaluation of Applicability Domain Definition Methods for Regression Models
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2411.00920