Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.00920 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1866917826977071104 |
|---|---|
| author | Khurshid, Shakir Loganathan, Bharath Kumar Duvinage, Matthieu |
| author_facet | Khurshid, Shakir Loganathan, Bharath Kumar Duvinage, Matthieu |
| contents | The applicability domain refers to the range of data for which the prediction of the predictive model is expected to be reliable and accurate and using a model outside its applicability domain can lead to incorrect results. The ability to define the regions in data space where a predictive model can be safely used is a necessary condition for having safer and more reliable predictions to assure the reliability of new predictions. However, defining the applicability domain of a model is a challenging problem, as there is no clear and universal definition or metric for it. This work aims to make the applicability domain more quantifiable and pragmatic. Eight applicability domain detection techniques were applied to seven regression models, trained on five different datasets, and their performance was benchmarked using a validation framework. We also propose a novel approach based on non-deterministic Bayesian neural networks to define the applicability domain of the model. Our method exhibited superior accuracy in defining the Applicability Domain compared to previous methods, highlighting its potential in this regard. |
| format | Preprint |
| id |
arxiv_https___arxiv_org_abs_2411_00920 |
| institution | arXiv |
| publishDate | 2024 |
| record_format | arxiv |
| spellingShingle | Comparative Evaluation of Applicability Domain Definition Methods for Regression Models Khurshid, Shakir Loganathan, Bharath Kumar Duvinage, Matthieu Machine Learning Artificial Intelligence The applicability domain refers to the range of data for which the prediction of the predictive model is expected to be reliable and accurate and using a model outside its applicability domain can lead to incorrect results. The ability to define the regions in data space where a predictive model can be safely used is a necessary condition for having safer and more reliable predictions to assure the reliability of new predictions. However, defining the applicability domain of a model is a challenging problem, as there is no clear and universal definition or metric for it. This work aims to make the applicability domain more quantifiable and pragmatic. Eight applicability domain detection techniques were applied to seven regression models, trained on five different datasets, and their performance was benchmarked using a validation framework. We also propose a novel approach based on non-deterministic Bayesian neural networks to define the applicability domain of the model. Our method exhibited superior accuracy in defining the Applicability Domain compared to previous methods, highlighting its potential in this regard. |
| title | Comparative Evaluation of Applicability Domain Definition Methods for Regression Models |
| topic | Machine Learning Artificial Intelligence |
| url | https://arxiv.org/abs/2411.00920 |