Saved in:
Bibliographic Details
Main Authors: Jiang, Zhongli, Zhang, Min, Zhang, Dabao
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.03515
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913859806167040
author Jiang, Zhongli
Zhang, Min
Zhang, Dabao
author_facet Jiang, Zhongli
Zhang, Min
Zhang, Dabao
contents Recently, several fast algorithms have been proposed to decompose predicted value into Shapley values, enabling individualized feature contribution analysis in tree models. While such local decomposition offers valuable insights, it underscores the need for a global evaluation of feature contributions. Although coefficients of determination ($R^2$) allow for comparative assessment of individual features, individualizing $R^2$ is challenged by the underlying quadratic losses. To address this, we propose Q-SHAP, an efficient algorithm that reduces the computational complexity of calculating Shapley values for quadratic losses to polynomial time. Our simulations show that Q-SHAP not only improves computational efficiency but also enhances the accuracy of feature-specific $R^2$ estimates.
format Preprint
id arxiv_https___arxiv_org_abs_2407_03515
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Fast Calculation of Feature Contributions in Boosting Trees
Jiang, Zhongli
Zhang, Min
Zhang, Dabao
Machine Learning
Recently, several fast algorithms have been proposed to decompose predicted value into Shapley values, enabling individualized feature contribution analysis in tree models. While such local decomposition offers valuable insights, it underscores the need for a global evaluation of feature contributions. Although coefficients of determination ($R^2$) allow for comparative assessment of individual features, individualizing $R^2$ is challenged by the underlying quadratic losses. To address this, we propose Q-SHAP, an efficient algorithm that reduces the computational complexity of calculating Shapley values for quadratic losses to polynomial time. Our simulations show that Q-SHAP not only improves computational efficiency but also enhances the accuracy of feature-specific $R^2$ estimates.
title Fast Calculation of Feature Contributions in Boosting Trees
topic Machine Learning
url https://arxiv.org/abs/2407.03515