Inhaltsangabe: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Caraker, Drake, Arnold, Bryan, Rhoads, David
Format:	Recurso digital
Sprache:	Englisch
Veröffentlicht:	Zenodo 2026
Schlagworte:	feature importance multicollinearity explainability interpretable machine learning XAI ensemble explainability diversity-aware aggregation explanation aggregation feature stability XGBoost SHAP DASH machine learning
Online-Zugang:	https://doi.org/10.5281/zenodo.19060133
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Inhaltsangabe:

Abstract We isolate and empirically characterize first-mover bias—a path-dependent concentration of feature importance caused by sequential residual fitting in gradient boosting—as a specific mechanistic cause of the well-known instability of SHAP-based feature rankings under mul- ticollinearity. When correlated features compete for early splits, gradient boosting creates a self-reinforcing advantage for whichever feature is selected first: subsequent trees inherit modified residuals that favor the incumbent, concentrating SHAP importance on an arbitrary feature rather than distributing it across the correlated group. Scaling up a single model amplifies this effect—a Large Single Model with the same total tree count as our method produces the worst explanations of any approach tested. We demonstrate that model independence is sufficient to resolve first-mover bias in the linear regime, and remains the most effective mitigation under nonlinear data-generating processes. Both our proposed method, DASH (Diversified Aggregation of SHAP), and simple seed-averaging (Stochastic Retrain) restore stability by breaking the sequential dependency chain, confirming that the operative mechanism is independence between explained models, not any particular aggregation strategy. At ρ = 0.9, both methods achieve stability = 0.977, while the standard single-best workflow degrades to 0.958 and the Large Single Model to 0.938. On the Breast Cancer dataset, DASH improves stability from 0.53 to 0.93 (+0.40) over the standard Single Best, and from 0.32 to 0.93 (+0.61) over the training-budget-matched Single Best (M =200). DASH additionally provides two novel diagnostic tools—the Feature Stability Index (FSI) and Importance-Stability (IS) Plot—that detect first-mover bias without ground truth, enabling practitioners to audit explanation reliability before acting on feature rankings. Software and reproducible benchmarks are available at https://github.com/DrakeCaraker/dash-shap. Keywords: first-mover bias, SHAP, feature importance, multicollinearity, model independence, gradient boosting, explainability, Rashomon effect

Ähnliche Einträge