Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Caraker, Drake, Arnold, Bryan, Rhoads, David
Format: Recurso digital
Sprache:Englisch
Veröffentlicht: Zenodo 2026
Schlagworte:
Online-Zugang:https://doi.org/10.5281/zenodo.19060133
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Inhaltsangabe:
  • <p>Abstract<br><br>We isolate and empirically characterize first-mover bias—a path-dependent concentration<br>of feature importance caused by sequential residual fitting in gradient boosting—as a specific<br>mechanistic cause of the well-known instability of SHAP-based feature rankings under mul-<br>ticollinearity. When correlated features compete for early splits, gradient boosting creates a<br>self-reinforcing advantage for whichever feature is selected first: subsequent trees inherit modified<br>residuals that favor the incumbent, concentrating SHAP importance on an arbitrary feature<br>rather than distributing it across the correlated group. Scaling up a single model amplifies this<br>effect—a Large Single Model with the same total tree count as our method produces the worst<br>explanations of any approach tested.<br><br>We demonstrate that model independence is sufficient to resolve first-mover bias in the linear<br>regime, and remains the most effective mitigation under nonlinear data-generating processes.<br>Both our proposed method, DASH (Diversified Aggregation of SHAP), and simple seed-averaging<br>(Stochastic Retrain) restore stability by breaking the sequential dependency chain, confirming<br>that the operative mechanism is independence between explained models, not any particular<br>aggregation strategy. At ρ = 0.9, both methods achieve stability = 0.977, while the standard<br>single-best workflow degrades to 0.958 and the Large Single Model to 0.938. On the Breast<br>Cancer dataset, DASH improves stability from 0.53 to 0.93 (+0.40) over the standard Single<br>Best, and from 0.32 to 0.93 (+0.61) over the training-budget-matched Single Best (M =200).<br><br>DASH additionally provides two novel diagnostic tools—the Feature Stability Index (FSI)<br>and Importance-Stability (IS) Plot—that detect first-mover bias without ground truth, enabling<br>practitioners to audit explanation reliability before acting on feature rankings. Software and<br>reproducible benchmarks are available at https://github.com/DrakeCaraker/dash-shap.<br><br>Keywords: first-mover bias, SHAP, feature importance, multicollinearity, model independence,<br>gradient boosting, explainability, Rashomon effect</p>