Saved in:
Bibliographic Details
Main Authors: Krumpl, Gerhard, Avenhaus, Henning, Possegger, Horst
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.10836
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908769419526144
author Krumpl, Gerhard
Avenhaus, Henning
Possegger, Horst
author_facet Krumpl, Gerhard
Avenhaus, Henning
Possegger, Horst
contents Out-of-distribution (OOD) detection is crucial for deploying robust and reliable machine-learning systems in open-world settings. Despite steady advances in OOD detectors, their interplay with modern training pipelines that maximize in-distribution (ID) accuracy and generalization remains under-explored. We investigate this link through a comprehensive empirical study. Fixing the architecture to the widely adopted ResNet-50, we benchmark 21 post-hoc, state-of-the-art OOD detection methods across 56 ImageNet-trained models obtained via diverse training strategies and evaluate them on eight OOD test sets. Contrary to the common assumption that higher ID accuracy implies better OOD detection performance, we uncover a non-monotonic relationship: OOD performance initially improves with accuracy but declines once advanced training recipes push accuracy beyond the baseline. Moreover, we observe a strong interdependence between training strategy, detector choice, and resulting OOD performance, indicating that no single method is universally optimal.
format Preprint
id arxiv_https___arxiv_org_abs_2601_10836
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle One Model, Many Behaviors: Training-Induced Effects on Out-of-Distribution Detection
Krumpl, Gerhard
Avenhaus, Henning
Possegger, Horst
Computer Vision and Pattern Recognition
Out-of-distribution (OOD) detection is crucial for deploying robust and reliable machine-learning systems in open-world settings. Despite steady advances in OOD detectors, their interplay with modern training pipelines that maximize in-distribution (ID) accuracy and generalization remains under-explored. We investigate this link through a comprehensive empirical study. Fixing the architecture to the widely adopted ResNet-50, we benchmark 21 post-hoc, state-of-the-art OOD detection methods across 56 ImageNet-trained models obtained via diverse training strategies and evaluate them on eight OOD test sets. Contrary to the common assumption that higher ID accuracy implies better OOD detection performance, we uncover a non-monotonic relationship: OOD performance initially improves with accuracy but declines once advanced training recipes push accuracy beyond the baseline. Moreover, we observe a strong interdependence between training strategy, detector choice, and resulting OOD performance, indicating that no single method is universally optimal.
title One Model, Many Behaviors: Training-Induced Effects on Out-of-Distribution Detection
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2601.10836