Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wang, Shunxin, Veldhuis, Raymond, Strisciuglio, Nicola
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2503.03519
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917966284587008
author	Wang, Shunxin Veldhuis, Raymond Strisciuglio, Nicola
author_facet	Wang, Shunxin Veldhuis, Raymond Strisciuglio, Nicola
contents	Frequency shortcuts refer to specific frequency patterns that models heavily rely on for correct classification. Previous studies have shown that models trained on small image datasets often exploit such shortcuts, potentially impairing their generalization performance. However, existing methods for identifying frequency shortcuts require expensive computations and become impractical for analyzing models trained on large datasets. In this work, we propose the first approach to more efficiently analyze frequency shortcuts at a large scale. We show that both CNN and transformer models learn frequency shortcuts on ImageNet. We also expose that frequency shortcut solutions can yield good performance on out-of-distribution (OOD) test sets which largely retain texture information. However, these shortcuts, mostly aligned with texture patterns, hinder model generalization on rendition-based OOD test sets. These observations suggest that current OOD evaluations often overlook the impact of frequency shortcuts on model generalization. Future benchmarks could thus benefit from explicitly assessing and accounting for these shortcuts to build models that generalize across a broader range of OOD scenarios.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_03519
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Do ImageNet-trained models learn shortcuts? The impact of frequency shortcuts on generalization Wang, Shunxin Veldhuis, Raymond Strisciuglio, Nicola Computer Vision and Pattern Recognition Frequency shortcuts refer to specific frequency patterns that models heavily rely on for correct classification. Previous studies have shown that models trained on small image datasets often exploit such shortcuts, potentially impairing their generalization performance. However, existing methods for identifying frequency shortcuts require expensive computations and become impractical for analyzing models trained on large datasets. In this work, we propose the first approach to more efficiently analyze frequency shortcuts at a large scale. We show that both CNN and transformer models learn frequency shortcuts on ImageNet. We also expose that frequency shortcut solutions can yield good performance on out-of-distribution (OOD) test sets which largely retain texture information. However, these shortcuts, mostly aligned with texture patterns, hinder model generalization on rendition-based OOD test sets. These observations suggest that current OOD evaluations often overlook the impact of frequency shortcuts on model generalization. Future benchmarks could thus benefit from explicitly assessing and accounting for these shortcuts to build models that generalize across a broader range of OOD scenarios.
title	Do ImageNet-trained models learn shortcuts? The impact of frequency shortcuts on generalization
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2503.03519

Similar Items