Vista Equipo: :: Library Catalog

Guardado en:

Detalles Bibliográficos
Autores principales:	Lonnqvist, Ben, Scialom, Elsa, Gokce, Abdulkadir, Merchant, Zehra, Herzog, Michael H., Schrimpf, Martin
Formato:	Preprint
Publicado:	2025
Materias:	Computer Vision and Pattern Recognition
Acceso en línea:	https://arxiv.org/abs/2504.05253
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

_version_	1866912439239442432
author	Lonnqvist, Ben Scialom, Elsa Gokce, Abdulkadir Merchant, Zehra Herzog, Michael H. Schrimpf, Martin
author_facet	Lonnqvist, Ben Scialom, Elsa Gokce, Abdulkadir Merchant, Zehra Herzog, Michael H. Schrimpf, Martin
contents	Despite the tremendous success of deep learning in computer vision, models still fall behind humans in generalizing to new input distributions. Existing benchmarks do not investigate the specific failure points of models by analyzing performance under many controlled conditions. Our study systematically dissects where and why models struggle with contour integration -- a hallmark of human vision -- by designing an experiment that tests object recognition under various levels of object fragmentation. Humans (n=50) perform at high accuracy, even with few object contours present. This is in contrast to models which exhibit substantially lower sensitivity to increasing object contours, with most of the over 1,000 models we tested barely performing above chance. Only at very large scales ($\sim5B$ training dataset size) do models begin to approach human performance. Importantly, humans exhibit an integration bias -- a preference towards recognizing objects made up of directional fragments over directionless fragments. We find that not only do models that share this property perform better at our task, but that this bias also increases with model training dataset size, and training models to exhibit contour integration leads to high shape bias. Taken together, our results suggest that contour integration is a hallmark of object vision that underlies object recognition performance, and may be a mechanism learned from data at scale.
format	Preprint
id	arxiv_https___arxiv_org_abs_2504_05253
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Contour Integration Underlies Human-Like Vision Lonnqvist, Ben Scialom, Elsa Gokce, Abdulkadir Merchant, Zehra Herzog, Michael H. Schrimpf, Martin Computer Vision and Pattern Recognition Despite the tremendous success of deep learning in computer vision, models still fall behind humans in generalizing to new input distributions. Existing benchmarks do not investigate the specific failure points of models by analyzing performance under many controlled conditions. Our study systematically dissects where and why models struggle with contour integration -- a hallmark of human vision -- by designing an experiment that tests object recognition under various levels of object fragmentation. Humans (n=50) perform at high accuracy, even with few object contours present. This is in contrast to models which exhibit substantially lower sensitivity to increasing object contours, with most of the over 1,000 models we tested barely performing above chance. Only at very large scales ($\sim5B$ training dataset size) do models begin to approach human performance. Importantly, humans exhibit an integration bias -- a preference towards recognizing objects made up of directional fragments over directionless fragments. We find that not only do models that share this property perform better at our task, but that this bias also increases with model training dataset size, and training models to exhibit contour integration leads to high shape bias. Taken together, our results suggest that contour integration is a hallmark of object vision that underlies object recognition performance, and may be a mechanism learned from data at scale.
title	Contour Integration Underlies Human-Like Vision
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2504.05253

Ejemplares similares