Saved in:
Bibliographic Details
Main Authors: Passi, Ananya, Robinson, Brian S., Bonner, Michael F.
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.19155
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918510281621504
author Passi, Ananya
Robinson, Brian S.
Bonner, Michael F.
author_facet Passi, Ananya
Robinson, Brian S.
Bonner, Michael F.
contents Biological visual systems learn from limited experience, unlike deep learning models that rely on millions of training images. What learning principles make this possible? We tested whether efficient coding, the idea that neural representations capture the statistical structure of natural inputs, can build a hierarchy of human-aligned visual features from limited data. We developed an unsupervised learning procedure in which each layer of a deep network compresses its inputs onto the dominant modes of variation in natural images, using only local statistics and no labels, tasks, or backpropagation. This unsupervised procedure yields features that progress from edges and colors to textures and shapes. The features of this deep efficient coding model are readily recognized by human observers and are predictive of image-evoked fMRI responses in human visual cortex. Furthermore, a hybrid learning procedure that combines efficient coding with supervised fine-tuning yields better brain alignment in low-data settings and more rapid category learning. These findings suggest that efficient coding may shape representations across the entire visual hierarchy and help explain the data efficiency of biological vision.
format Preprint
id arxiv_https___arxiv_org_abs_2605_19155
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Efficient coding along the visual hierarchy
Passi, Ananya
Robinson, Brian S.
Bonner, Michael F.
Computer Vision and Pattern Recognition
Biological visual systems learn from limited experience, unlike deep learning models that rely on millions of training images. What learning principles make this possible? We tested whether efficient coding, the idea that neural representations capture the statistical structure of natural inputs, can build a hierarchy of human-aligned visual features from limited data. We developed an unsupervised learning procedure in which each layer of a deep network compresses its inputs onto the dominant modes of variation in natural images, using only local statistics and no labels, tasks, or backpropagation. This unsupervised procedure yields features that progress from edges and colors to textures and shapes. The features of this deep efficient coding model are readily recognized by human observers and are predictive of image-evoked fMRI responses in human visual cortex. Furthermore, a hybrid learning procedure that combines efficient coding with supervised fine-tuning yields better brain alignment in low-data settings and more rapid category learning. These findings suggest that efficient coding may shape representations across the entire visual hierarchy and help explain the data efficiency of biological vision.
title Efficient coding along the visual hierarchy
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2605.19155