Bibliographic Details
Main Authors: Piskovskyi, Valentyn, Chimisso, Riccardo, Patania, Sabrina, Foulsham, Tom, Vizzari, Giuseppe, Ognibene, Dimitri
Format: Preprint
Published: 2024
Subjects: Computer Vision and Pattern Recognition, Artificial Intelligence, Human-Computer Interaction
Online Access: https://arxiv.org/abs/2410.04497
author Piskovskyi, Valentyn
Chimisso, Riccardo
Patania, Sabrina
Foulsham, Tom
Vizzari, Giuseppe
Ognibene, Dimitri
contents The purpose of this work is to investigate the soundness and utility of a neural network-based approach as a framework for exploring the impact of image enhancement techniques on visual cortex activation. In a preliminary study, we prepare a set of state-of-the-art brain encoding models, selected from among the top 10 methods that participated in The Algonauts Project 2023 Challenge [16]. We analyze their ability to make valid predictions about the effects of various image enhancement techniques on neural responses. Because the high cost of brain imaging makes acquiring the corresponding measured data impractical, our investigation builds on a series of experiments. Specifically, we analyze the ability of brain encoders to estimate the cerebral reaction to various augmentations by evaluating the response to augmentations targeting objects (i.e., faces and words) with known impact on specific areas. Moreover, we study the predicted activation in response to objects unseen during training, exploring the impact of semantically out-of-distribution stimuli. We provide evidence for the generalization ability of the models forming the proposed framework, which appears promising for identifying the optimal visual augmentation filter for a given task, for model-driven design strategies, and for AR and VR applications.
format Preprint
id arxiv_https___arxiv_org_abs_2410_04497
institution arXiv
publishDate 2024
record_format arxiv
title Generalizability analysis of deep learning predictions of human brain responses to augmented and semantically novel visual stimuli
topic Computer Vision and Pattern Recognition
Artificial Intelligence
Human-Computer Interaction
url https://arxiv.org/abs/2410.04497