Saved in:
Bibliographic Details
Main Authors: Fadaei, Amir Hosein, Dehaqani, Mohammad-Reza A.
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2311.00800
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916326818185216
author Fadaei, Amir Hosein
Dehaqani, Mohammad-Reza A.
author_facet Fadaei, Amir Hosein
Dehaqani, Mohammad-Reza A.
contents Traditionally, vision models have predominantly relied on spatial features extracted from static images, deviating from the continuous stream of spatiotemporal features processed by the brain in natural vision. While numerous video-understanding models have emerged, incorporating videos into image-understanding models with spatiotemporal features has been limited. Drawing inspiration from natural vision, which exhibits remarkable resilience to input changes, our research focuses on the development of a brain-inspired model for vision understanding trained with videos. Our findings demonstrate that models that train on videos instead of still images and include temporal features become more resilient to various alternations on input media.
format Preprint
id arxiv_https___arxiv_org_abs_2311_00800
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle Beyond still images: Temporal features and input variance resilience
Fadaei, Amir Hosein
Dehaqani, Mohammad-Reza A.
Computer Vision and Pattern Recognition
Artificial Intelligence
I.2.10; I.5.1; I.4.8
Traditionally, vision models have predominantly relied on spatial features extracted from static images, deviating from the continuous stream of spatiotemporal features processed by the brain in natural vision. While numerous video-understanding models have emerged, incorporating videos into image-understanding models with spatiotemporal features has been limited. Drawing inspiration from natural vision, which exhibits remarkable resilience to input changes, our research focuses on the development of a brain-inspired model for vision understanding trained with videos. Our findings demonstrate that models that train on videos instead of still images and include temporal features become more resilient to various alternations on input media.
title Beyond still images: Temporal features and input variance resilience
topic Computer Vision and Pattern Recognition
Artificial Intelligence
I.2.10; I.5.1; I.4.8
url https://arxiv.org/abs/2311.00800