Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Fadaei, Amir Hosein, Dehaqani, Mohammad-Reza A.
Format:	Preprint
Published:	2023
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence I.2.10; I.5.1; I.4.8
Online Access:	https://arxiv.org/abs/2311.00800
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916326818185216
author	Fadaei, Amir Hosein Dehaqani, Mohammad-Reza A.
author_facet	Fadaei, Amir Hosein Dehaqani, Mohammad-Reza A.
contents	Traditionally, vision models have predominantly relied on spatial features extracted from static images, deviating from the continuous stream of spatiotemporal features processed by the brain in natural vision. While numerous video-understanding models have emerged, incorporating videos into image-understanding models with spatiotemporal features has been limited. Drawing inspiration from natural vision, which exhibits remarkable resilience to input changes, our research focuses on the development of a brain-inspired model for vision understanding trained with videos. Our findings demonstrate that models that train on videos instead of still images and include temporal features become more resilient to various alternations on input media.
format	Preprint
id	arxiv_https___arxiv_org_abs_2311_00800
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Beyond still images: Temporal features and input variance resilience Fadaei, Amir Hosein Dehaqani, Mohammad-Reza A. Computer Vision and Pattern Recognition Artificial Intelligence I.2.10; I.5.1; I.4.8 Traditionally, vision models have predominantly relied on spatial features extracted from static images, deviating from the continuous stream of spatiotemporal features processed by the brain in natural vision. While numerous video-understanding models have emerged, incorporating videos into image-understanding models with spatiotemporal features has been limited. Drawing inspiration from natural vision, which exhibits remarkable resilience to input changes, our research focuses on the development of a brain-inspired model for vision understanding trained with videos. Our findings demonstrate that models that train on videos instead of still images and include temporal features become more resilient to various alternations on input media.
title	Beyond still images: Temporal features and input variance resilience
topic	Computer Vision and Pattern Recognition Artificial Intelligence I.2.10; I.5.1; I.4.8
url	https://arxiv.org/abs/2311.00800

Similar Items