Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Pan, Yi, Huang, Jun-Jie, Chen, Zihan, Zhao, Wentao, Wang, Ziyue
Format:	Preprint
Veröffentlicht:	2024
Schlagworte:	Computer Vision and Pattern Recognition
Online-Zugang:	https://arxiv.org/abs/2406.01894
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866913376387465216
author	Pan, Yi Huang, Jun-Jie Chen, Zihan Zhao, Wentao Wang, Ziyue
author_facet	Pan, Yi Huang, Jun-Jie Chen, Zihan Zhao, Wentao Wang, Ziyue
contents	Robust and imperceptible adversarial video attack is challenging due to the spatial and temporal characteristics of videos. The existing video adversarial attack methods mainly take a gradient-based approach and generate adversarial videos with noticeable perturbations. In this paper, we propose a novel Sparse Adversarial Video Attack via Spatio-Temporal Invertible Neural Networks (SVASTIN) to generate adversarial videos through spatio-temporal feature space information exchanging. It consists of a Guided Target Video Learning (GTVL) module to balance the perturbation budget and optimization speed and a Spatio-Temporal Invertible Neural Network (STIN) module to perform spatio-temporal feature space information exchanging between a source video and the target feature tensor learned by GTVL module. Extensive experiments on UCF-101 and Kinetics-400 demonstrate that our proposed SVASTIN can generate adversarial examples with higher imperceptibility than the state-of-the-art methods with the higher fooling rate. Code is available at \href{https://github.com/Brittany-Chen/SVASTIN}{https://github.com/Brittany-Chen/SVASTIN}.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_01894
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	SVASTIN: Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks Pan, Yi Huang, Jun-Jie Chen, Zihan Zhao, Wentao Wang, Ziyue Computer Vision and Pattern Recognition Robust and imperceptible adversarial video attack is challenging due to the spatial and temporal characteristics of videos. The existing video adversarial attack methods mainly take a gradient-based approach and generate adversarial videos with noticeable perturbations. In this paper, we propose a novel Sparse Adversarial Video Attack via Spatio-Temporal Invertible Neural Networks (SVASTIN) to generate adversarial videos through spatio-temporal feature space information exchanging. It consists of a Guided Target Video Learning (GTVL) module to balance the perturbation budget and optimization speed and a Spatio-Temporal Invertible Neural Network (STIN) module to perform spatio-temporal feature space information exchanging between a source video and the target feature tensor learned by GTVL module. Extensive experiments on UCF-101 and Kinetics-400 demonstrate that our proposed SVASTIN can generate adversarial examples with higher imperceptibility than the state-of-the-art methods with the higher fooling rate. Code is available at \href{https://github.com/Brittany-Chen/SVASTIN}{https://github.com/Brittany-Chen/SVASTIN}.
title	SVASTIN: Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2406.01894

Ähnliche Einträge