:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Pan, Yi, Huang, Jun-Jie, Chen, Zihan, Zhao, Wentao, Wang, Ziyue
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2406.01894
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

TVRN: Invertible Neural Networks for Compression-Aware Temporal Video Rescaling
by: Feng, Xinmin, et al.
Published: (2026)

Dynamics-aware Adversarial Attack of Adaptive Neural Networks
by: Tao, An, et al.
Published: (2022)

SpatioTemporal Learning for Human Pose Estimation in Sparsely-Labeled Videos
by: Jiao, Yingying, et al.
Published: (2025)

Towards Effective and Sparse Adversarial Attack on Spiking Neural Networks via Breaking Invisible Surrogate Gradients
by: Lun, Li, et al.
Published: (2025)

SMILENet: Unleashing Extra-Large Capacity Image Steganography via a Synergistic Mosaic InvertibLE Hiding Network
by: Huang, Jun-Jie, et al.
Published: (2025)

VideoFusion: A Spatio-Temporal Collaborative Network for Multi-modal Video Fusion
by: Tang, Linfeng, et al.
Published: (2025)

SparseCam4D: Spatio-Temporally Consistent 4D Reconstruction from Sparse Cameras
by: Pan, Weihong, et al.
Published: (2026)

Transferability of Adversarial Attacks in Video-based MLLMs: A Cross-modal Image-to-Video Approach
by: Huang, Linhao, et al.
Published: (2025)

SpatioTemporal Difference Network for Video Depth Super-Resolution
by: Wang, Zhengxue, et al.
Published: (2025)

Event-to-Video Reconstruction using Spatio-Temporal and Frequency-Enhanced Deep Neural Networks
by: Maqsood, Ramna, et al.
Published: (2026)

Context-Guided Spatio-Temporal Video Grounding
by: Gu, Xin, et al.
Published: (2024)

Pseudo-Invertible Neural Networks
by: Ehrlich, Yamit, et al.
Published: (2026)

Video-Language Alignment via Spatio-Temporal Graph Transformer
by: Zhang, Shi-Xue, et al.
Published: (2024)

Invertible Diffusion Models for Compressed Sensing
by: Chen, Bin, et al.
Published: (2024)

FiLA-Video: Spatio-Temporal Compression for Fine-Grained Long Video Understanding
by: Guo, Yanan, et al.
Published: (2025)

Compact Attention: Exploiting Structured Spatio-Temporal Sparsity for Fast Video Generation
by: Li, Qirui, et al.
Published: (2025)

VideoStir: Understanding Long Videos via Spatio-Temporally Structured and Intent-Aware RAG
by: Fu, Honghao, et al.
Published: (2026)

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning
by: Li, Xinhao, et al.
Published: (2025)

STCDiT: Spatio-Temporally Consistent Diffusion Transformer for High-Quality Video Super-Resolution
by: Chen, Junyang, et al.
Published: (2025)

SAIF: Sparse Adversarial and Imperceptible Attack Framework
by: Imtiaz, Tooba, et al.
Published: (2022)

T2VAttack: Adversarial Attack on Text-to-Video Diffusion Models
by: Li, Changzhen, et al.
Published: (2025)

Robust Spiking Neural Networks Against Adversarial Attacks
by: Wang, Shuai, et al.
Published: (2026)

Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models
by: Zhang, Naifu, et al.
Published: (2025)

Towards Long-Form Spatio-Temporal Video Grounding
by: Gu, Xin, et al.
Published: (2026)

Vectorized Video Representation with Easy Editing via Hierarchical Spatio-Temporally Consistent Proxy Embedding
by: Chen, Ye, et al.
Published: (2025)

Guided Diffusion-based Generation of Adversarial Objects for Real-World Monocular Depth Estimation Attacks
by: Chen, Yongtao, et al.
Published: (2025)

Spatio-Temporal Distortion Aware Omnidirectional Video Super-Resolution
by: An, Hongyu, et al.
Published: (2024)

MuseTalk: Real-Time High-Fidelity Video Dubbing via Spatio-Temporal Sampling
by: Zhang, Yue, et al.
Published: (2024)

PoseAnything: Universal Pose-guided Video Generation with Part-aware Temporal Coherence
by: Wang, Ruiyan, et al.
Published: (2025)

Cluster-Wise Spatio-Temporal Masking for Efficient Video-Language Pretraining
by: Zhuang, Weijun, et al.
Published: (2026)

VISTA: Video Interaction Spatio-Temporal Analysis Benchmark
by: Aparcedo, Alejandro, et al.
Published: (2026)

Fooling Neural Networks for Motion Forecasting via Adversarial Attacks
by: Medina, Edgar, et al.
Published: (2024)

STAR-Pose: Efficient Low-Resolution Video Human Pose Estimation via Spatial-Temporal Adaptive Super-Resolution
by: Jin, Yucheng, et al.
Published: (2025)

EvoVid: Temporal-Centric Self-Evolution for Video Large Language Models
by: Huang, Shiqi, et al.
Published: (2026)

Left-right Discrepancy for Adversarial Attack on Stereo Networks
by: Wang, Pengfei, et al.
Published: (2024)

Interpreting Video Representations with Spatio-Temporal Sparse Autoencoders
by: Dokme, Atahan, et al.
Published: (2026)

UHD-GPGNet: UHD Video Denoising via Gaussian-Process-Guided Local Spatio-Temporal Modeling
by: He, Weiyuan, et al.
Published: (2026)

Open-Vocabulary Spatio-Temporal Action Detection
by: Wu, Tao, et al.
Published: (2024)

Thinking With Bounding Boxes: Enhancing Spatio-Temporal Video Grounding via Reinforcement Fine-Tuning
by: Gu, Xin, et al.
Published: (2025)

Mango-GS: Enhancing Spatio-Temporal Consistency in Dynamic Scenes Reconstruction using Multi-Frame Node-Guided 4D Gaussian Splatting
by: Huang, Tingxuan, et al.
Published: (2026)