:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Bu, Yiming, Liu, Jiayang, Qiu, Qinru
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2402.08936
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SAFER-AiD: Saccade-Assisted Foveal-peripheral vision Enhanced Reconstruction for Adversarial Defense
by: Liu, Jiayang, et al.
Published: (2025)

SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization
by: Li, Mingyang, et al.
Published: (2024)

StreamSTGS: Streaming Spatial and Temporal Gaussian Grids for Real-Time Free-Viewpoint Video
by: Ke, Zhihui, et al.
Published: (2025)

TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation
by: Zhang, Hongyu, et al.
Published: (2026)

SmartSight: Mitigating Hallucination in Video-LLMs Without Compromising Video Understanding via Temporal Attention Collapse
by: Sun, Yiming, et al.
Published: (2025)

StreamForest: Efficient Online Video Understanding with Persistent Event Memory
by: Zeng, Xiangyu, et al.
Published: (2025)

Energy-Aware Imitation Learning for Steering Prediction Using Events and Frames
by: Cao, Hu, et al.
Published: (2026)

Assessing Situational and Spatial Awareness of VLMs with Synthetically Generated Video
by: Benschop, Pascal, et al.
Published: (2026)

LFS: Learnable Frame Selector for Event-Aware and Temporally Diverse Video Captioning
by: Chao, Lianying, et al.
Published: (2026)

FluencyVE: Marrying Temporal-Aware Mamba with Bypass Attention for Video Editing
by: Cai, Mingshu, et al.
Published: (2025)

SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation
by: Du, Hao, et al.
Published: (2025)

Memory Helps, but Confabulation Misleads: Understanding Streaming Events in Videos with MLLMs
by: Zhang, Gengyuan, et al.
Published: (2025)

Event-based Video Person Re-identification via Cross-Modality and Temporal Collaboration
by: Li, Renkai, et al.
Published: (2025)

MapATM: Enhancing HD Map Construction through Actor Trajectory Modeling
by: Li, Mingyang, et al.
Published: (2026)

Memory-efficient Streaming VideoLLMs for Real-time Procedural Video Understanding
by: Chatterjee, Dibyadip, et al.
Published: (2025)

End-to-End Streaming Video Temporal Action Segmentation with Reinforce Learning
by: Zhang, Jinrong, et al.
Published: (2023)

OASIS: On-Demand Hierarchical Event Memory for Streaming Video Reasoning
by: Liang, Zhijia, et al.
Published: (2026)

EndoStreamDepth: Temporally Consistent Monocular Depth Estimation for Endoscopic Video Streams
by: Li, Hao, et al.
Published: (2025)

TRACE: Temporal Grounding Video LLM via Causal Event Modeling
by: Guo, Yongxin, et al.
Published: (2024)

EventTracer: Fast Path Tracing-based Event Stream Rendering
by: Li, Zhenyang, et al.
Published: (2025)

Three-Stream Temporal-Shift Attention Network Based on Self-Knowledge Distillation for Micro-Expression Recognition
by: Zhu, Guanghao, et al.
Published: (2024)

RISAM: Referring Image Segmentation via Mutual-Aware Attention Features
by: Zhang, Mengxi, et al.
Published: (2023)

Recurrent Attention-based Token Selection for Efficient Streaming Video-LLMs
by: Dorovatas, Vaggelis, et al.
Published: (2025)

VCBench: A Streaming Counting Benchmark for Spatial-Temporal State Maintenance in Long Videos
by: Liu, Pengyiang, et al.
Published: (2026)

Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation
by: Wu, Bin, et al.
Published: (2026)

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
by: Qiu, Haonan, et al.
Published: (2025)

VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking
by: Hu, Runyi, et al.
Published: (2025)

DeformStream: Deformation-based Adaptive Volumetric Video Streaming
by: Li, Boyan, et al.
Published: (2024)

ImVideoEdit: Image-learning Video Editing via 2D Spatial Difference Attention Blocks
by: Xu, Jiayang, et al.
Published: (2026)

Prompt Relay: Inference-Time Temporal Control for Multi-Event Video Generation
by: Chen, Gordon, et al.
Published: (2026)

EventVAD: Training-Free Event-Aware Video Anomaly Detection
by: Shao, Yihua, et al.
Published: (2025)

StreamChat: Chatting with Streaming Video
by: Liu, Jihao, et al.
Published: (2024)

StreamingCoT: A Dataset for Temporal Dynamics and Multimodal Chain-of-Thought Reasoning in Streaming VideoQA
by: Hu, Yuhang, et al.
Published: (2025)

EventGait: Towards Robust Gait Recognition with Event Streams
by: Xu, Senyan, et al.
Published: (2026)

Hierarchical Event Memory for Accurate and Low-latency Online Video Temporal Grounding
by: Zheng, Minghang, et al.
Published: (2025)

Event-VStream: Event-Driven Real-Time Understanding for Long Video Streams
by: Guo, Zhenghui, et al.
Published: (2026)

Sparse-Dense Side-Tuner for efficient Video Temporal Grounding
by: Pujol-Perich, David, et al.
Published: (2025)

Cluster-based Video Summarization with Temporal Context Awareness
by: Huynh-Lam, Hai-Dang, et al.
Published: (2024)

GraphThinker: Reinforcing Temporally Grounded Video Reasoning with Event Graph Thinking
by: Cheng, Zixu, et al.
Published: (2026)

Mind the Time: Temporally-Controlled Multi-Event Video Generation
by: Wu, Ziyi, et al.
Published: (2024)