Saved in:
| Main Authors: | Bu, Yiming, Liu, Jiayang, Qiu, Qinru |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.08936 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SAFER-AiD: Saccade-Assisted Foveal-peripheral vision Enhanced Reconstruction for Adversarial Defense
by: Liu, Jiayang, et al.
Published: (2025)
by: Liu, Jiayang, et al.
Published: (2025)
SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization
by: Li, Mingyang, et al.
Published: (2024)
by: Li, Mingyang, et al.
Published: (2024)
StreamSTGS: Streaming Spatial and Temporal Gaussian Grids for Real-Time Free-Viewpoint Video
by: Ke, Zhihui, et al.
Published: (2025)
by: Ke, Zhihui, et al.
Published: (2025)
TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation
by: Zhang, Hongyu, et al.
Published: (2026)
by: Zhang, Hongyu, et al.
Published: (2026)
SmartSight: Mitigating Hallucination in Video-LLMs Without Compromising Video Understanding via Temporal Attention Collapse
by: Sun, Yiming, et al.
Published: (2025)
by: Sun, Yiming, et al.
Published: (2025)
StreamForest: Efficient Online Video Understanding with Persistent Event Memory
by: Zeng, Xiangyu, et al.
Published: (2025)
by: Zeng, Xiangyu, et al.
Published: (2025)
Energy-Aware Imitation Learning for Steering Prediction Using Events and Frames
by: Cao, Hu, et al.
Published: (2026)
by: Cao, Hu, et al.
Published: (2026)
Assessing Situational and Spatial Awareness of VLMs with Synthetically Generated Video
by: Benschop, Pascal, et al.
Published: (2026)
by: Benschop, Pascal, et al.
Published: (2026)
LFS: Learnable Frame Selector for Event-Aware and Temporally Diverse Video Captioning
by: Chao, Lianying, et al.
Published: (2026)
by: Chao, Lianying, et al.
Published: (2026)
FluencyVE: Marrying Temporal-Aware Mamba with Bypass Attention for Video Editing
by: Cai, Mingshu, et al.
Published: (2025)
by: Cai, Mingshu, et al.
Published: (2025)
SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation
by: Du, Hao, et al.
Published: (2025)
by: Du, Hao, et al.
Published: (2025)
Memory Helps, but Confabulation Misleads: Understanding Streaming Events in Videos with MLLMs
by: Zhang, Gengyuan, et al.
Published: (2025)
by: Zhang, Gengyuan, et al.
Published: (2025)
Event-based Video Person Re-identification via Cross-Modality and Temporal Collaboration
by: Li, Renkai, et al.
Published: (2025)
by: Li, Renkai, et al.
Published: (2025)
MapATM: Enhancing HD Map Construction through Actor Trajectory Modeling
by: Li, Mingyang, et al.
Published: (2026)
by: Li, Mingyang, et al.
Published: (2026)
Memory-efficient Streaming VideoLLMs for Real-time Procedural Video Understanding
by: Chatterjee, Dibyadip, et al.
Published: (2025)
by: Chatterjee, Dibyadip, et al.
Published: (2025)
End-to-End Streaming Video Temporal Action Segmentation with Reinforce Learning
by: Zhang, Jinrong, et al.
Published: (2023)
by: Zhang, Jinrong, et al.
Published: (2023)
OASIS: On-Demand Hierarchical Event Memory for Streaming Video Reasoning
by: Liang, Zhijia, et al.
Published: (2026)
by: Liang, Zhijia, et al.
Published: (2026)
EndoStreamDepth: Temporally Consistent Monocular Depth Estimation for Endoscopic Video Streams
by: Li, Hao, et al.
Published: (2025)
by: Li, Hao, et al.
Published: (2025)
TRACE: Temporal Grounding Video LLM via Causal Event Modeling
by: Guo, Yongxin, et al.
Published: (2024)
by: Guo, Yongxin, et al.
Published: (2024)
EventTracer: Fast Path Tracing-based Event Stream Rendering
by: Li, Zhenyang, et al.
Published: (2025)
by: Li, Zhenyang, et al.
Published: (2025)
Three-Stream Temporal-Shift Attention Network Based on Self-Knowledge Distillation for Micro-Expression Recognition
by: Zhu, Guanghao, et al.
Published: (2024)
by: Zhu, Guanghao, et al.
Published: (2024)
RISAM: Referring Image Segmentation via Mutual-Aware Attention Features
by: Zhang, Mengxi, et al.
Published: (2023)
by: Zhang, Mengxi, et al.
Published: (2023)
Recurrent Attention-based Token Selection for Efficient Streaming Video-LLMs
by: Dorovatas, Vaggelis, et al.
Published: (2025)
by: Dorovatas, Vaggelis, et al.
Published: (2025)
VCBench: A Streaming Counting Benchmark for Spatial-Temporal State Maintenance in Long Videos
by: Liu, Pengyiang, et al.
Published: (2026)
by: Liu, Pengyiang, et al.
Published: (2026)
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation
by: Wu, Bin, et al.
Published: (2026)
by: Wu, Bin, et al.
Published: (2026)
HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
by: Qiu, Haonan, et al.
Published: (2025)
by: Qiu, Haonan, et al.
Published: (2025)
VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking
by: Hu, Runyi, et al.
Published: (2025)
by: Hu, Runyi, et al.
Published: (2025)
DeformStream: Deformation-based Adaptive Volumetric Video Streaming
by: Li, Boyan, et al.
Published: (2024)
by: Li, Boyan, et al.
Published: (2024)
ImVideoEdit: Image-learning Video Editing via 2D Spatial Difference Attention Blocks
by: Xu, Jiayang, et al.
Published: (2026)
by: Xu, Jiayang, et al.
Published: (2026)
Prompt Relay: Inference-Time Temporal Control for Multi-Event Video Generation
by: Chen, Gordon, et al.
Published: (2026)
by: Chen, Gordon, et al.
Published: (2026)
EventVAD: Training-Free Event-Aware Video Anomaly Detection
by: Shao, Yihua, et al.
Published: (2025)
by: Shao, Yihua, et al.
Published: (2025)
StreamChat: Chatting with Streaming Video
by: Liu, Jihao, et al.
Published: (2024)
by: Liu, Jihao, et al.
Published: (2024)
StreamingCoT: A Dataset for Temporal Dynamics and Multimodal Chain-of-Thought Reasoning in Streaming VideoQA
by: Hu, Yuhang, et al.
Published: (2025)
by: Hu, Yuhang, et al.
Published: (2025)
EventGait: Towards Robust Gait Recognition with Event Streams
by: Xu, Senyan, et al.
Published: (2026)
by: Xu, Senyan, et al.
Published: (2026)
Hierarchical Event Memory for Accurate and Low-latency Online Video Temporal Grounding
by: Zheng, Minghang, et al.
Published: (2025)
by: Zheng, Minghang, et al.
Published: (2025)
Event-VStream: Event-Driven Real-Time Understanding for Long Video Streams
by: Guo, Zhenghui, et al.
Published: (2026)
by: Guo, Zhenghui, et al.
Published: (2026)
Sparse-Dense Side-Tuner for efficient Video Temporal Grounding
by: Pujol-Perich, David, et al.
Published: (2025)
by: Pujol-Perich, David, et al.
Published: (2025)
Cluster-based Video Summarization with Temporal Context Awareness
by: Huynh-Lam, Hai-Dang, et al.
Published: (2024)
by: Huynh-Lam, Hai-Dang, et al.
Published: (2024)
GraphThinker: Reinforcing Temporally Grounded Video Reasoning with Event Graph Thinking
by: Cheng, Zixu, et al.
Published: (2026)
by: Cheng, Zixu, et al.
Published: (2026)
Mind the Time: Temporally-Controlled Multi-Event Video Generation
by: Wu, Ziyi, et al.
Published: (2024)
by: Wu, Ziyi, et al.
Published: (2024)
Similar Items
-
SAFER-AiD: Saccade-Assisted Foveal-peripheral vision Enhanced Reconstruction for Adversarial Defense
by: Liu, Jiayang, et al.
Published: (2025) -
SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization
by: Li, Mingyang, et al.
Published: (2024) -
StreamSTGS: Streaming Spatial and Temporal Gaussian Grids for Real-Time Free-Viewpoint Video
by: Ke, Zhihui, et al.
Published: (2025) -
TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation
by: Zhang, Hongyu, et al.
Published: (2026) -
SmartSight: Mitigating Hallucination in Video-LLMs Without Compromising Video Understanding via Temporal Attention Collapse
by: Sun, Yiming, et al.
Published: (2025)