| Main Authors: | Li, KaiZhou; Gu, Jindong; Yu, Xinchun; Cao, Junjie; Tang, Yansong; Zhang, Xiao-Ping |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.17746 |
Similar Items
VideoGuard: Protecting Video Content from Unauthorized Editing
by: Cao, Junjie, et al.
Published: (2025)
Temporal Pair Consistency for Variance-Reduced Flow Matching
by: Maduabuchi, Chika, et al.
Published: (2026)
FADE: Frequency-Aware Diffusion Model Factorization for Video Editing
by: Zhu, Yixuan, et al.
Published: (2025)
Understanding Temporal Logic Consistency in Video-Language Models through Cross-Modal Attention Discriminability
by: Li, Chengzhi, et al.
Published: (2025)
1st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation
by: Luo, Zhuoyan, et al.
Published: (2024)
Beyond Cross-Modal Alignment: Measuring and Leveraging Modality Gap in Vision-Language Models
by: Yan, Hanqi, et al.
Published: (2025)
Video Flow as Time Series: Discovering Temporal Consistency and Variability for VideoQA
by: Song, Zijie, et al.
Published: (2025)
RelightVid: Temporal-Consistent Diffusion Model for Video Relighting
by: Fang, Ye, et al.
Published: (2025)
A Survey on Responsible Generative AI: What to Generate and What Not
by: Gu, Jindong
Published: (2024)
Leveraging the Video-level Semantic Consistency of Event for Audio-visual Event Localization
by: Jiang, Yuanyuan, et al.
Published: (2022)
Underwater object detection in sonar imagery with detection transformer and Zero-shot neural architecture search
by: Gu, XiaoTong, et al.
Published: (2025)
FastInit: Fast Noise Initialization for Temporally Consistent Video Generation
by: Bai, Chengyu, et al.
Published: (2025)
DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation
by: Zhang, Runze, et al.
Published: (2025)
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
by: Wang, Haibo, et al.
Published: (2024)
Detecting AI-Generated Video via Frame Consistency
by: Ma, Long, et al.
Published: (2024)
IDESplat: Iterative Depth Probability Estimation for Generalizable 3D Gaussian Splatting
by: Long, Wei, et al.
Published: (2026)
One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution
by: Sun, Yujing, et al.
Published: (2025)
LSA: Localized Semantic Alignment for Enhancing Temporal Consistency in Traffic Video Generation
by: Karimov, Mirlan, et al.
Published: (2026)
Localizing Events in Videos with Multimodal Queries
by: Zhang, Gengyuan, et al.
Published: (2024)
VideoGPA: Distilling Geometry Priors for 3D-Consistent Video Generation
by: Du, Hongyang, et al.
Published: (2026)
Fast Adversarial Training with Weak-to-Strong Spatial-Temporal Consistency in the Frequency Domain on Videos
by: Wang, Songping, et al.
Published: (2025)
AlcheMinT: Fine-grained Temporal Control for Multi-Reference Consistent Video Generation
by: Girish, Sharath, et al.
Published: (2025)
SPARROW: Learning Spatial Precision and Temporal Referential Consistency in Pixel-Grounded Video MLLMs
by: Alansari, Mohamad, et al.
Published: (2026)
Leveraging Imperfect Medical Data: A Manifold-Consistent Spatio-Temporal Network for Sensor-based Human Activity Recognition
by: Fan, Jiangtao, et al.
Published: (2026)
Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective
by: Fang, Xiang, et al.
Published: (2026)
Vid-Freeze: Protecting Images from Malicious Image-to-Video Generation via Temporal Freezing
by: Chowdhury, Rohit, et al.
Published: (2025)
LensWalk: Agentic Video Understanding by Planning How You See in Videos
by: Li, Keliang, et al.
Published: (2026)
Tempo-R0: A Video-MLLM for Temporal Video Grounding through Efficient Temporal Sensing Reinforcement Learning
by: Yue, Feng, et al.
Published: (2025)
Image Tokens Matter: Mitigating Hallucination in Discrete Tokenizer-based Large Vision-Language Models via Latent Editing
by: Wang, Weixing, et al.
Published: (2025)
Exposing and Mitigating Temporal Attack in Deepfake Video Detection
by: Gu, Zheyuan, et al.
Published: (2026)
Zero-TIG: Temporal Consistency-Aware Zero-Shot Illumination-Guided Low-light Video Enhancement
by: Li, Yini, et al.
Published: (2025)
VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models
by: Tang, Duoxun, et al.
Published: (2026)
A Survey: Spatiotemporal Consistency in Video Generation
by: Yin, Zhiyu, et al.
Published: (2025)
Acquiring Weak Annotations for Tumor Localization in Temporal and Volumetric Data
by: Chou, Yu-Cheng, et al.
Published: (2023)
Video Generation with Consistency Tuning
by: Wang, Chaoyi, et al.
Published: (2024)
CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video
by: Li, Lingen, et al.
Published: (2026)
Concat-ID: Towards Universal Identity-Preserving Video Synthesis
by: Zhong, Yong, et al.
Published: (2025)
FTBC: Forward Temporal Bias Correction for Optimizing ANN-SNN Conversion
by: Wu, Xiaofeng, et al.
Published: (2024)
Temporal Aware Pruning for Efficient Diffusion-based Video Generation
by: Li, Sheng, et al.
Published: (2026)
AI Powered High Quality Text to Video Generation with Enhanced Temporal Consistency
by: Patel, Piyushkumar
Published: (2025)