Saved in:
| Main Authors: | Wang, Shuyun, Zhang, Hu, Shen, Xin, Wang, Dadong, Yu, Xin |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.13906 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Blind Bitstream-corrupted Video Recovery via a Visual Foundation Model-driven Framework
by: Liu, Tianyi, et al.
Published: (2025)
by: Liu, Tianyi, et al.
Published: (2025)
Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes
by: Wang, Shuyun, et al.
Published: (2025)
by: Wang, Shuyun, et al.
Published: (2025)
Enhancing User-Centric Privacy Protection: An Interactive Framework through Diffusion Models and Machine Unlearning
by: Huang, Huaxi, et al.
Published: (2024)
by: Huang, Huaxi, et al.
Published: (2024)
7ABAW-Compound Expression Recognition via Curriculum Learning
by: Liu, Chen, et al.
Published: (2025)
by: Liu, Chen, et al.
Published: (2025)
NTIRE 2026 Challenge on Bitstream-Corrupted Video Restoration: Methods and Results
by: Zou, Wenbin, et al.
Published: (2026)
by: Zou, Wenbin, et al.
Published: (2026)
SGIA: Enhancing Fine-Grained Visual Classification with Sequence Generative Image Augmentation
by: Liao, Qiyu, et al.
Published: (2024)
by: Liao, Qiyu, et al.
Published: (2024)
Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment
by: Liu, Chen, et al.
Published: (2025)
by: Liu, Chen, et al.
Published: (2025)
Learning to Refocus with Video Diffusion Models
by: Tedla, SaiKiran, et al.
Published: (2025)
by: Tedla, SaiKiran, et al.
Published: (2025)
SIGMark: Scalable In-Generation Watermark with Blind Extraction for Video Diffusion
by: Zhu, Xinjie, et al.
Published: (2026)
by: Zhu, Xinjie, et al.
Published: (2026)
Frequency-Guided Diffusion Model with Perturbation Training for Skeleton-Based Video Anomaly Detection
by: Tan, Xiaofeng, et al.
Published: (2024)
by: Tan, Xiaofeng, et al.
Published: (2024)
Feature Denoising Diffusion Model for Blind Image Quality Assessment
by: Li, Xudong, et al.
Published: (2024)
by: Li, Xudong, et al.
Published: (2024)
Diffusion Models are Efficient Data Generators for Human Mesh Recovery
by: Ge, Yongtao, et al.
Published: (2024)
by: Ge, Yongtao, et al.
Published: (2024)
Edit Temporal-Consistent Videos with Image Diffusion Model
by: Wang, Yuanzhi, et al.
Published: (2023)
by: Wang, Yuanzhi, et al.
Published: (2023)
LiftVSR: Lifting Image Diffusion to Video Super-Resolution via Hybrid Temporal Modeling with Only 4$\times$RTX 4090s
by: Wang, Xijun, et al.
Published: (2025)
by: Wang, Xijun, et al.
Published: (2025)
CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion
by: Wang, Xingrui, et al.
Published: (2024)
by: Wang, Xingrui, et al.
Published: (2024)
Visual Superordinate Abstraction for Robust Concept Learning
by: Zheng, Qi, et al.
Published: (2022)
by: Zheng, Qi, et al.
Published: (2022)
Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics
by: Liu, Chen, et al.
Published: (2025)
by: Liu, Chen, et al.
Published: (2025)
CMamba: Learned Image Compression with State Space Models
by: Wu, Zhuojie, et al.
Published: (2025)
by: Wu, Zhuojie, et al.
Published: (2025)
BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution
by: Li, Feng, et al.
Published: (2024)
by: Li, Feng, et al.
Published: (2024)
Seeing Clearly, Reasoning Confidently: Plug-and-Play Remedies for Vision Language Model Blindness
by: Hu, Xin, et al.
Published: (2026)
by: Hu, Xin, et al.
Published: (2026)
Knowledge Priors for Identity-Disentangled Open-Set Privacy-Preserving Video FER
by: Xu, Feng, et al.
Published: (2026)
by: Xu, Feng, et al.
Published: (2026)
Towards Better Optimization For Listwise Preference in Diffusion Models
by: Bai, Jiamu, et al.
Published: (2025)
by: Bai, Jiamu, et al.
Published: (2025)
AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
by: Zhang, Haiyu, et al.
Published: (2025)
by: Zhang, Haiyu, et al.
Published: (2025)
TD-BFR: Truncated Diffusion Model for Efficient Blind Face Restoration
by: Zhang, Ziying, et al.
Published: (2025)
by: Zhang, Ziying, et al.
Published: (2025)
MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset
by: Shen, Xin, et al.
Published: (2024)
by: Shen, Xin, et al.
Published: (2024)
Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising
by: Chen, Zikang, et al.
Published: (2024)
by: Chen, Zikang, et al.
Published: (2024)
CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback
by: Ge, Wenhang, et al.
Published: (2026)
by: Ge, Wenhang, et al.
Published: (2026)
Information Prebuilt Recurrent Reconstruction Network for Video Super-Resolution
by: Wang, Shuyun, et al.
Published: (2021)
by: Wang, Shuyun, et al.
Published: (2021)
Structure-guided Diffusion Transformer for Low-Light Image Enhancement
by: Yin, Xiangchen, et al.
Published: (2025)
by: Yin, Xiangchen, et al.
Published: (2025)
URSimulator: Human-Perception-Driven Prompt Tuning for Enhanced Virtual Urban Renewal via Diffusion Models
by: Hu, Chuanbo, et al.
Published: (2024)
by: Hu, Chuanbo, et al.
Published: (2024)
HandDiffuse: Generative Controllers for Two-Hand Interactions via Diffusion Models
by: Lin, Pei, et al.
Published: (2023)
by: Lin, Pei, et al.
Published: (2023)
Zero-Shot Video Restoration and Enhancement with Assistance of Video Diffusion Models
by: Cao, Cong, et al.
Published: (2026)
by: Cao, Cong, et al.
Published: (2026)
Single-Shot HDR Recovery via a Video Diffusion Prior
by: Talegaonkar, Chinmay, et al.
Published: (2026)
by: Talegaonkar, Chinmay, et al.
Published: (2026)
VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking
by: Hu, Runyi, et al.
Published: (2025)
by: Hu, Runyi, et al.
Published: (2025)
Latte: Latent Diffusion Transformer for Video Generation
by: Ma, Xin, et al.
Published: (2024)
by: Ma, Xin, et al.
Published: (2024)
Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos
by: Wang, Zhouxia, et al.
Published: (2024)
by: Wang, Zhouxia, et al.
Published: (2024)
4Diffusion: Multi-view Video Diffusion Model for 4D Generation
by: Zhang, Haiyu, et al.
Published: (2024)
by: Zhang, Haiyu, et al.
Published: (2024)
TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation
by: Wang, Xingrui, et al.
Published: (2024)
by: Wang, Xingrui, et al.
Published: (2024)
DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing
by: Wang, Weitao, et al.
Published: (2025)
by: Wang, Weitao, et al.
Published: (2025)
DiffusionAD: Norm-guided One-step Denoising Diffusion for Anomaly Detection
by: Zhang, Hui, et al.
Published: (2023)
by: Zhang, Hui, et al.
Published: (2023)
Similar Items
-
Towards Blind Bitstream-corrupted Video Recovery via a Visual Foundation Model-driven Framework
by: Liu, Tianyi, et al.
Published: (2025) -
Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes
by: Wang, Shuyun, et al.
Published: (2025) -
Enhancing User-Centric Privacy Protection: An Interactive Framework through Diffusion Models and Machine Unlearning
by: Huang, Huaxi, et al.
Published: (2024) -
7ABAW-Compound Expression Recognition via Curriculum Learning
by: Liu, Chen, et al.
Published: (2025) -
NTIRE 2026 Challenge on Bitstream-Corrupted Video Restoration: Methods and Results
by: Zou, Wenbin, et al.
Published: (2026)