:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Shuyun, Zhang, Hu, Shen, Xin, Wang, Dadong, Yu, Xin
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2604.13906
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Towards Blind Bitstream-corrupted Video Recovery via a Visual Foundation Model-driven Framework
by: Liu, Tianyi, et al.
Published: (2025)

Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes
by: Wang, Shuyun, et al.
Published: (2025)

Enhancing User-Centric Privacy Protection: An Interactive Framework through Diffusion Models and Machine Unlearning
by: Huang, Huaxi, et al.
Published: (2024)

7ABAW-Compound Expression Recognition via Curriculum Learning
by: Liu, Chen, et al.
Published: (2025)

NTIRE 2026 Challenge on Bitstream-Corrupted Video Restoration: Methods and Results
by: Zou, Wenbin, et al.
Published: (2026)

SGIA: Enhancing Fine-Grained Visual Classification with Sequence Generative Image Augmentation
by: Liao, Qiyu, et al.
Published: (2024)

Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment
by: Liu, Chen, et al.
Published: (2025)

Learning to Refocus with Video Diffusion Models
by: Tedla, SaiKiran, et al.
Published: (2025)

SIGMark: Scalable In-Generation Watermark with Blind Extraction for Video Diffusion
by: Zhu, Xinjie, et al.
Published: (2026)

Frequency-Guided Diffusion Model with Perturbation Training for Skeleton-Based Video Anomaly Detection
by: Tan, Xiaofeng, et al.
Published: (2024)

Feature Denoising Diffusion Model for Blind Image Quality Assessment
by: Li, Xudong, et al.
Published: (2024)

Diffusion Models are Efficient Data Generators for Human Mesh Recovery
by: Ge, Yongtao, et al.
Published: (2024)

Edit Temporal-Consistent Videos with Image Diffusion Model
by: Wang, Yuanzhi, et al.
Published: (2023)

LiftVSR: Lifting Image Diffusion to Video Super-Resolution via Hybrid Temporal Modeling with Only 4$\times$RTX 4090s
by: Wang, Xijun, et al.
Published: (2025)

CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion
by: Wang, Xingrui, et al.
Published: (2024)

Visual Superordinate Abstraction for Robust Concept Learning
by: Zheng, Qi, et al.
Published: (2022)

Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics
by: Liu, Chen, et al.
Published: (2025)

CMamba: Learned Image Compression with State Space Models
by: Wu, Zhuojie, et al.
Published: (2025)

BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution
by: Li, Feng, et al.
Published: (2024)

Seeing Clearly, Reasoning Confidently: Plug-and-Play Remedies for Vision Language Model Blindness
by: Hu, Xin, et al.
Published: (2026)

Knowledge Priors for Identity-Disentangled Open-Set Privacy-Preserving Video FER
by: Xu, Feng, et al.
Published: (2026)

Towards Better Optimization For Listwise Preference in Diffusion Models
by: Bai, Jiamu, et al.
Published: (2025)

AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
by: Zhang, Haiyu, et al.
Published: (2025)

TD-BFR: Truncated Diffusion Model for Efficient Blind Face Restoration
by: Zhang, Ziying, et al.
Published: (2025)

MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset
by: Shen, Xin, et al.
Published: (2024)

Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising
by: Chen, Zikang, et al.
Published: (2024)

CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback
by: Ge, Wenhang, et al.
Published: (2026)

Information Prebuilt Recurrent Reconstruction Network for Video Super-Resolution
by: Wang, Shuyun, et al.
Published: (2021)

Structure-guided Diffusion Transformer for Low-Light Image Enhancement
by: Yin, Xiangchen, et al.
Published: (2025)

URSimulator: Human-Perception-Driven Prompt Tuning for Enhanced Virtual Urban Renewal via Diffusion Models
by: Hu, Chuanbo, et al.
Published: (2024)

HandDiffuse: Generative Controllers for Two-Hand Interactions via Diffusion Models
by: Lin, Pei, et al.
Published: (2023)

Zero-Shot Video Restoration and Enhancement with Assistance of Video Diffusion Models
by: Cao, Cong, et al.
Published: (2026)

Single-Shot HDR Recovery via a Video Diffusion Prior
by: Talegaonkar, Chinmay, et al.
Published: (2026)

VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking
by: Hu, Runyi, et al.
Published: (2025)

Latte: Latent Diffusion Transformer for Video Generation
by: Ma, Xin, et al.
Published: (2024)

Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos
by: Wang, Zhouxia, et al.
Published: (2024)

4Diffusion: Multi-view Video Diffusion Model for 4D Generation
by: Zhang, Haiyu, et al.
Published: (2024)

TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation
by: Wang, Xingrui, et al.
Published: (2024)

DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing
by: Wang, Weitao, et al.
Published: (2025)

DiffusionAD: Norm-guided One-step Denoising Diffusion for Anomaly Detection
by: Zhang, Hui, et al.
Published: (2023)