:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Torbunov, Dmitrii, Okuducu, Onur, Huang, Yi, Dim, Odera, Coles, Rebecca, Cui, Yonggang, Ren, Yihui
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2512.05240
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision
by: Torbunov, Dmitrii, et al.
Published: (2024)

Studying Image Diffusion Features for Zero-Shot Video Object Segmentation
by: Delatolas, Thanos, et al.
Published: (2025)

HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks
by: Ercan, Burak, et al.
Published: (2023)

EVREAL: Towards a Comprehensive Benchmark and Analysis Suite for Event-based Video Reconstruction
by: Ercan, Burak, et al.
Published: (2023)

CircuitSense: A Hierarchical MLLM Benchmark Bridging Visual Comprehension and Symbolic Reasoning in Engineering Design Process
by: Akbari, Arman, et al.
Published: (2025)

Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction
by: Zhu, Lin, et al.
Published: (2024)

Adapting Video Diffusion Models for Time-Lapse Microscopy
by: Holmberg, Alexander, et al.
Published: (2025)

Boosting Unsupervised Video Instance Segmentation with Automatic Quality-Guided Self-Training
by: Lu, Kaixuan, et al.
Published: (2025)

AVID: Adapting Video Diffusion Models to World Models
by: Rigter, Marc, et al.
Published: (2024)

AutoQ-VIS: Improving Unsupervised Video Instance Segmentation via Automatic Quality Assessment
by: Lu, Kaixuan, et al.
Published: (2025)

ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models
by: Fang, Zixun, et al.
Published: (2025)

MindCine: Multimodal EEG-to-Video Reconstruction with Large-Scale Pretrained Models
by: Zhou, Tian-Yi, et al.
Published: (2026)

Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling
by: Riise, Erik, et al.
Published: (2025)

Parameter Inference and Uncertainty Quantification with Diffusion Models: Extending CDI to 2D Spatial Conditioning
by: Torbunov, Dmitrii, et al.
Published: (2026)

Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation
by: Jin, Luoxu, et al.
Published: (2024)

E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors
by: Liang, Jinxiu, et al.
Published: (2024)

FRAG: Frequency Adapting Group for Diffusion Video Editing
by: Yoon, Sunjae, et al.
Published: (2024)

EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction
by: Ge, Chengjie, et al.
Published: (2025)

Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation
by: Chen, Jingxi, et al.
Published: (2024)

VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining
by: Liu, Yunze, et al.
Published: (2025)

Enhanced Event-Based Video Reconstruction with Motion Compensation
by: Liu, Siying, et al.
Published: (2024)

Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation
by: Bahmani, Sherwin, et al.
Published: (2025)

Adapting VACE for Real-Time Autoregressive Video Diffusion
by: Fosdick, Ryan
Published: (2026)

Harvest Video Foundation Models via Efficient Post-Pretraining
by: Li, Yizhuo, et al.
Published: (2023)

X2Video: Adapting Diffusion Models for Multimodal Controllable Neural Video Rendering
by: Huang, Zhitong, et al.
Published: (2025)

UniE2F: A Unified Diffusion Framework for Event-to-Frame Reconstruction with Video Foundation Models
by: Xu, Gang, et al.
Published: (2026)

Diffusion-Promoted HDR Video Reconstruction
by: Guan, Yuanshen, et al.
Published: (2024)

Video Signature: Implicit Watermarking for Video Diffusion Models
by: Huang, Yu, et al.
Published: (2025)

Scaling Video Pretraining for Surgical Foundation Models
by: Lu, Sicheng, et al.
Published: (2026)

ARVideo: Autoregressive Pretraining for Self-Supervised Video Representation Learning
by: Ren, Sucheng, et al.
Published: (2024)

Can Video Diffusion Model Reconstruct 4D Geometry?
by: Mai, Jinjie, et al.
Published: (2025)

AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction
by: Xing, Zhen, et al.
Published: (2024)

Audio-visual Event Localization on Portrait Mode Short Videos
by: Liu, Wuyang, et al.
Published: (2025)

Revisit Event Generation Model: Self-Supervised Learning of Event-to-Video Reconstruction with Implicit Neural Representations
by: Wang, Zipeng, et al.
Published: (2024)

Pusa V1.0: Unlocking Temporal Control in Pretrained Video Diffusion Models via Vectorized Timestep Adaptation
by: Liu, Yaofang, et al.
Published: (2025)

AdaViewPlanner: Adapting Video Diffusion Models for Viewpoint Planning in 4D Scenes
by: Li, Yu, et al.
Published: (2025)

EasyOmnimatte: Taming Pretrained Inpainting Diffusion Models for End-to-End Video Layered Decomposition
by: Hu, Yihan, et al.
Published: (2025)

DiffuEraser: A Diffusion Model for Video Inpainting
by: Li, Xiaowen, et al.
Published: (2025)

Static Scene Reconstruction from Dynamic Egocentric Videos
by: Cui, Qifei, et al.
Published: (2026)

HiddenObjects: Scalable Diffusion-Distilled Spatial Priors for Object Placement
by: Schouten, Marco, et al.
Published: (2026)