Saved in:
| Main Authors: | Torbunov, Dmitrii, Okuducu, Onur, Huang, Yi, Dim, Odera, Coles, Rebecca, Cui, Yonggang, Ren, Yihui |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.05240 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision
by: Torbunov, Dmitrii, et al.
Published: (2024)
by: Torbunov, Dmitrii, et al.
Published: (2024)
Studying Image Diffusion Features for Zero-Shot Video Object Segmentation
by: Delatolas, Thanos, et al.
Published: (2025)
by: Delatolas, Thanos, et al.
Published: (2025)
HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks
by: Ercan, Burak, et al.
Published: (2023)
by: Ercan, Burak, et al.
Published: (2023)
EVREAL: Towards a Comprehensive Benchmark and Analysis Suite for Event-based Video Reconstruction
by: Ercan, Burak, et al.
Published: (2023)
by: Ercan, Burak, et al.
Published: (2023)
CircuitSense: A Hierarchical MLLM Benchmark Bridging Visual Comprehension and Symbolic Reasoning in Engineering Design Process
by: Akbari, Arman, et al.
Published: (2025)
by: Akbari, Arman, et al.
Published: (2025)
Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction
by: Zhu, Lin, et al.
Published: (2024)
by: Zhu, Lin, et al.
Published: (2024)
Adapting Video Diffusion Models for Time-Lapse Microscopy
by: Holmberg, Alexander, et al.
Published: (2025)
by: Holmberg, Alexander, et al.
Published: (2025)
Boosting Unsupervised Video Instance Segmentation with Automatic Quality-Guided Self-Training
by: Lu, Kaixuan, et al.
Published: (2025)
by: Lu, Kaixuan, et al.
Published: (2025)
AVID: Adapting Video Diffusion Models to World Models
by: Rigter, Marc, et al.
Published: (2024)
by: Rigter, Marc, et al.
Published: (2024)
AutoQ-VIS: Improving Unsupervised Video Instance Segmentation via Automatic Quality Assessment
by: Lu, Kaixuan, et al.
Published: (2025)
by: Lu, Kaixuan, et al.
Published: (2025)
ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models
by: Fang, Zixun, et al.
Published: (2025)
by: Fang, Zixun, et al.
Published: (2025)
MindCine: Multimodal EEG-to-Video Reconstruction with Large-Scale Pretrained Models
by: Zhou, Tian-Yi, et al.
Published: (2026)
by: Zhou, Tian-Yi, et al.
Published: (2026)
Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling
by: Riise, Erik, et al.
Published: (2025)
by: Riise, Erik, et al.
Published: (2025)
Parameter Inference and Uncertainty Quantification with Diffusion Models: Extending CDI to 2D Spatial Conditioning
by: Torbunov, Dmitrii, et al.
Published: (2026)
by: Torbunov, Dmitrii, et al.
Published: (2026)
Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation
by: Jin, Luoxu, et al.
Published: (2024)
by: Jin, Luoxu, et al.
Published: (2024)
E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors
by: Liang, Jinxiu, et al.
Published: (2024)
by: Liang, Jinxiu, et al.
Published: (2024)
FRAG: Frequency Adapting Group for Diffusion Video Editing
by: Yoon, Sunjae, et al.
Published: (2024)
by: Yoon, Sunjae, et al.
Published: (2024)
EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction
by: Ge, Chengjie, et al.
Published: (2025)
by: Ge, Chengjie, et al.
Published: (2025)
Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation
by: Chen, Jingxi, et al.
Published: (2024)
by: Chen, Jingxi, et al.
Published: (2024)
VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining
by: Liu, Yunze, et al.
Published: (2025)
by: Liu, Yunze, et al.
Published: (2025)
Enhanced Event-Based Video Reconstruction with Motion Compensation
by: Liu, Siying, et al.
Published: (2024)
by: Liu, Siying, et al.
Published: (2024)
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation
by: Bahmani, Sherwin, et al.
Published: (2025)
by: Bahmani, Sherwin, et al.
Published: (2025)
Adapting VACE for Real-Time Autoregressive Video Diffusion
by: Fosdick, Ryan
Published: (2026)
by: Fosdick, Ryan
Published: (2026)
Harvest Video Foundation Models via Efficient Post-Pretraining
by: Li, Yizhuo, et al.
Published: (2023)
by: Li, Yizhuo, et al.
Published: (2023)
X2Video: Adapting Diffusion Models for Multimodal Controllable Neural Video Rendering
by: Huang, Zhitong, et al.
Published: (2025)
by: Huang, Zhitong, et al.
Published: (2025)
UniE2F: A Unified Diffusion Framework for Event-to-Frame Reconstruction with Video Foundation Models
by: Xu, Gang, et al.
Published: (2026)
by: Xu, Gang, et al.
Published: (2026)
Diffusion-Promoted HDR Video Reconstruction
by: Guan, Yuanshen, et al.
Published: (2024)
by: Guan, Yuanshen, et al.
Published: (2024)
Video Signature: Implicit Watermarking for Video Diffusion Models
by: Huang, Yu, et al.
Published: (2025)
by: Huang, Yu, et al.
Published: (2025)
Scaling Video Pretraining for Surgical Foundation Models
by: Lu, Sicheng, et al.
Published: (2026)
by: Lu, Sicheng, et al.
Published: (2026)
ARVideo: Autoregressive Pretraining for Self-Supervised Video Representation Learning
by: Ren, Sucheng, et al.
Published: (2024)
by: Ren, Sucheng, et al.
Published: (2024)
Can Video Diffusion Model Reconstruct 4D Geometry?
by: Mai, Jinjie, et al.
Published: (2025)
by: Mai, Jinjie, et al.
Published: (2025)
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction
by: Xing, Zhen, et al.
Published: (2024)
by: Xing, Zhen, et al.
Published: (2024)
Audio-visual Event Localization on Portrait Mode Short Videos
by: Liu, Wuyang, et al.
Published: (2025)
by: Liu, Wuyang, et al.
Published: (2025)
Revisit Event Generation Model: Self-Supervised Learning of Event-to-Video Reconstruction with Implicit Neural Representations
by: Wang, Zipeng, et al.
Published: (2024)
by: Wang, Zipeng, et al.
Published: (2024)
Pusa V1.0: Unlocking Temporal Control in Pretrained Video Diffusion Models via Vectorized Timestep Adaptation
by: Liu, Yaofang, et al.
Published: (2025)
by: Liu, Yaofang, et al.
Published: (2025)
AdaViewPlanner: Adapting Video Diffusion Models for Viewpoint Planning in 4D Scenes
by: Li, Yu, et al.
Published: (2025)
by: Li, Yu, et al.
Published: (2025)
EasyOmnimatte: Taming Pretrained Inpainting Diffusion Models for End-to-End Video Layered Decomposition
by: Hu, Yihan, et al.
Published: (2025)
by: Hu, Yihan, et al.
Published: (2025)
DiffuEraser: A Diffusion Model for Video Inpainting
by: Li, Xiaowen, et al.
Published: (2025)
by: Li, Xiaowen, et al.
Published: (2025)
Static Scene Reconstruction from Dynamic Egocentric Videos
by: Cui, Qifei, et al.
Published: (2026)
by: Cui, Qifei, et al.
Published: (2026)
HiddenObjects: Scalable Diffusion-Distilled Spatial Priors for Object Placement
by: Schouten, Marco, et al.
Published: (2026)
by: Schouten, Marco, et al.
Published: (2026)
Similar Items
-
EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision
by: Torbunov, Dmitrii, et al.
Published: (2024) -
Studying Image Diffusion Features for Zero-Shot Video Object Segmentation
by: Delatolas, Thanos, et al.
Published: (2025) -
HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks
by: Ercan, Burak, et al.
Published: (2023) -
EVREAL: Towards a Comprehensive Benchmark and Analysis Suite for Event-based Video Reconstruction
by: Ercan, Burak, et al.
Published: (2023) -
CircuitSense: A Hierarchical MLLM Benchmark Bridging Visual Comprehension and Symbolic Reasoning in Engineering Design Process
by: Akbari, Arman, et al.
Published: (2025)