Saved in:
| Main Authors: | Xie, Linxi, Sun, Lisong C., Neall, Ashley, Wu, Tong, Cai, Shengqu, Wetzstein, Gordon |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.18422 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control
by: Kuang, Zhengfei, et al.
Published: (2024)
by: Kuang, Zhengfei, et al.
Published: (2024)
BulletTime: Decoupled Control of Time and Camera Pose for Video Generation
by: Wang, Yiming, et al.
Published: (2025)
by: Wang, Yiming, et al.
Published: (2025)
GeoFlow: Enforcing Implicit Geometric Consistency in Video Generation
by: Ackermann, Jan, et al.
Published: (2026)
by: Ackermann, Jan, et al.
Published: (2026)
Diffusion Self-Distillation for Zero-Shot Customized Image Generation
by: Cai, Shengqu, et al.
Published: (2024)
by: Cai, Shengqu, et al.
Published: (2024)
Frame Context Packing and Drift Prevention in Next-Frame-Prediction Video Diffusion Models
by: Zhang, Lvmin, et al.
Published: (2025)
by: Zhang, Lvmin, et al.
Published: (2025)
CameraCtrl: Enabling Camera Control for Text-to-Video Generation
by: He, Hao, et al.
Published: (2024)
by: He, Hao, et al.
Published: (2024)
ReStyle3D: Scene-Level Appearance Transfer with Semantic Correspondences
by: Zhu, Liyuan, et al.
Published: (2025)
by: Zhu, Liyuan, et al.
Published: (2025)
Mode Seeking meets Mean Seeking for Fast Long Video Generation
by: Cai, Shengqu, et al.
Published: (2026)
by: Cai, Shengqu, et al.
Published: (2026)
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
by: Zhang, Mengchen, et al.
Published: (2025)
by: Zhang, Mengchen, et al.
Published: (2025)
Mixture of Contexts for Long Video Generation
by: Cai, Shengqu, et al.
Published: (2025)
by: Cai, Shengqu, et al.
Published: (2025)
Infinite Gaze Generation for Videos with Autoregressive Diffusion
by: Kang, Jenna, et al.
Published: (2026)
by: Kang, Jenna, et al.
Published: (2026)
Neural Ganglion Sensors: Learning Task-specific Event Cameras Inspired by the Neural Circuit of the Human Retina
by: So, Haley M., et al.
Published: (2025)
by: So, Haley M., et al.
Published: (2025)
Pretraining Frame Preservation for Lightweight Autoregressive Video History Embedding
by: Zhang, Lvmin, et al.
Published: (2025)
by: Zhang, Lvmin, et al.
Published: (2025)
Captain Cinema: Towards Short Movie Generation
by: Xiao, Junfei, et al.
Published: (2025)
by: Xiao, Junfei, et al.
Published: (2025)
Robust Symmetry Detection via Riemannian Langevin Dynamics
by: Je, Jihyeon, et al.
Published: (2024)
by: Je, Jihyeon, et al.
Published: (2024)
Controllable Human-centric Keyframe Interpolation with Generative Prior
by: Guo, Zujin, et al.
Published: (2025)
by: Guo, Zujin, et al.
Published: (2025)
Video World Models with Long-term Spatial Memory
by: Wu, Tong, et al.
Published: (2025)
by: Wu, Tong, et al.
Published: (2025)
Spectral Progressive Diffusion for Efficient Image and Video Generation
by: Xiao, Howard, et al.
Published: (2026)
by: Xiao, Howard, et al.
Published: (2026)
CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization
by: Ackermann, Jan, et al.
Published: (2025)
by: Ackermann, Jan, et al.
Published: (2025)
Foveated Diffusion: Efficient Spatially Adaptive Image and Video Generation
by: Chao, Brian, et al.
Published: (2026)
by: Chao, Brian, et al.
Published: (2026)
Towards Vision-Language-Garment Models for Web Knowledge Garment Understanding and Generation
by: Ackermann, Jan, et al.
Published: (2025)
by: Ackermann, Jan, et al.
Published: (2025)
Interspatial Attention for Efficient 4D Human Video Generation
by: Shao, Ruizhi, et al.
Published: (2025)
by: Shao, Ruizhi, et al.
Published: (2025)
ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization
by: Guo, Yuanhe, et al.
Published: (2025)
by: Guo, Yuanhe, et al.
Published: (2025)
GazeFusion: Saliency-Guided Image Generation
by: Zhang, Yunxiang, et al.
Published: (2024)
by: Zhang, Yunxiang, et al.
Published: (2024)
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation
by: Wu, Tong, et al.
Published: (2024)
by: Wu, Tong, et al.
Published: (2024)
TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation
by: Li, Ruineng, et al.
Published: (2025)
by: Li, Ruineng, et al.
Published: (2025)
Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video
by: Wang, Yifan, et al.
Published: (2026)
by: Wang, Yifan, et al.
Published: (2026)
ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions
by: Chang, Di, et al.
Published: (2025)
by: Chang, Di, et al.
Published: (2025)
X-Dyna: Expressive Dynamic Human Image Animation
by: Chang, Di, et al.
Published: (2025)
by: Chang, Di, et al.
Published: (2025)
RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control
by: Li, Teng, et al.
Published: (2025)
by: Li, Teng, et al.
Published: (2025)
Multiple Allergens Causing Sofa Dermatitis: A Case Report of Polysensitisation
by: Bethany Neall, et al.
Published: (2025)
by: Bethany Neall, et al.
Published: (2025)
HumanPlus: Humanoid Shadowing and Imitation from Humans
by: Fu, Zipeng, et al.
Published: (2024)
by: Fu, Zipeng, et al.
Published: (2024)
CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
by: He, Hao, et al.
Published: (2025)
by: He, Hao, et al.
Published: (2025)
GaussFusion: Improving 3D Reconstruction in the Wild with A Geometry-Informed Video Generator
by: Zhu, Liyuan, et al.
Published: (2026)
by: Zhu, Liyuan, et al.
Published: (2026)
3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models
by: Zhang, Yuhan, et al.
Published: (2025)
by: Zhang, Yuhan, et al.
Published: (2025)
Unified Camera Positional Encoding for Controlled Video Generation
by: Zhang, Cheng, et al.
Published: (2025)
by: Zhang, Cheng, et al.
Published: (2025)
Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion
by: Deng, Boyang, et al.
Published: (2024)
by: Deng, Boyang, et al.
Published: (2024)
GenDexHand: Generative Simulation for Dexterous Hands
by: Chen, Feng, et al.
Published: (2025)
by: Chen, Feng, et al.
Published: (2025)
Image2Garment: Simulation-ready Garment Generation from a Single Image
by: Can, Selim Emir, et al.
Published: (2026)
by: Can, Selim Emir, et al.
Published: (2026)
Stereo World Model: Camera-Guided Stereo Video Generation
by: Sun, Yang-Tian, et al.
Published: (2026)
by: Sun, Yang-Tian, et al.
Published: (2026)
Similar Items
-
Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control
by: Kuang, Zhengfei, et al.
Published: (2024) -
BulletTime: Decoupled Control of Time and Camera Pose for Video Generation
by: Wang, Yiming, et al.
Published: (2025) -
GeoFlow: Enforcing Implicit Geometric Consistency in Video Generation
by: Ackermann, Jan, et al.
Published: (2026) -
Diffusion Self-Distillation for Zero-Shot Customized Image Generation
by: Cai, Shengqu, et al.
Published: (2024) -
Frame Context Packing and Drift Prevention in Next-Frame-Prediction Video Diffusion Models
by: Zhang, Lvmin, et al.
Published: (2025)