:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yuan, Yu, Yuan, Jianhao, Wang, Xijun, Li, Daiqing, He, Liu, Ling, Lu, Chan, Stanley H.
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2606.00499
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics
by: Yuan, Yu, et al.
Published: (2025)

SeeU: Seeing the Unseen World via 4D Dynamics-aware Generation
by: Yuan, Yu, et al.
Published: (2025)

Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis
by: Yuan, Yu, et al.
Published: (2024)

Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation
by: Zhang, Xingguang, et al.
Published: (2025)

WorldSimBench: Towards Video Generation Models as World Simulators
by: Qin, Yiran, et al.
Published: (2024)

Progressive Image Restoration via Text-Conditioned Video Generation
by: Kang, Peng, et al.
Published: (2025)

Astrophotography turbulence mitigation via generative models
by: Kim, Joonyeoup, et al.
Published: (2025)

Inference-time Physics Alignment of Video Generative Models with Latent World Models
by: Yuan, Jianhao, et al.
Published: (2026)

Rethinking Position Embedding as a Context Controller for Multi-Reference and Multi-Shot Video Generation
by: Huang, Binyuan, et al.
Published: (2026)

Personalized Generative Low-light Image Denoising and Enhancement
by: Wang, Xijun, et al.
Published: (2024)

WorldWander: Bridging Egocentric and Exocentric Worlds in Video Generation
by: Song, Quanjian, et al.
Published: (2025)

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
by: Wang, Weijie, et al.
Published: (2026)

EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory
by: Wang, Jiahao, et al.
Published: (2025)

Thinking with Spatial Code for Physical-World Video Reasoning
by: Chen, Jieneng, et al.
Published: (2026)

Pre-Trained Video Generative Models as World Simulators
by: He, Haoran, et al.
Published: (2025)

ALIVE: Animate Your World with Lifelike Audio-Video Generation
by: Guo, Ying, et al.
Published: (2026)

PhyWorld: Physics-Faithful World Model for Video Generation
by: Zhao, Pu, et al.
Published: (2026)

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
by: Meng, Fanqing, et al.
Published: (2024)

WorldCraft: From Camera Navigation to Object Manipulation in Interactive Video World Models
by: Gu, Bohai, et al.
Published: (2026)

Blended Latent Diffusion under Attention Control for Real-World Video Editing
by: Liu, Deyin, et al.
Published: (2024)

LiveWorld: Simulating Out-of-Sight Dynamics in Generative Video World Models
by: Duan, Zicheng, et al.
Published: (2026)

PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation
by: Wang, Chen, et al.
Published: (2025)

3D4D: An Interactive, Editable, 4D World Model via 3D Video Generation
by: He, Yunhong, et al.
Published: (2025)

NuiWorld: Exploring a Scalable Framework for End-to-End Controllable World Generation
by: Lee, Han-Hung, et al.
Published: (2026)

TRELLISWorld: Training-Free World Generation from Object Generators
by: Chen, Hanke, et al.
Published: (2025)

UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation
by: Huang, Jiehui, et al.
Published: (2025)

WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
by: Wang, Xiaofeng, et al.
Published: (2024)

VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion
by: Yang, Lehan, et al.
Published: (2025)

Toward Physically Consistent Driving Video World Models under Challenging Trajectories
by: Zhou, Jiawei, et al.
Published: (2026)

DreamWorld: Unified World Modeling in Video Generation
by: Tan, Boming, et al.
Published: (2026)

GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
by: Ren, Xuanchi, et al.
Published: (2025)

Dreamland: Controllable World Creation with Simulator and Generative Models
by: Mo, Sicheng, et al.
Published: (2025)

MultiWorld: Scalable Multi-Agent Multi-View Video World Models
by: Wu, Haoyu, et al.
Published: (2026)

What-If World: A Causal Benchmark for General World Models in Embodied Scenarios
by: Cai, Kunlin, et al.
Published: (2026)

WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation
by: Wang, Jing, et al.
Published: (2025)

WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception
by: Liu, Zhiheng, et al.
Published: (2025)

Seedance 2.0: Advancing Video Generation for World Complexity
by: Seedance, Team, et al.
Published: (2026)

BridgeV2W: Bridging Video Generation Models to Embodied World Models via Embodiment Masks
by: Chen, Yixiang, et al.
Published: (2026)

ShareVerse: Multi-Agent Consistent Video Generation for Shared World Modeling
by: Zhu, Jiayi, et al.
Published: (2026)

Owl-1: Omni World Model for Consistent Long Video Generation
by: Huang, Yuanhui, et al.
Published: (2024)