Saved in:
| Main Authors: | Yuan, Yu, Yuan, Jianhao, Wang, Xijun, Li, Daiqing, He, Liu, Ling, Lu, Chan, Stanley H. |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2606.00499 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics
by: Yuan, Yu, et al.
Published: (2025)
by: Yuan, Yu, et al.
Published: (2025)
SeeU: Seeing the Unseen World via 4D Dynamics-aware Generation
by: Yuan, Yu, et al.
Published: (2025)
by: Yuan, Yu, et al.
Published: (2025)
Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis
by: Yuan, Yu, et al.
Published: (2024)
by: Yuan, Yu, et al.
Published: (2024)
Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation
by: Zhang, Xingguang, et al.
Published: (2025)
by: Zhang, Xingguang, et al.
Published: (2025)
WorldSimBench: Towards Video Generation Models as World Simulators
by: Qin, Yiran, et al.
Published: (2024)
by: Qin, Yiran, et al.
Published: (2024)
Progressive Image Restoration via Text-Conditioned Video Generation
by: Kang, Peng, et al.
Published: (2025)
by: Kang, Peng, et al.
Published: (2025)
Astrophotography turbulence mitigation via generative models
by: Kim, Joonyeoup, et al.
Published: (2025)
by: Kim, Joonyeoup, et al.
Published: (2025)
Inference-time Physics Alignment of Video Generative Models with Latent World Models
by: Yuan, Jianhao, et al.
Published: (2026)
by: Yuan, Jianhao, et al.
Published: (2026)
Rethinking Position Embedding as a Context Controller for Multi-Reference and Multi-Shot Video Generation
by: Huang, Binyuan, et al.
Published: (2026)
by: Huang, Binyuan, et al.
Published: (2026)
Personalized Generative Low-light Image Denoising and Enhancement
by: Wang, Xijun, et al.
Published: (2024)
by: Wang, Xijun, et al.
Published: (2024)
WorldWander: Bridging Egocentric and Exocentric Worlds in Video Generation
by: Song, Quanjian, et al.
Published: (2025)
by: Song, Quanjian, et al.
Published: (2025)
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
by: Wang, Weijie, et al.
Published: (2026)
by: Wang, Weijie, et al.
Published: (2026)
EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory
by: Wang, Jiahao, et al.
Published: (2025)
by: Wang, Jiahao, et al.
Published: (2025)
Thinking with Spatial Code for Physical-World Video Reasoning
by: Chen, Jieneng, et al.
Published: (2026)
by: Chen, Jieneng, et al.
Published: (2026)
Pre-Trained Video Generative Models as World Simulators
by: He, Haoran, et al.
Published: (2025)
by: He, Haoran, et al.
Published: (2025)
ALIVE: Animate Your World with Lifelike Audio-Video Generation
by: Guo, Ying, et al.
Published: (2026)
by: Guo, Ying, et al.
Published: (2026)
PhyWorld: Physics-Faithful World Model for Video Generation
by: Zhao, Pu, et al.
Published: (2026)
by: Zhao, Pu, et al.
Published: (2026)
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
by: Meng, Fanqing, et al.
Published: (2024)
by: Meng, Fanqing, et al.
Published: (2024)
WorldCraft: From Camera Navigation to Object Manipulation in Interactive Video World Models
by: Gu, Bohai, et al.
Published: (2026)
by: Gu, Bohai, et al.
Published: (2026)
Blended Latent Diffusion under Attention Control for Real-World Video Editing
by: Liu, Deyin, et al.
Published: (2024)
by: Liu, Deyin, et al.
Published: (2024)
LiveWorld: Simulating Out-of-Sight Dynamics in Generative Video World Models
by: Duan, Zicheng, et al.
Published: (2026)
by: Duan, Zicheng, et al.
Published: (2026)
PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation
by: Wang, Chen, et al.
Published: (2025)
by: Wang, Chen, et al.
Published: (2025)
3D4D: An Interactive, Editable, 4D World Model via 3D Video Generation
by: He, Yunhong, et al.
Published: (2025)
by: He, Yunhong, et al.
Published: (2025)
NuiWorld: Exploring a Scalable Framework for End-to-End Controllable World Generation
by: Lee, Han-Hung, et al.
Published: (2026)
by: Lee, Han-Hung, et al.
Published: (2026)
TRELLISWorld: Training-Free World Generation from Object Generators
by: Chen, Hanke, et al.
Published: (2025)
by: Chen, Hanke, et al.
Published: (2025)
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation
by: Huang, Jiehui, et al.
Published: (2025)
by: Huang, Jiehui, et al.
Published: (2025)
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
by: Wang, Xiaofeng, et al.
Published: (2024)
by: Wang, Xiaofeng, et al.
Published: (2024)
VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion
by: Yang, Lehan, et al.
Published: (2025)
by: Yang, Lehan, et al.
Published: (2025)
Toward Physically Consistent Driving Video World Models under Challenging Trajectories
by: Zhou, Jiawei, et al.
Published: (2026)
by: Zhou, Jiawei, et al.
Published: (2026)
DreamWorld: Unified World Modeling in Video Generation
by: Tan, Boming, et al.
Published: (2026)
by: Tan, Boming, et al.
Published: (2026)
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
by: Ren, Xuanchi, et al.
Published: (2025)
by: Ren, Xuanchi, et al.
Published: (2025)
Dreamland: Controllable World Creation with Simulator and Generative Models
by: Mo, Sicheng, et al.
Published: (2025)
by: Mo, Sicheng, et al.
Published: (2025)
MultiWorld: Scalable Multi-Agent Multi-View Video World Models
by: Wu, Haoyu, et al.
Published: (2026)
by: Wu, Haoyu, et al.
Published: (2026)
What-If World: A Causal Benchmark for General World Models in Embodied Scenarios
by: Cai, Kunlin, et al.
Published: (2026)
by: Cai, Kunlin, et al.
Published: (2026)
WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation
by: Wang, Jing, et al.
Published: (2025)
by: Wang, Jing, et al.
Published: (2025)
WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception
by: Liu, Zhiheng, et al.
Published: (2025)
by: Liu, Zhiheng, et al.
Published: (2025)
Seedance 2.0: Advancing Video Generation for World Complexity
by: Seedance, Team, et al.
Published: (2026)
by: Seedance, Team, et al.
Published: (2026)
BridgeV2W: Bridging Video Generation Models to Embodied World Models via Embodiment Masks
by: Chen, Yixiang, et al.
Published: (2026)
by: Chen, Yixiang, et al.
Published: (2026)
ShareVerse: Multi-Agent Consistent Video Generation for Shared World Modeling
by: Zhu, Jiayi, et al.
Published: (2026)
by: Zhu, Jiayi, et al.
Published: (2026)
Owl-1: Omni World Model for Consistent Long Video Generation
by: Huang, Yuanhui, et al.
Published: (2024)
by: Huang, Yuanhui, et al.
Published: (2024)
Similar Items
-
NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics
by: Yuan, Yu, et al.
Published: (2025) -
SeeU: Seeing the Unseen World via 4D Dynamics-aware Generation
by: Yuan, Yu, et al.
Published: (2025) -
Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis
by: Yuan, Yu, et al.
Published: (2024) -
Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation
by: Zhang, Xingguang, et al.
Published: (2025) -
WorldSimBench: Towards Video Generation Models as World Simulators
by: Qin, Yiran, et al.
Published: (2024)