| Main Authors: | Zhang, Yisu, Cao, Chenjie, Wang, Tengfei, Zuo, Xuhui, Wu, Junta, Zhu, Jianke, Guo, Chunchao |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.02049 |
Similar Items
LiON-LoRA: Rethinking LoRA Fusion to Unify Controllable Spatial and Temporal Generation for Video Diffusion
by: Zhang, Yisu, et al.
Published: (2025)
WorldCompass: Reinforcement Learning for Long-Horizon World Models
by: Wang, Zehan, et al.
Published: (2026)
Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation
by: Huang, Tianyu, et al.
Published: (2025)
WorldMirror: Universal 3D World Reconstruction with Any-Prior Prompting
by: Liu, Yifan, et al.
Published: (2025)
Stereo World Model: Camera-Guided Stereo Video Generation
by: Sun, Yang-Tian, et al.
Published: (2026)
FlashWorld: High-quality 3D Scene Generation within Seconds
by: Li, Xinyang, et al.
Published: (2025)
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling
by: Sun, Wenqiang, et al.
Published: (2025)
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
by: HY-World, Team, et al.
Published: (2026)
Pathwise Test-Time Correction for Autoregressive Long Video Generation
by: Xiang, Xunzhi, et al.
Published: (2026)
MoCA: Mixture-of-Components Attention for Scalable Compositional 3D Generation
by: Li, Zhiqi, et al.
Published: (2025)
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo
by: Cao, Chenjie, et al.
Published: (2024)
Pyramid Deep Fusion Network for Two-Hand Reconstruction from RGB-D Images
by: Ren, Jinwei, et al.
Published: (2023)
R-DMesh: Video-Guided 3D Animation via Rectified Dynamic Mesh Flow
by: Wu, Zijie, et al.
Published: (2026)
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
by: Jiang, Zeren, et al.
Published: (2025)
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation
by: Xing, Ke, et al.
Published: (2025)
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation
by: Cao, Chenjie, et al.
Published: (2025)
3D Scene Prompting for Scene-Consistent Camera-Controllable Video Generation
by: Lee, JoungBin, et al.
Published: (2025)
ActMVS: Active Scene Reconstruction with Monocular Multi-View Stereo
by: Pu, Guo, et al.
Published: (2026)
Multispectral Stereo-Image Fusion for 3D Hyperspectral Scene Reconstruction
by: Wisotzky, Eric L., et al.
Published: (2023)
From Visual Synthesis to Interactive Worlds: Toward Production-Ready 3D Asset Generation
by: Wu, Jiafeng, et al.
Published: (2026)
4D Driving Scene Generation With Stereo Forcing
by: Lu, Hao, et al.
Published: (2025)
3DPhysVideo: Consistency-Guided Flow SDE for Video Generation via 3D Scene Reconstruction and Physical Simulation
by: Kim, Hwidong, et al.
Published: (2026)
HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes
by: Li, Zhuopeng, et al.
Published: (2024)
Observation Modeling of Reference–Background Residuals in Single-Snapshot FDA-MIMO-GPR
by: Yan, Yisu, et al.
Published: (2026)
Linking Dispersive-Medium Uncertainty to Clutter Analysis in Single-Snapshot FDA-MIMO-GPR
by: Yan, Yisu, et al.
Published: (2026)
Medium-Induced Cross-Frequency Clutter Structure in Single-Snapshot FDA-MIMO-GPR With a Weak-Dispersion Criterion
by: Yan, Yisu, et al.
Published: (2026)
Weak-Fluctuation-Induced Clutter Covariance and Subspace Structure in Single-Snapshot FDA-MIMO GPR
by: Yan, Yisu, et al.
Published: (2026)
Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning
by: Yu, Hanxun, et al.
Published: (2025)
RGB-Phase Speckle: Cross-Scene Stereo 3D Reconstruction via Wrapped Pre-Normalization
by: Yang, Kai, et al.
Published: (2025)
SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer
by: Wu, Zijie, et al.
Published: (2024)
MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing
by: Cao, Chenjie, et al.
Published: (2024)
HVOFusion: Incremental Mesh Reconstruction Using Hybrid Voxel Octree
by: Liu, Shaofan, et al.
Published: (2024)
SAM4D: Segment Anything in Camera and LiDAR Streams
by: Xu, Jianyun, et al.
Published: (2025)
IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control
by: Liu, Lijuan, et al.
Published: (2025)
UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction for Autonomous Driving
by: Min, Chen, et al.
Published: (2023)
Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering
by: Chen, Jieying, et al.
Published: (2026)
RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space
by: Liang, Jingyun, et al.
Published: (2025)
Guided MRI Reconstruction via Schrödinger Bridge
by: Wang, Yue, et al.
Published: (2024)
SDGE: Stereo Guided Depth Estimation for 360° Camera Sets
by: Xu, Jialei, et al.
Published: (2024)
Learning Dynamic Scene Reconstruction with Sinusoidal Geometric Priors
by: Guo, Tian, et al.
Published: (2025)