:: Library Catalog

Bibliographic Details
Main Authors:	Zhang, Yisu; Cao, Chenjie; Wang, Tengfei; Zuo, Xuhui; Wu, Junta; Zhu, Jianke; Guo, Chunchao
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2603.02049

Similar Items

LiON-LoRA: Rethinking LoRA Fusion to Unify Controllable Spatial and Temporal Generation for Video Diffusion
by: Zhang, Yisu, et al.
Published: (2025)

WorldCompass: Reinforcement Learning for Long-Horizon World Models
by: Wang, Zehan, et al.
Published: (2026)

Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation
by: Huang, Tianyu, et al.
Published: (2025)

WorldMirror: Universal 3D World Reconstruction with Any-Prior Prompting
by: Liu, Yifan, et al.
Published: (2025)

Stereo World Model: Camera-Guided Stereo Video Generation
by: Sun, Yang-Tian, et al.
Published: (2026)

FlashWorld: High-quality 3D Scene Generation within Seconds
by: Li, Xinyang, et al.
Published: (2025)

WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling
by: Sun, Wenqiang, et al.
Published: (2025)

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
by: HY-World, Team, et al.
Published: (2026)

Pathwise Test-Time Correction for Autoregressive Long Video Generation
by: Xiang, Xunzhi, et al.
Published: (2026)

MoCA: Mixture-of-Components Attention for Scalable Compositional 3D Generation
by: Li, Zhiqi, et al.
Published: (2025)

MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo
by: Cao, Chenjie, et al.
Published: (2024)

Pyramid Deep Fusion Network for Two-Hand Reconstruction from RGB-D Images
by: Ren, Jinwei, et al.
Published: (2023)

R-DMesh: Video-Guided 3D Animation via Rectified Dynamic Mesh Flow
by: Wu, Zijie, et al.
Published: (2026)

Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
by: Jiang, Zeren, et al.
Published: (2025)

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation
by: Xing, Ke, et al.
Published: (2025)

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation
by: Cao, Chenjie, et al.
Published: (2025)

3D Scene Prompting for Scene-Consistent Camera-Controllable Video Generation
by: Lee, JoungBin, et al.
Published: (2025)

ActMVS: Active Scene Reconstruction with Monocular Multi-View Stereo
by: Pu, Guo, et al.
Published: (2026)

Multispectral Stereo-Image Fusion for 3D Hyperspectral Scene Reconstruction
by: Wisotzky, Eric L., et al.
Published: (2023)

From Visual Synthesis to Interactive Worlds: Toward Production-Ready 3D Asset Generation
by: Wu, Jiafeng, et al.
Published: (2026)

4D Driving Scene Generation With Stereo Forcing
by: Lu, Hao, et al.
Published: (2025)

3DPhysVideo: Consistency-Guided Flow SDE for Video Generation via 3D Scene Reconstruction and Physical Simulation
by: Kim, Hwidong, et al.
Published: (2026)

HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes
by: Li, Zhuopeng, et al.
Published: (2024)

Observation Modeling of Reference–Background Residuals in Single-Snapshot FDA-MIMO-GPR
by: Yan, Yisu, et al.
Published: (2026)

Linking Dispersive-Medium Uncertainty to Clutter Analysis in Single-Snapshot FDA-MIMO-GPR
by: Yan, Yisu, et al.
Published: (2026)

Medium-Induced Cross-Frequency Clutter Structure in Single-Snapshot FDA-MIMO-GPR With a Weak-Dispersion Criterion
by: Yan, Yisu, et al.
Published: (2026)

Weak-Fluctuation-Induced Clutter Covariance and Subspace Structure in Single-Snapshot FDA-MIMO GPR
by: Yan, Yisu, et al.
Published: (2026)

Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning
by: Yu, Hanxun, et al.
Published: (2025)

RGB-Phase Speckle: Cross-Scene Stereo 3D Reconstruction via Wrapped Pre-Normalization
by: Yang, Kai, et al.
Published: (2025)

SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer
by: Wu, Zijie, et al.
Published: (2024)

MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing
by: Cao, Chenjie, et al.
Published: (2024)

HVOFusion: Incremental Mesh Reconstruction Using Hybrid Voxel Octree
by: Liu, Shaofan, et al.
Published: (2024)

SAM4D: Segment Anything in Camera and LiDAR Streams
by: Xu, Jianyun, et al.
Published: (2025)

IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control
by: Liu, Lijuan, et al.
Published: (2025)

UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction for Autonomous Driving
by: Min, Chen, et al.
Published: (2023)

Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering
by: Chen, Jieying, et al.
Published: (2026)

RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space
by: Liang, Jingyun, et al.
Published: (2025)

Guided MRI Reconstruction via Schrödinger Bridge
by: Wang, Yue, et al.
Published: (2024)

SDGE: Stereo Guided Depth Estimation for 360° Camera Sets
by: Xu, Jialei, et al.
Published: (2024)

Learning Dynamic Scene Reconstruction with Sinusoidal Geometric Priors
by: Guo, Tian, et al.
Published: (2025)