Saved in:
| Main Authors: | Popov, Stefan, Raj, Amit, Krainin, Michael, Li, Yuanzhen, Freeman, William T., Rubinstein, Michael |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.06006 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Probing the 3D Awareness of Visual Foundation Models
by: Banani, Mohamed El, et al.
Published: (2024)
by: Banani, Mohamed El, et al.
Published: (2024)
3D Congealing: 3D-Aware Image Alignment in the Wild
by: Zhang, Yunzhi, et al.
Published: (2024)
by: Zhang, Yunzhi, et al.
Published: (2024)
CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
by: He, Hao, et al.
Published: (2025)
by: He, Hao, et al.
Published: (2025)
CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
by: Xu, Dejia, et al.
Published: (2024)
by: Xu, Dejia, et al.
Published: (2024)
DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation
by: Zhang, Hongfei, et al.
Published: (2025)
by: Zhang, Hongfei, et al.
Published: (2025)
PreciseCam: Precise Camera Control for Text-to-Image Generation
by: Bernal-Berdun, Edurne, et al.
Published: (2025)
by: Bernal-Berdun, Edurne, et al.
Published: (2025)
WonderWorld: Interactive 3D Scene Generation from a Single Image
by: Yu, Hong-Xing, et al.
Published: (2024)
by: Yu, Hong-Xing, et al.
Published: (2024)
CameraCtrl: Enabling Camera Control for Text-to-Video Generation
by: He, Hao, et al.
Published: (2024)
by: He, Hao, et al.
Published: (2024)
ObjCtrl-2.5D: Training-free Object Control with Camera Poses
by: Wang, Zhouxia, et al.
Published: (2024)
by: Wang, Zhouxia, et al.
Published: (2024)
DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis
by: Wang, Zixuan, et al.
Published: (2024)
by: Wang, Zixuan, et al.
Published: (2024)
ConfCtrl: Enabling Precise Camera Control in Video Diffusion via Confidence-Aware Interpolation
by: Yang, Liudi, et al.
Published: (2026)
by: Yang, Liudi, et al.
Published: (2026)
3D Scene Prompting for Scene-Consistent Camera-Controllable Video Generation
by: Lee, JoungBin, et al.
Published: (2025)
by: Lee, JoungBin, et al.
Published: (2025)
Efficient Hybrid Zoom using Camera Fusion on Mobile Phones
by: Wu, Xiaotong, et al.
Published: (2024)
by: Wu, Xiaotong, et al.
Published: (2024)
3D Scene-Camera Representation with Joint Camera Photometric Optimization
by: Dai, Weichen, et al.
Published: (2025)
by: Dai, Weichen, et al.
Published: (2025)
Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints
by: Fang, Chuan, et al.
Published: (2023)
by: Fang, Chuan, et al.
Published: (2023)
Scene4U: Hierarchical Layered 3D Scene Reconstruction from Single Panoramic Image for Your Immerse Exploration
by: Huang, Zilong, et al.
Published: (2025)
by: Huang, Zilong, et al.
Published: (2025)
ActCam: Zero-Shot Joint Camera and 3D Motion Control for Video Generation
by: Khalifi, Omar El, et al.
Published: (2026)
by: Khalifi, Omar El, et al.
Published: (2026)
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
by: Bai, Jianhong, et al.
Published: (2025)
by: Bai, Jianhong, et al.
Published: (2025)
CC-FMO: Camera-Conditioned Zero-Shot Single Image to 3D Scene Generation with Foundation Model Orchestration
by: Tang, Boshi, et al.
Published: (2025)
by: Tang, Boshi, et al.
Published: (2025)
CamI2V: Camera-Controlled Image-to-Video Diffusion Model
by: Zheng, Guangcong, et al.
Published: (2024)
by: Zheng, Guangcong, et al.
Published: (2024)
LiCamPose: Combining Multi-View LiDAR and RGB Cameras for Robust Single-timestamp 3D Human Pose Estimation
by: Pan, Zhiyu, et al.
Published: (2023)
by: Pan, Zhiyu, et al.
Published: (2023)
WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation
by: Nam, Jisu, et al.
Published: (2026)
by: Nam, Jisu, et al.
Published: (2026)
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
by: Zeng, Weichao, et al.
Published: (2024)
by: Zeng, Weichao, et al.
Published: (2024)
Ctrl-Z Sampling: Scaling Diffusion Sampling with Controlled Random Zigzag Explorations
by: Mao, Shunqi, et al.
Published: (2025)
by: Mao, Shunqi, et al.
Published: (2025)
ChatCam: Empowering Camera Control through Conversational AI
by: Liu, Xinhang, et al.
Published: (2024)
by: Liu, Xinhang, et al.
Published: (2024)
Improved Single Camera BEV Perception Using Multi-Camera Training
by: Busch, Daniel, et al.
Published: (2024)
by: Busch, Daniel, et al.
Published: (2024)
EmoCtrl: Controllable Emotional Image Content Generation
by: Yang, Jingyuan, et al.
Published: (2025)
by: Yang, Jingyuan, et al.
Published: (2025)
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
by: Bahmani, Sherwin, et al.
Published: (2024)
by: Bahmani, Sherwin, et al.
Published: (2024)
Beyond Inpainting: Unleash 3D Understanding for Precise Camera-Controlled Video Generation
by: Chen, Dong-Yu, et al.
Published: (2026)
by: Chen, Dong-Yu, et al.
Published: (2026)
3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning
by: Yang, Yuncong, et al.
Published: (2024)
by: Yang, Yuncong, et al.
Published: (2024)
REST3D: Reconstructing Physically Stable 3D Scenes from a Single Image
by: Ma, Xiaoxuan, et al.
Published: (2026)
by: Ma, Xiaoxuan, et al.
Published: (2026)
Wonderland: Navigating 3D Scenes from a Single Image
by: Liang, Hanwen, et al.
Published: (2024)
by: Liang, Hanwen, et al.
Published: (2024)
SparseCam4D: Spatio-Temporally Consistent 4D Reconstruction from Sparse Cameras
by: Pan, Weihong, et al.
Published: (2026)
by: Pan, Weihong, et al.
Published: (2026)
CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images
by: Liu, Jian, et al.
Published: (2024)
by: Liu, Jian, et al.
Published: (2024)
Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
by: Li, Xinyang, et al.
Published: (2024)
by: Li, Xinyang, et al.
Published: (2024)
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation
by: Cao, Chenjie, et al.
Published: (2025)
by: Cao, Chenjie, et al.
Published: (2025)
TrafficLoc: Localizing Traffic Surveillance Cameras in 3D Scenes
by: Xia, Yan, et al.
Published: (2024)
by: Xia, Yan, et al.
Published: (2024)
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
by: Ren, Xuanchi, et al.
Published: (2025)
by: Ren, Xuanchi, et al.
Published: (2025)
LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D Detection
by: Hung, Wei-Chih, et al.
Published: (2022)
by: Hung, Wei-Chih, et al.
Published: (2022)
Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering
by: Chen, Jieying, et al.
Published: (2026)
by: Chen, Jieying, et al.
Published: (2026)
Similar Items
-
Probing the 3D Awareness of Visual Foundation Models
by: Banani, Mohamed El, et al.
Published: (2024) -
3D Congealing: 3D-Aware Image Alignment in the Wild
by: Zhang, Yunzhi, et al.
Published: (2024) -
CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
by: He, Hao, et al.
Published: (2025) -
CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
by: Xu, Dejia, et al.
Published: (2024) -
DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation
by: Zhang, Hongfei, et al.
Published: (2025)