:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Popov, Stefan, Raj, Amit, Krainin, Michael, Li, Yuanzhen, Freeman, William T., Rubinstein, Michael
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2501.06006
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Probing the 3D Awareness of Visual Foundation Models
by: Banani, Mohamed El, et al.
Published: (2024)

3D Congealing: 3D-Aware Image Alignment in the Wild
by: Zhang, Yunzhi, et al.
Published: (2024)

CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
by: He, Hao, et al.
Published: (2025)

CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
by: Xu, Dejia, et al.
Published: (2024)

DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation
by: Zhang, Hongfei, et al.
Published: (2025)

PreciseCam: Precise Camera Control for Text-to-Image Generation
by: Bernal-Berdun, Edurne, et al.
Published: (2025)

WonderWorld: Interactive 3D Scene Generation from a Single Image
by: Yu, Hong-Xing, et al.
Published: (2024)

CameraCtrl: Enabling Camera Control for Text-to-Video Generation
by: He, Hao, et al.
Published: (2024)

ObjCtrl-2.5D: Training-free Object Control with Camera Poses
by: Wang, Zhouxia, et al.
Published: (2024)

DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis
by: Wang, Zixuan, et al.
Published: (2024)

ConfCtrl: Enabling Precise Camera Control in Video Diffusion via Confidence-Aware Interpolation
by: Yang, Liudi, et al.
Published: (2026)

3D Scene Prompting for Scene-Consistent Camera-Controllable Video Generation
by: Lee, JoungBin, et al.
Published: (2025)

Efficient Hybrid Zoom using Camera Fusion on Mobile Phones
by: Wu, Xiaotong, et al.
Published: (2024)

3D Scene-Camera Representation with Joint Camera Photometric Optimization
by: Dai, Weichen, et al.
Published: (2025)

Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints
by: Fang, Chuan, et al.
Published: (2023)

Scene4U: Hierarchical Layered 3D Scene Reconstruction from Single Panoramic Image for Your Immerse Exploration
by: Huang, Zilong, et al.
Published: (2025)

ActCam: Zero-Shot Joint Camera and 3D Motion Control for Video Generation
by: Khalifi, Omar El, et al.
Published: (2026)

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
by: Bai, Jianhong, et al.
Published: (2025)

CC-FMO: Camera-Conditioned Zero-Shot Single Image to 3D Scene Generation with Foundation Model Orchestration
by: Tang, Boshi, et al.
Published: (2025)

CamI2V: Camera-Controlled Image-to-Video Diffusion Model
by: Zheng, Guangcong, et al.
Published: (2024)

LiCamPose: Combining Multi-View LiDAR and RGB Cameras for Robust Single-timestamp 3D Human Pose Estimation
by: Pan, Zhiyu, et al.
Published: (2023)

WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation
by: Nam, Jisu, et al.
Published: (2026)

TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
by: Zeng, Weichao, et al.
Published: (2024)

Ctrl-Z Sampling: Scaling Diffusion Sampling with Controlled Random Zigzag Explorations
by: Mao, Shunqi, et al.
Published: (2025)

ChatCam: Empowering Camera Control through Conversational AI
by: Liu, Xinhang, et al.
Published: (2024)

Improved Single Camera BEV Perception Using Multi-Camera Training
by: Busch, Daniel, et al.
Published: (2024)

EmoCtrl: Controllable Emotional Image Content Generation
by: Yang, Jingyuan, et al.
Published: (2025)

VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
by: Bahmani, Sherwin, et al.
Published: (2024)

Beyond Inpainting: Unleash 3D Understanding for Precise Camera-Controlled Video Generation
by: Chen, Dong-Yu, et al.
Published: (2026)

3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning
by: Yang, Yuncong, et al.
Published: (2024)

REST3D: Reconstructing Physically Stable 3D Scenes from a Single Image
by: Ma, Xiaoxuan, et al.
Published: (2026)

Wonderland: Navigating 3D Scenes from a Single Image
by: Liang, Hanwen, et al.
Published: (2024)

SparseCam4D: Spatio-Temporally Consistent 4D Reconstruction from Sparse Cameras
by: Pan, Weihong, et al.
Published: (2026)

CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images
by: Liu, Jian, et al.
Published: (2024)

Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
by: Li, Xinyang, et al.
Published: (2024)

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation
by: Cao, Chenjie, et al.
Published: (2025)

TrafficLoc: Localizing Traffic Surveillance Cameras in 3D Scenes
by: Xia, Yan, et al.
Published: (2024)

GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
by: Ren, Xuanchi, et al.
Published: (2025)

LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D Detection
by: Hung, Wei-Chih, et al.
Published: (2022)

Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering
by: Chen, Jieying, et al.
Published: (2026)