:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Qixuan, Wang, Chao, He, Zongjin, Peng, Yan
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.00708
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

LAYOUTDREAMER: Physics-guided Layout for Text-to-3D Compositional Scene Generation
by: Zhou, Yang, et al.
Published: (2025)

PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI
by: Yang, Yandan, et al.
Published: (2024)

PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation
by: Xue, Qiyao, et al.
Published: (2024)

PhyPrompt: RL-based Prompt Refinement for Physically Plausible Text-to-Video Generation
by: Wu, Shang, et al.
Published: (2026)

ReplaceAnything3D:Text-Guided 3D Scene Editing with Compositional Neural Radiance Fields
by: Bartrum, Edward, et al.
Published: (2024)

PhyVLLM: Physics-Guided Video Language Model with Motion-Appearance Disentanglement
by: Zhan, Yu-Wei, et al.
Published: (2025)

Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane
by: Yan, Han, et al.
Published: (2024)

PhyMotion: Structured 3D Motion Reward for Physics-Grounded Human Video Generation
by: Huang, Yidong, et al.
Published: (2026)

"PhyWorldBench": A Comprehensive Evaluation of Physical Realism in Text-to-Video Models
by: Gu, Jing, et al.
Published: (2025)

3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation
by: Zhang, Frank, et al.
Published: (2024)

PhyGile: Physics-Prefix Guided Motion Generation for Agile General Humanoid Motion Tracking
by: Bao, Jiacheng, et al.
Published: (2026)

DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
by: Zhou, Shijie, et al.
Published: (2024)

PaintScene4D: Consistent 4D Scene Generation from Text Prompts
by: Gupta, Vinayak, et al.
Published: (2024)

PhyGenHOI: Physically-Aware 4D Generation of Dynamic Human-Object Interactions
by: Benishu, Omer, et al.
Published: (2026)

TextMamba: Scene Text Detector with Mamba
by: Zhao, Qiyan, et al.
Published: (2025)

GenXD: Generating Any 3D and 4D Scenes
by: Zhao, Yuyang, et al.
Published: (2024)

PhyGround: Benchmarking Physical Reasoning in Generative World Models
by: Lin, Juyi, et al.
Published: (2026)

Text-Scene: A Scene-to-Language Parsing Framework for 3D Scene Understanding
by: Li, Haoyuan, et al.
Published: (2025)

PhyDrawGen: Physically Grounded Diagram Generation from Natural Language
by: Haque, Nafiul, et al.
Published: (2026)

VideoPhy: Evaluating Physical Commonsense for Video Generation
by: Bansal, Hritik, et al.
Published: (2024)

From Scene to Object: Text-Guided Dual-Gaze Prediction
by: Ke, Zehong, et al.
Published: (2026)

Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering
by: Wang, Xingrui, et al.
Published: (2024)

PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly
by: Ma, Liang, et al.
Published: (2025)

PhyCo: Learning Controllable Physical Priors for Generative Motion
by: Narayanan, Sriram, et al.
Published: (2026)

4D Panoptic Scene Graph Generation
by: Yang, Jingkang, et al.
Published: (2024)

PhyTracker: An Online Tracker for Phytoplankton
by: Yu, Yang, et al.
Published: (2024)

SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model
by: Shi, Yukai, et al.
Published: (2025)

RelaxFlow: Text-Driven Amodal 3D Generation
by: Zhu, Jiayin, et al.
Published: (2026)

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards
by: Ye, Junyan, et al.
Published: (2025)

PhyWorld: Physics-Faithful World Model for Video Generation
by: Zhao, Pu, et al.
Published: (2026)

BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations
by: Feng, Weixi, et al.
Published: (2025)

InteractMove: Text-Controlled Human-Object Interaction Generation in 3D Scenes with Movable Objects
by: Cai, Xinhao, et al.
Published: (2025)

3DPhysVideo: Consistency-Guided Flow SDE for Video Generation via 3D Scene Reconstruction and Physical Simulation
by: Kim, Hwidong, et al.
Published: (2026)

LatentEditor: Text Driven Local Editing of 3D Scenes
by: Khalid, Umar, et al.
Published: (2023)

Open-Vocabulary Functional 3D Human-Scene Interaction Generation
by: Liu, Jie, et al.
Published: (2026)

Evaluating Compositional Scene Understanding in Multimodal Generative Models
by: Fu, Shuhao, et al.
Published: (2025)

SceneX: Procedural Controllable Large-scale Scene Generation
by: Zhou, Mengqi, et al.
Published: (2024)

PhyCAGE: Physically Plausible Compositional 3D Asset Generation from a Single Image
by: Yan, Han, et al.
Published: (2024)

FurniScene: A Large-scale 3D Room Dataset with Intricate Furnishing Scenes
by: Zhang, Genghao, et al.
Published: (2024)

RoomCraft: Controllable and Complete 3D Indoor Scene Generation
by: Zhou, Mengqi, et al.
Published: (2025)