Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Liu, Yuheng, Li, Xinke, Zhang, Yuning, Qi, Lu, Li, Xin, Wang, Wenping, Li, Chongshou, Li, Xueting, Yang, Ming-Hsuan
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2503.07152
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912267564482560
author	Liu, Yuheng Li, Xinke Zhang, Yuning Qi, Lu Li, Xin Wang, Wenping Li, Chongshou Li, Xueting Yang, Ming-Hsuan
author_facet	Liu, Yuheng Li, Xinke Zhang, Yuning Qi, Lu Li, Xin Wang, Wenping Li, Chongshou Li, Xueting Yang, Ming-Hsuan
contents	Three-dimensional scene generation is crucial in computer vision, with applications spanning autonomous driving, gaming and the metaverse. Current methods either lack user control or rely on imprecise, non-intuitive conditions. In this work, we propose a method that uses, scene graphs, an accessible, user friendly control format to generate outdoor 3D scenes. We develop an interactive system that transforms a sparse scene graph into a dense BEV (Bird's Eye View) Embedding Map, which guides a conditional diffusion model to generate 3D scenes that match the scene graph description. During inference, users can easily create or modify scene graphs to generate large-scale outdoor scenes. We create a large-scale dataset with paired scene graphs and 3D semantic scenes to train the BEV embedding and diffusion models. Experimental results show that our approach consistently produces high-quality 3D urban scenes closely aligned with the input scene graphs. To the best of our knowledge, this is the first approach to generate 3D outdoor scenes conditioned on scene graphs.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_07152
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Controllable 3D Outdoor Scene Generation via Scene Graphs Liu, Yuheng Li, Xinke Zhang, Yuning Qi, Lu Li, Xin Wang, Wenping Li, Chongshou Li, Xueting Yang, Ming-Hsuan Computer Vision and Pattern Recognition Three-dimensional scene generation is crucial in computer vision, with applications spanning autonomous driving, gaming and the metaverse. Current methods either lack user control or rely on imprecise, non-intuitive conditions. In this work, we propose a method that uses, scene graphs, an accessible, user friendly control format to generate outdoor 3D scenes. We develop an interactive system that transforms a sparse scene graph into a dense BEV (Bird's Eye View) Embedding Map, which guides a conditional diffusion model to generate 3D scenes that match the scene graph description. During inference, users can easily create or modify scene graphs to generate large-scale outdoor scenes. We create a large-scale dataset with paired scene graphs and 3D semantic scenes to train the BEV embedding and diffusion models. Experimental results show that our approach consistently produces high-quality 3D urban scenes closely aligned with the input scene graphs. To the best of our knowledge, this is the first approach to generate 3D outdoor scenes conditioned on scene graphs.
title	Controllable 3D Outdoor Scene Generation via Scene Graphs
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2503.07152

Similar Items