Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Hu, Tao, Ge, Wenhang, Zhao, Yuyang, Lee, Gim Hee
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2404.14329
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916268859195392
author	Hu, Tao Ge, Wenhang Zhao, Yuyang Lee, Gim Hee
author_facet	Hu, Tao Ge, Wenhang Zhao, Yuyang Lee, Gim Hee
contents	We introduce X-Ray, a novel 3D sequential representation inspired by the penetrability of x-ray scans. X-Ray transforms a 3D object into a series of surface frames at different layers, making it suitable for generating 3D models from images. Our method utilizes ray casting from the camera center to capture geometric and textured details, including depth, normal, and color, across all intersected surfaces. This process efficiently condenses the whole 3D object into a multi-frame video format, motivating the utilize of a network architecture similar to those in video diffusion models. This design ensures an efficient 3D representation by focusing solely on surface information. Also, we propose a two-stage pipeline to generate 3D objects from X-Ray Diffusion Model and Upsampler. We demonstrate the practicality and adaptability of our X-Ray representation by synthesizing the complete visible and hidden surfaces of a 3D object from a single input image. Experimental results reveal the state-of-the-art superiority of our representation in enhancing the accuracy of 3D generation, paving the way for new 3D representation research and practical applications.
format	Preprint
id	arxiv_https___arxiv_org_abs_2404_14329
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	X-Ray: A Sequential 3D Representation For Generation Hu, Tao Ge, Wenhang Zhao, Yuyang Lee, Gim Hee Computer Vision and Pattern Recognition We introduce X-Ray, a novel 3D sequential representation inspired by the penetrability of x-ray scans. X-Ray transforms a 3D object into a series of surface frames at different layers, making it suitable for generating 3D models from images. Our method utilizes ray casting from the camera center to capture geometric and textured details, including depth, normal, and color, across all intersected surfaces. This process efficiently condenses the whole 3D object into a multi-frame video format, motivating the utilize of a network architecture similar to those in video diffusion models. This design ensures an efficient 3D representation by focusing solely on surface information. Also, we propose a two-stage pipeline to generate 3D objects from X-Ray Diffusion Model and Upsampler. We demonstrate the practicality and adaptability of our X-Ray representation by synthesizing the complete visible and hidden surfaces of a 3D object from a single input image. Experimental results reveal the state-of-the-art superiority of our representation in enhancing the accuracy of 3D generation, paving the way for new 3D representation research and practical applications.
title	X-Ray: A Sequential 3D Representation For Generation
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2404.14329

Similar Items