Saved in:
Bibliographic Details
Main Authors: Hu, Tao, Ge, Wenhang, Zhao, Yuyang, Lee, Gim Hee
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.14329
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916268859195392
author Hu, Tao
Ge, Wenhang
Zhao, Yuyang
Lee, Gim Hee
author_facet Hu, Tao
Ge, Wenhang
Zhao, Yuyang
Lee, Gim Hee
contents We introduce X-Ray, a novel 3D sequential representation inspired by the penetrability of x-ray scans. X-Ray transforms a 3D object into a series of surface frames at different layers, making it suitable for generating 3D models from images. Our method utilizes ray casting from the camera center to capture geometric and textured details, including depth, normal, and color, across all intersected surfaces. This process efficiently condenses the whole 3D object into a multi-frame video format, motivating the utilize of a network architecture similar to those in video diffusion models. This design ensures an efficient 3D representation by focusing solely on surface information. Also, we propose a two-stage pipeline to generate 3D objects from X-Ray Diffusion Model and Upsampler. We demonstrate the practicality and adaptability of our X-Ray representation by synthesizing the complete visible and hidden surfaces of a 3D object from a single input image. Experimental results reveal the state-of-the-art superiority of our representation in enhancing the accuracy of 3D generation, paving the way for new 3D representation research and practical applications.
format Preprint
id arxiv_https___arxiv_org_abs_2404_14329
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle X-Ray: A Sequential 3D Representation For Generation
Hu, Tao
Ge, Wenhang
Zhao, Yuyang
Lee, Gim Hee
Computer Vision and Pattern Recognition
We introduce X-Ray, a novel 3D sequential representation inspired by the penetrability of x-ray scans. X-Ray transforms a 3D object into a series of surface frames at different layers, making it suitable for generating 3D models from images. Our method utilizes ray casting from the camera center to capture geometric and textured details, including depth, normal, and color, across all intersected surfaces. This process efficiently condenses the whole 3D object into a multi-frame video format, motivating the utilize of a network architecture similar to those in video diffusion models. This design ensures an efficient 3D representation by focusing solely on surface information. Also, we propose a two-stage pipeline to generate 3D objects from X-Ray Diffusion Model and Upsampler. We demonstrate the practicality and adaptability of our X-Ray representation by synthesizing the complete visible and hidden surfaces of a 3D object from a single input image. Experimental results reveal the state-of-the-art superiority of our representation in enhancing the accuracy of 3D generation, paving the way for new 3D representation research and practical applications.
title X-Ray: A Sequential 3D Representation For Generation
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2404.14329