Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Mathihalli, Nidhi, Wei, Audrey, Lavezzi, Giovanni, Siew, Peng Mun, Rodriguez-Fernandez, Victor, Urrutxua, Hodei, Linares, Richard
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2410.05097
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909338770079744
author	Mathihalli, Nidhi Wei, Audrey Lavezzi, Giovanni Siew, Peng Mun Rodriguez-Fernandez, Victor Urrutxua, Hodei Linares, Richard
author_facet	Mathihalli, Nidhi Wei, Audrey Lavezzi, Giovanni Siew, Peng Mun Rodriguez-Fernandez, Victor Urrutxua, Hodei Linares, Richard
contents	Novel view synthesis (NVS) enables to generate new images of a scene or convert a set of 2D images into a comprehensive 3D model. In the context of Space Domain Awareness, since space is becoming increasingly congested, NVS can accurately map space objects and debris, improving the safety and efficiency of space operations. Similarly, in Rendezvous and Proximity Operations missions, 3D models can provide details about a target object's shape, size, and orientation, allowing for better planning and prediction of the target's behavior. In this work, we explore the generalization abilities of these reconstruction techniques, aiming to avoid the necessity of retraining for each new scene, by presenting a novel approach to 3D spacecraft reconstruction from single-view images, DreamSat, by fine-tuning the Zero123 XL, a state-of-the-art single-view reconstruction model, on a high-quality dataset of 190 high-quality spacecraft models and integrating it into the DreamGaussian framework. We demonstrate consistent improvements in reconstruction quality across multiple metrics, including Contrastive Language-Image Pretraining (CLIP) score (+0.33%), Peak Signal-to-Noise Ratio (PSNR) (+2.53%), Structural Similarity Index (SSIM) (+2.38%), and Learned Perceptual Image Patch Similarity (LPIPS) (+0.16%) on a test set of 30 previously unseen spacecraft images. Our method addresses the lack of domain-specific 3D reconstruction tools in the space industry by leveraging state-of-the-art diffusion models and 3D Gaussian splatting techniques. This approach maintains the efficiency of the DreamGaussian framework while enhancing the accuracy and detail of spacecraft reconstructions. The code for this work can be accessed on GitHub (https://github.com/ARCLab-MIT/space-nvs).
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_05097
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects Mathihalli, Nidhi Wei, Audrey Lavezzi, Giovanni Siew, Peng Mun Rodriguez-Fernandez, Victor Urrutxua, Hodei Linares, Richard Computer Vision and Pattern Recognition Machine Learning Novel view synthesis (NVS) enables to generate new images of a scene or convert a set of 2D images into a comprehensive 3D model. In the context of Space Domain Awareness, since space is becoming increasingly congested, NVS can accurately map space objects and debris, improving the safety and efficiency of space operations. Similarly, in Rendezvous and Proximity Operations missions, 3D models can provide details about a target object's shape, size, and orientation, allowing for better planning and prediction of the target's behavior. In this work, we explore the generalization abilities of these reconstruction techniques, aiming to avoid the necessity of retraining for each new scene, by presenting a novel approach to 3D spacecraft reconstruction from single-view images, DreamSat, by fine-tuning the Zero123 XL, a state-of-the-art single-view reconstruction model, on a high-quality dataset of 190 high-quality spacecraft models and integrating it into the DreamGaussian framework. We demonstrate consistent improvements in reconstruction quality across multiple metrics, including Contrastive Language-Image Pretraining (CLIP) score (+0.33%), Peak Signal-to-Noise Ratio (PSNR) (+2.53%), Structural Similarity Index (SSIM) (+2.38%), and Learned Perceptual Image Patch Similarity (LPIPS) (+0.16%) on a test set of 30 previously unseen spacecraft images. Our method addresses the lack of domain-specific 3D reconstruction tools in the space industry by leveraging state-of-the-art diffusion models and 3D Gaussian splatting techniques. This approach maintains the efficiency of the DreamGaussian framework while enhancing the accuracy and detail of spacecraft reconstructions. The code for this work can be accessed on GitHub (https://github.com/ARCLab-MIT/space-nvs).
title	DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects
topic	Computer Vision and Pattern Recognition Machine Learning
url	https://arxiv.org/abs/2410.05097

Similar Items