Saved in:
Bibliographic Details
Main Authors: Mathihalli, Nidhi, Wei, Audrey, Lavezzi, Giovanni, Siew, Peng Mun, Rodriguez-Fernandez, Victor, Urrutxua, Hodei, Linares, Richard
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.05097
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909338770079744
author Mathihalli, Nidhi
Wei, Audrey
Lavezzi, Giovanni
Siew, Peng Mun
Rodriguez-Fernandez, Victor
Urrutxua, Hodei
Linares, Richard
author_facet Mathihalli, Nidhi
Wei, Audrey
Lavezzi, Giovanni
Siew, Peng Mun
Rodriguez-Fernandez, Victor
Urrutxua, Hodei
Linares, Richard
contents Novel view synthesis (NVS) enables to generate new images of a scene or convert a set of 2D images into a comprehensive 3D model. In the context of Space Domain Awareness, since space is becoming increasingly congested, NVS can accurately map space objects and debris, improving the safety and efficiency of space operations. Similarly, in Rendezvous and Proximity Operations missions, 3D models can provide details about a target object's shape, size, and orientation, allowing for better planning and prediction of the target's behavior. In this work, we explore the generalization abilities of these reconstruction techniques, aiming to avoid the necessity of retraining for each new scene, by presenting a novel approach to 3D spacecraft reconstruction from single-view images, DreamSat, by fine-tuning the Zero123 XL, a state-of-the-art single-view reconstruction model, on a high-quality dataset of 190 high-quality spacecraft models and integrating it into the DreamGaussian framework. We demonstrate consistent improvements in reconstruction quality across multiple metrics, including Contrastive Language-Image Pretraining (CLIP) score (+0.33%), Peak Signal-to-Noise Ratio (PSNR) (+2.53%), Structural Similarity Index (SSIM) (+2.38%), and Learned Perceptual Image Patch Similarity (LPIPS) (+0.16%) on a test set of 30 previously unseen spacecraft images. Our method addresses the lack of domain-specific 3D reconstruction tools in the space industry by leveraging state-of-the-art diffusion models and 3D Gaussian splatting techniques. This approach maintains the efficiency of the DreamGaussian framework while enhancing the accuracy and detail of spacecraft reconstructions. The code for this work can be accessed on GitHub (https://github.com/ARCLab-MIT/space-nvs).
format Preprint
id arxiv_https___arxiv_org_abs_2410_05097
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects
Mathihalli, Nidhi
Wei, Audrey
Lavezzi, Giovanni
Siew, Peng Mun
Rodriguez-Fernandez, Victor
Urrutxua, Hodei
Linares, Richard
Computer Vision and Pattern Recognition
Machine Learning
Novel view synthesis (NVS) enables to generate new images of a scene or convert a set of 2D images into a comprehensive 3D model. In the context of Space Domain Awareness, since space is becoming increasingly congested, NVS can accurately map space objects and debris, improving the safety and efficiency of space operations. Similarly, in Rendezvous and Proximity Operations missions, 3D models can provide details about a target object's shape, size, and orientation, allowing for better planning and prediction of the target's behavior. In this work, we explore the generalization abilities of these reconstruction techniques, aiming to avoid the necessity of retraining for each new scene, by presenting a novel approach to 3D spacecraft reconstruction from single-view images, DreamSat, by fine-tuning the Zero123 XL, a state-of-the-art single-view reconstruction model, on a high-quality dataset of 190 high-quality spacecraft models and integrating it into the DreamGaussian framework. We demonstrate consistent improvements in reconstruction quality across multiple metrics, including Contrastive Language-Image Pretraining (CLIP) score (+0.33%), Peak Signal-to-Noise Ratio (PSNR) (+2.53%), Structural Similarity Index (SSIM) (+2.38%), and Learned Perceptual Image Patch Similarity (LPIPS) (+0.16%) on a test set of 30 previously unseen spacecraft images. Our method addresses the lack of domain-specific 3D reconstruction tools in the space industry by leveraging state-of-the-art diffusion models and 3D Gaussian splatting techniques. This approach maintains the efficiency of the DreamGaussian framework while enhancing the accuracy and detail of spacecraft reconstructions. The code for this work can be accessed on GitHub (https://github.com/ARCLab-MIT/space-nvs).
title DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects
topic Computer Vision and Pattern Recognition
Machine Learning
url https://arxiv.org/abs/2410.05097