Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhang, Xiaohan, Sun, Zhenyu, Qiu, Yukui, Su, Junyan, Liu, Qi
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2412.10078
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915062909763584
author	Zhang, Xiaohan Sun, Zhenyu Qiu, Yukui Su, Junyan Liu, Qi
author_facet	Zhang, Xiaohan Sun, Zhenyu Qiu, Yukui Su, Junyan Liu, Qi
contents	Currently, 3D rendering for large-scale free camera trajectories, namely, arbitrary input camera trajectories, poses significant challenges: 1) The distribution and observation angles of the cameras are irregular, and various types of scenes are included in the free trajectories; 2) Processing the entire point cloud and all images at once for large-scale scenes requires a substantial amount of GPU memory. This paper presents a Toy-GS method for accurately rendering large-scale free camera trajectories. Specifically, we propose an adaptive spatial division approach for free trajectories to divide cameras and the sparse point cloud of the entire scene into various regions according to camera poses. Training each local Gaussian in parallel for each area enables us to concentrate on texture details and minimize GPU memory usage. Next, we use the multi-view constraint and position-aware point adaptive control (PPAC) to improve the rendering quality of texture details. In addition, our regional fusion approach combines local and global Gaussians to enhance rendering quality with an increasing number of divided areas. Extensive experiments have been carried out to confirm the effectiveness and efficiency of Toy-GS, leading to state-of-the-art results on two public large-scale datasets as well as our SCUTic dataset. Our proposal demonstrates an enhancement of 1.19 dB in PSNR and conserves 7 G of GPU memory when compared to various benchmarks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2412_10078
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Toy-GS: Assembling Local Gaussians for Precisely Rendering Large-Scale Free Camera Trajectories Zhang, Xiaohan Sun, Zhenyu Qiu, Yukui Su, Junyan Liu, Qi Computer Vision and Pattern Recognition Currently, 3D rendering for large-scale free camera trajectories, namely, arbitrary input camera trajectories, poses significant challenges: 1) The distribution and observation angles of the cameras are irregular, and various types of scenes are included in the free trajectories; 2) Processing the entire point cloud and all images at once for large-scale scenes requires a substantial amount of GPU memory. This paper presents a Toy-GS method for accurately rendering large-scale free camera trajectories. Specifically, we propose an adaptive spatial division approach for free trajectories to divide cameras and the sparse point cloud of the entire scene into various regions according to camera poses. Training each local Gaussian in parallel for each area enables us to concentrate on texture details and minimize GPU memory usage. Next, we use the multi-view constraint and position-aware point adaptive control (PPAC) to improve the rendering quality of texture details. In addition, our regional fusion approach combines local and global Gaussians to enhance rendering quality with an increasing number of divided areas. Extensive experiments have been carried out to confirm the effectiveness and efficiency of Toy-GS, leading to state-of-the-art results on two public large-scale datasets as well as our SCUTic dataset. Our proposal demonstrates an enhancement of 1.19 dB in PSNR and conserves 7 G of GPU memory when compared to various benchmarks.
title	Toy-GS: Assembling Local Gaussians for Precisely Rendering Large-Scale Free Camera Trajectories
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2412.10078

Similar Items