Staff View: HOMER: Homography-Based Efficient Multi-view 3D Object Removal :: Library Catalog

Bibliographic Details
Main Authors:	Ni, Jingcheng; Zhao, Weiguang; Wang, Daniel; Zeng, Ziyao; You, Chenyu; Wong, Alex; Huang, Kaizhu
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2501.17636

_version_	1866910911009128448
author	Ni, Jingcheng; Zhao, Weiguang; Wang, Daniel; Zeng, Ziyao; You, Chenyu; Wong, Alex; Huang, Kaizhu
author_facet	Ni, Jingcheng; Zhao, Weiguang; Wang, Daniel; Zeng, Ziyao; You, Chenyu; Wong, Alex; Huang, Kaizhu
contents	3D object removal is an important sub-task in 3D scene editing, with broad applications in scene understanding, augmented reality, and robotics. However, existing methods struggle to achieve a desirable balance among consistency, usability, and computational efficiency in multi-view settings. These limitations are primarily due to unintuitive user interaction in the source view, inefficient multi-view object mask generation, computationally expensive inpainting procedures, and a lack of applicability across different radiance field representations. To address these challenges, we propose a novel pipeline that improves the quality and efficiency of multi-view object mask generation and inpainting. Our method introduces an intuitive region-based interaction mechanism in the source view and eliminates the need for camera poses or extra model training. Our lightweight HoMM module is employed to achieve high-quality multi-view mask propagation with enhanced efficiency. In the inpainting stage, we further reduce computational costs by performing inpainting only on selected key views and propagating the results to other views via homography-based mapping. Our pipeline is compatible with a variety of radiance field frameworks, including NeRF and 3D Gaussian Splatting, demonstrating improved generalizability and practicality in real-world scenarios. Additionally, we present a new 3D multi-object removal dataset with greater object diversity and viewpoint variation than existing datasets. Experiments on public benchmarks and our proposed dataset show that our method achieves state-of-the-art performance while reducing runtime to one-fifth of that required by leading baselines.
format	Preprint
id	arxiv_https___arxiv_org_abs_2501_17636
institution	arXiv
publishDate	2025
record_format	arxiv
title	HOMER: Homography-Based Efficient Multi-view 3D Object Removal
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2501.17636
