Saved in:
Bibliographic Details
Main Authors: Ni, Jingcheng, Zhao, Weiguang, Wang, Daniel, Zeng, Ziyao, You, Chenyu, Wong, Alex, Huang, Kaizhu
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2501.17636
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910911009128448
author Ni, Jingcheng
Zhao, Weiguang
Wang, Daniel
Zeng, Ziyao
You, Chenyu
Wong, Alex
Huang, Kaizhu
author_facet Ni, Jingcheng
Zhao, Weiguang
Wang, Daniel
Zeng, Ziyao
You, Chenyu
Wong, Alex
Huang, Kaizhu
contents 3D object removal is an important sub-task in 3D scene editing, with broad applications in scene understanding, augmented reality, and robotics. However, existing methods struggle to achieve a desirable balance among consistency, usability, and computational efficiency in multi-view settings. These limitations are primarily due to unintuitive user interaction in the source view, inefficient multi-view object mask generation, computationally expensive inpainting procedures, and a lack of applicability across different radiance field representations. To address these challenges, we propose a novel pipeline that improves the quality and efficiency of multi-view object mask generation and inpainting. Our method introduces an intuitive region-based interaction mechanism in the source view and eliminates the need for camera poses or extra model training. Our lightweight HoMM module is employed to achieve high-quality multi-view mask propagation with enhanced efficiency. In the inpainting stage, we further reduce computational costs by performing inpainting only on selected key views and propagating the results to other views via homography-based mapping. Our pipeline is compatible with a variety of radiance field frameworks, including NeRF and 3D Gaussian Splatting, demonstrating improved generalizability and practicality in real-world scenarios. Additionally, we present a new 3D multi-object removal dataset with greater object diversity and viewpoint variation than existing datasets. Experiments on public benchmarks and our proposed dataset show that our method achieves state-of-the-art performance while reducing runtime to one-fifth of that required by leading baselines.
format Preprint
id arxiv_https___arxiv_org_abs_2501_17636
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle HOMER: Homography-Based Efficient Multi-view 3D Object Removal
Ni, Jingcheng
Zhao, Weiguang
Wang, Daniel
Zeng, Ziyao
You, Chenyu
Wong, Alex
Huang, Kaizhu
Computer Vision and Pattern Recognition
3D object removal is an important sub-task in 3D scene editing, with broad applications in scene understanding, augmented reality, and robotics. However, existing methods struggle to achieve a desirable balance among consistency, usability, and computational efficiency in multi-view settings. These limitations are primarily due to unintuitive user interaction in the source view, inefficient multi-view object mask generation, computationally expensive inpainting procedures, and a lack of applicability across different radiance field representations. To address these challenges, we propose a novel pipeline that improves the quality and efficiency of multi-view object mask generation and inpainting. Our method introduces an intuitive region-based interaction mechanism in the source view and eliminates the need for camera poses or extra model training. Our lightweight HoMM module is employed to achieve high-quality multi-view mask propagation with enhanced efficiency. In the inpainting stage, we further reduce computational costs by performing inpainting only on selected key views and propagating the results to other views via homography-based mapping. Our pipeline is compatible with a variety of radiance field frameworks, including NeRF and 3D Gaussian Splatting, demonstrating improved generalizability and practicality in real-world scenarios. Additionally, we present a new 3D multi-object removal dataset with greater object diversity and viewpoint variation than existing datasets. Experiments on public benchmarks and our proposed dataset show that our method achieves state-of-the-art performance while reducing runtime to one-fifth of that required by leading baselines.
title HOMER: Homography-Based Efficient Multi-view 3D Object Removal
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2501.17636