Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Yu, Zhinan, Qin, Zheng, Tang, Yijie, Wang, Yongjun, Yi, Renjiao, Zhu, Chenyang, Xu, Kai
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2405.00507
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912355531620352
author	Yu, Zhinan Qin, Zheng Tang, Yijie Wang, Yongjun Yi, Renjiao Zhu, Chenyang Xu, Kai
author_facet	Yu, Zhinan Qin, Zheng Tang, Yijie Wang, Yongjun Yi, Renjiao Zhu, Chenyang Xu, Kai
contents	This work studies the problem of unsupervised RGB-D point cloud registration, which aims at training a robust registration model without ground-truth pose supervision. Existing methods usually leverages unposed RGB-D sequences and adopt a frame-to-frame framework based on differentiable rendering to train the registration model, which enforces the photometric and geometric consistency between the two frames for supervision. However, this frame-to-frame framework is vulnerable to inconsistent factors between different frames, e.g., lighting changes, geometry occlusion, and reflective materials, which leads to suboptimal convergence of the registration model. In this paper, we propose a novel frame-to-model optimization framework named F2M-Reg for unsupervised RGB-D point cloud registration. We leverage the neural implicit field as a global model of the scene and optimize the estimated poses of the frames by registering them to the global model, and the registration model is subsequently trained with the optimized poses. Thanks to the global encoding capability of neural implicit field, our frame-to-model framework is significantly more robust to inconsistent factors between different frames and thus can provide better supervision for the registration model. Besides, we demonstrate that F2M-Reg can be further enhanced by a simplistic synthetic warming-up strategy. To this end, we construct a photorealistic synthetic dataset named Sim-RGBD to initialize the registration model for the frame-to-model optimization on real-world RGB-D sequences. Extensive experiments on four challenging benchmarks have shown that our method surpasses the previous state-of-the-art counterparts by a large margin, especially under scenarios with severe lighting changes and low overlap. Our code and models are available at https://github.com/MrIsland/F2M_Reg.
format	Preprint
id	arxiv_https___arxiv_org_abs_2405_00507
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	F2M-Reg: Unsupervised RGB-D Point Cloud Registration with Frame-to-Model Optimization Yu, Zhinan Qin, Zheng Tang, Yijie Wang, Yongjun Yi, Renjiao Zhu, Chenyang Xu, Kai Computer Vision and Pattern Recognition This work studies the problem of unsupervised RGB-D point cloud registration, which aims at training a robust registration model without ground-truth pose supervision. Existing methods usually leverages unposed RGB-D sequences and adopt a frame-to-frame framework based on differentiable rendering to train the registration model, which enforces the photometric and geometric consistency between the two frames for supervision. However, this frame-to-frame framework is vulnerable to inconsistent factors between different frames, e.g., lighting changes, geometry occlusion, and reflective materials, which leads to suboptimal convergence of the registration model. In this paper, we propose a novel frame-to-model optimization framework named F2M-Reg for unsupervised RGB-D point cloud registration. We leverage the neural implicit field as a global model of the scene and optimize the estimated poses of the frames by registering them to the global model, and the registration model is subsequently trained with the optimized poses. Thanks to the global encoding capability of neural implicit field, our frame-to-model framework is significantly more robust to inconsistent factors between different frames and thus can provide better supervision for the registration model. Besides, we demonstrate that F2M-Reg can be further enhanced by a simplistic synthetic warming-up strategy. To this end, we construct a photorealistic synthetic dataset named Sim-RGBD to initialize the registration model for the frame-to-model optimization on real-world RGB-D sequences. Extensive experiments on four challenging benchmarks have shown that our method surpasses the previous state-of-the-art counterparts by a large margin, especially under scenarios with severe lighting changes and low overlap. Our code and models are available at https://github.com/MrIsland/F2M_Reg.
title	F2M-Reg: Unsupervised RGB-D Point Cloud Registration with Frame-to-Model Optimization
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2405.00507

Similar Items