Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Cheng, Zhixin, Yin, Xiaotian, Deng, Jiacheng, Liao, Bohao, Chen, Yujia, Zhou, Xu, Yin, Baoqun, Zhang, Tianzhu
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.05965
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911255544987648
author	Cheng, Zhixin Yin, Xiaotian Deng, Jiacheng Liao, Bohao Chen, Yujia Zhou, Xu Yin, Baoqun Zhang, Tianzhu
author_facet	Cheng, Zhixin Yin, Xiaotian Deng, Jiacheng Liao, Bohao Chen, Yujia Zhou, Xu Yin, Baoqun Zhang, Tianzhu
contents	Typical detection-free methods for image-to-point cloud registration leverage transformer-based architectures to aggregate cross-modal features and establish correspondences. However, they often struggle under challenging conditions, where noise disrupts similarity computation and leads to incorrect correspondences. Moreover, without dedicated designs, it remains difficult to effectively select informative and correlated representations across modalities, thereby limiting the robustness and accuracy of registration. To address these challenges, we propose a novel cross-modal registration framework composed of two key modules: the Iterative Agents Selection (IAS) module and the Reliable Agents Interaction (RAI) module. IAS enhances structural feature awareness with phase maps and employs reinforcement learning principles to efficiently select reliable agents. RAI then leverages these selected agents to guide cross-modal interactions, effectively reducing mismatches and improving overall robustness. Extensive experiments on the RGB-D Scenes v2 and 7-Scenes benchmarks demonstrate that our method consistently achieves state-of-the-art performance.
format	Preprint
id	arxiv_https___arxiv_org_abs_2511_05965
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Adaptive Agent Selection and Interaction Network for Image-to-point cloud Registration Cheng, Zhixin Yin, Xiaotian Deng, Jiacheng Liao, Bohao Chen, Yujia Zhou, Xu Yin, Baoqun Zhang, Tianzhu Computer Vision and Pattern Recognition Artificial Intelligence Typical detection-free methods for image-to-point cloud registration leverage transformer-based architectures to aggregate cross-modal features and establish correspondences. However, they often struggle under challenging conditions, where noise disrupts similarity computation and leads to incorrect correspondences. Moreover, without dedicated designs, it remains difficult to effectively select informative and correlated representations across modalities, thereby limiting the robustness and accuracy of registration. To address these challenges, we propose a novel cross-modal registration framework composed of two key modules: the Iterative Agents Selection (IAS) module and the Reliable Agents Interaction (RAI) module. IAS enhances structural feature awareness with phase maps and employs reinforcement learning principles to efficiently select reliable agents. RAI then leverages these selected agents to guide cross-modal interactions, effectively reducing mismatches and improving overall robustness. Extensive experiments on the RGB-D Scenes v2 and 7-Scenes benchmarks demonstrate that our method consistently achieves state-of-the-art performance.
title	Adaptive Agent Selection and Interaction Network for Image-to-point cloud Registration
topic	Computer Vision and Pattern Recognition Artificial Intelligence
url	https://arxiv.org/abs/2511.05965

Similar Items