Saved in:
Bibliographic Details
Main Authors: Li, Zihao, Gao, Pan, You, Kang, Yan, Chuan, Paul, Manoranjan
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.08994
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914867045203968
author Li, Zihao
Gao, Pan
You, Kang
Yan, Chuan
Paul, Manoranjan
author_facet Li, Zihao
Gao, Pan
You, Kang
Yan, Chuan
Paul, Manoranjan
contents Previous studies have demonstrated the effectiveness of point-based neural models on the point cloud analysis task. However, there remains a crucial issue on producing the efficient input embedding for raw point coordinates. Moreover, another issue lies in the limited efficiency of neighboring aggregations, which is a critical component in the network stem. In this paper, we propose a Global Attention-guided Dual-domain Feature Learning network (GAD) to address the above-mentioned issues. We first devise the Contextual Position-enhanced Transformer (CPT) module, which is armed with an improved global attention mechanism, to produce a global-aware input embedding that serves as the guidance to subsequent aggregations. Then, the Dual-domain K-nearest neighbor Feature Fusion (DKFF) is cascaded to conduct effective feature aggregation through novel dual-domain feature learning which appreciates both local geometric relations and long-distance semantic connections. Extensive experiments on multiple point cloud analysis tasks (e.g., classification, part segmentation, and scene semantic segmentation) demonstrate the superior performance of the proposed method and the efficacy of the devised modules.
format Preprint
id arxiv_https___arxiv_org_abs_2407_08994
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation
Li, Zihao
Gao, Pan
You, Kang
Yan, Chuan
Paul, Manoranjan
Computer Vision and Pattern Recognition
Previous studies have demonstrated the effectiveness of point-based neural models on the point cloud analysis task. However, there remains a crucial issue on producing the efficient input embedding for raw point coordinates. Moreover, another issue lies in the limited efficiency of neighboring aggregations, which is a critical component in the network stem. In this paper, we propose a Global Attention-guided Dual-domain Feature Learning network (GAD) to address the above-mentioned issues. We first devise the Contextual Position-enhanced Transformer (CPT) module, which is armed with an improved global attention mechanism, to produce a global-aware input embedding that serves as the guidance to subsequent aggregations. Then, the Dual-domain K-nearest neighbor Feature Fusion (DKFF) is cascaded to conduct effective feature aggregation through novel dual-domain feature learning which appreciates both local geometric relations and long-distance semantic connections. Extensive experiments on multiple point cloud analysis tasks (e.g., classification, part segmentation, and scene semantic segmentation) demonstrate the superior performance of the proposed method and the efficacy of the devised modules.
title Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2407.08994