Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Li, Zihao, Gao, Pan, You, Kang, Yan, Chuan, Paul, Manoranjan
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2407.08994
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914867045203968
author	Li, Zihao Gao, Pan You, Kang Yan, Chuan Paul, Manoranjan
author_facet	Li, Zihao Gao, Pan You, Kang Yan, Chuan Paul, Manoranjan
contents	Previous studies have demonstrated the effectiveness of point-based neural models on the point cloud analysis task. However, there remains a crucial issue on producing the efficient input embedding for raw point coordinates. Moreover, another issue lies in the limited efficiency of neighboring aggregations, which is a critical component in the network stem. In this paper, we propose a Global Attention-guided Dual-domain Feature Learning network (GAD) to address the above-mentioned issues. We first devise the Contextual Position-enhanced Transformer (CPT) module, which is armed with an improved global attention mechanism, to produce a global-aware input embedding that serves as the guidance to subsequent aggregations. Then, the Dual-domain K-nearest neighbor Feature Fusion (DKFF) is cascaded to conduct effective feature aggregation through novel dual-domain feature learning which appreciates both local geometric relations and long-distance semantic connections. Extensive experiments on multiple point cloud analysis tasks (e.g., classification, part segmentation, and scene semantic segmentation) demonstrate the superior performance of the proposed method and the efficacy of the devised modules.
format	Preprint
id	arxiv_https___arxiv_org_abs_2407_08994
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation Li, Zihao Gao, Pan You, Kang Yan, Chuan Paul, Manoranjan Computer Vision and Pattern Recognition Previous studies have demonstrated the effectiveness of point-based neural models on the point cloud analysis task. However, there remains a crucial issue on producing the efficient input embedding for raw point coordinates. Moreover, another issue lies in the limited efficiency of neighboring aggregations, which is a critical component in the network stem. In this paper, we propose a Global Attention-guided Dual-domain Feature Learning network (GAD) to address the above-mentioned issues. We first devise the Contextual Position-enhanced Transformer (CPT) module, which is armed with an improved global attention mechanism, to produce a global-aware input embedding that serves as the guidance to subsequent aggregations. Then, the Dual-domain K-nearest neighbor Feature Fusion (DKFF) is cascaded to conduct effective feature aggregation through novel dual-domain feature learning which appreciates both local geometric relations and long-distance semantic connections. Extensive experiments on multiple point cloud analysis tasks (e.g., classification, part segmentation, and scene semantic segmentation) demonstrate the superior performance of the proposed method and the efficacy of the devised modules.
title	Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2407.08994

Similar Items