Saved in:
Bibliographic Details
Main Authors: Li, Zhe, Wang, Xiying, Zhao, Jinglin, Wang, Zheng, Liu, Debin, Yang, Laurence T.
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.02088
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913886186242048
author Li, Zhe
Wang, Xiying
Zhao, Jinglin
Wang, Zheng
Liu, Debin
Yang, Laurence T.
author_facet Li, Zhe
Wang, Xiying
Zhao, Jinglin
Wang, Zheng
Liu, Debin
Yang, Laurence T.
contents Masked autoencoding and generative pretraining have achieved remarkable success in computer vision and natural language processing, and more recently, they have been extended to the point cloud domain. Nevertheless, existing point cloud models suffer from the issue of information leakage due to the pre-sampling of center points, which leads to trivial proxy tasks for the models. These approaches primarily focus on local feature reconstruction, limiting their ability to capture global patterns within point clouds. In this paper, we argue that the reduced difficulty of pretext tasks hampers the model's capacity to learn expressive representations. To address these limitations, we introduce a novel solution called the Differentiable Center Sampling Network (DCS-Net). It tackles the information leakage problem by incorporating both global feature reconstruction and local feature reconstruction as non-trivial proxy tasks, enabling simultaneous learning of both the global and local patterns within point cloud. Experimental results demonstrate that our method enhances the expressive capacity of existing point cloud models and effectively addresses the issue of information leakage.
format Preprint
id arxiv_https___arxiv_org_abs_2402_02088
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Mitigating Prior Shape Bias in Point Clouds via Differentiable Center Learning
Li, Zhe
Wang, Xiying
Zhao, Jinglin
Wang, Zheng
Liu, Debin
Yang, Laurence T.
Computer Vision and Pattern Recognition
Masked autoencoding and generative pretraining have achieved remarkable success in computer vision and natural language processing, and more recently, they have been extended to the point cloud domain. Nevertheless, existing point cloud models suffer from the issue of information leakage due to the pre-sampling of center points, which leads to trivial proxy tasks for the models. These approaches primarily focus on local feature reconstruction, limiting their ability to capture global patterns within point clouds. In this paper, we argue that the reduced difficulty of pretext tasks hampers the model's capacity to learn expressive representations. To address these limitations, we introduce a novel solution called the Differentiable Center Sampling Network (DCS-Net). It tackles the information leakage problem by incorporating both global feature reconstruction and local feature reconstruction as non-trivial proxy tasks, enabling simultaneous learning of both the global and local patterns within point cloud. Experimental results demonstrate that our method enhances the expressive capacity of existing point cloud models and effectively addresses the issue of information leakage.
title Mitigating Prior Shape Bias in Point Clouds via Differentiable Center Learning
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2402.02088