Staff View: Masked Generative Extractor for Synergistic Representation and 3D Generation of Point Clouds :: Library Catalog


Bibliographic Details
Main Authors:	Zeng, Hongliang; Zhang, Ping; Li, Fang; Wang, Jiahua; Ye, Tingyu; Guo, Pengteng
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2406.17342

_version_	1866916358062604288
author	Zeng, Hongliang; Zhang, Ping; Li, Fang; Wang, Jiahua; Ye, Tingyu; Guo, Pengteng
author_facet	Zeng, Hongliang; Zhang, Ping; Li, Fang; Wang, Jiahua; Ye, Tingyu; Guo, Pengteng
contents	Representation and generative learning, as reconstruction-based methods, have demonstrated their potential for mutual reinforcement across various domains. In the field of point cloud processing, although existing studies have adopted training strategies from generative models to enhance representational capabilities, these methods are limited by their inability to genuinely generate 3D shapes. To explore the benefits of deeply integrating 3D representation learning and generative learning, we propose an innovative framework called Point-MGE. Specifically, this framework first utilizes a vector quantized variational autoencoder to reconstruct a neural field representation of 3D shapes, thereby learning discrete semantic features of point patches. Subsequently, we design a sliding masking ratio to smooth the transition from representation learning to generative learning. Moreover, our method demonstrates strong generalization capability in learning high-capacity models, achieving new state-of-the-art performance across multiple downstream tasks. In shape classification, Point-MGE achieved an accuracy of 94.2% (+1.0%) on the ModelNet40 dataset and 92.9% (+5.5%) on the ScanObjectNN dataset. Experimental results also confirmed that Point-MGE can generate high-quality 3D shapes in both unconditional and conditional settings.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_17342
institution	arXiv
publishDate	2024
record_format	arxiv
title	Masked Generative Extractor for Synergistic Representation and 3D Generation of Point Clouds
topic	Computer Vision and Pattern Recognition; Artificial Intelligence
url	https://arxiv.org/abs/2406.17342
