Saved in:
Bibliographic Details
Main Authors: Peng, Dehua, Gui, Zhipeng, Gui, Jie, Wu, Huayi
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2312.04065
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910843458813952
author Peng, Dehua
Gui, Zhipeng
Gui, Jie
Wu, Huayi
author_facet Peng, Dehua
Gui, Zhipeng
Gui, Jie
Wu, Huayi
contents Boundary point detection aims to outline the external contour structure of clusters and enhance the inter-cluster discrimination, thus bolstering the performance of the downstream classification and clustering tasks. However, existing boundary point detectors are sensitive to density heterogeneity or cannot identify boundary points in concave structures and high-dimensional manifolds. In this work, we propose a robust and efficient boundary point detection method based on Local Direction Dispersion (LoDD). The core of boundary point detection lies in measuring the difference between boundary points and internal points. It is a common observation that an internal point is surrounded by its neighbors in all directions, while the neighbors of a boundary point tend to be distributed only in a certain directional range. By considering this observation, we adopt density-independent K-Nearest Neighbors (KNN) method to determine neighboring points and design a centrality metric LoDD using the eigenvalues of the covariance matrix to depict the distribution uniformity of KNN. We also develop a grid-structure assumption of data distribution to determine the parameters adaptively. The effectiveness of LoDD is demonstrated on synthetic datasets, real-world benchmarks, and application of training set split for deep learning model and hole detection on point cloud data. The datasets and toolkit are available at: https://github.com/ZPGuiGroupWhu/lodd.
format Preprint
id arxiv_https___arxiv_org_abs_2312_04065
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle A Robust and Efficient Boundary Point Detection Method by Measuring Local Direction Dispersion
Peng, Dehua
Gui, Zhipeng
Gui, Jie
Wu, Huayi
Machine Learning
I.5.2
Boundary point detection aims to outline the external contour structure of clusters and enhance the inter-cluster discrimination, thus bolstering the performance of the downstream classification and clustering tasks. However, existing boundary point detectors are sensitive to density heterogeneity or cannot identify boundary points in concave structures and high-dimensional manifolds. In this work, we propose a robust and efficient boundary point detection method based on Local Direction Dispersion (LoDD). The core of boundary point detection lies in measuring the difference between boundary points and internal points. It is a common observation that an internal point is surrounded by its neighbors in all directions, while the neighbors of a boundary point tend to be distributed only in a certain directional range. By considering this observation, we adopt density-independent K-Nearest Neighbors (KNN) method to determine neighboring points and design a centrality metric LoDD using the eigenvalues of the covariance matrix to depict the distribution uniformity of KNN. We also develop a grid-structure assumption of data distribution to determine the parameters adaptively. The effectiveness of LoDD is demonstrated on synthetic datasets, real-world benchmarks, and application of training set split for deep learning model and hole detection on point cloud data. The datasets and toolkit are available at: https://github.com/ZPGuiGroupWhu/lodd.
title A Robust and Efficient Boundary Point Detection Method by Measuring Local Direction Dispersion
topic Machine Learning
I.5.2
url https://arxiv.org/abs/2312.04065