Saved in:
Bibliographic Details
Main Authors: Sun, Guodong, Peng, Yuting, Cheng, Le, Xu, Mengya, Wang, An, Wu, Bo, Ren, Hongliang, Zhang, Yang
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.17370
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914693031919616
author Sun, Guodong
Peng, Yuting
Cheng, Le
Xu, Mengya
Wang, An
Wu, Bo
Ren, Hongliang
Zhang, Yang
author_facet Sun, Guodong
Peng, Yuting
Cheng, Le
Xu, Mengya
Wang, An
Wu, Bo
Ren, Hongliang
Zhang, Yang
contents The precise segmentation of ore images is critical to the successful execution of the beneficiation process. Due to the homogeneous appearance of the ores, which leads to low contrast and unclear boundaries, accurate segmentation becomes challenging, and recognition becomes problematic. This paper proposes a lightweight framework based on Multi-Layer Perceptron (MLP), which focuses on solving the problem of edge burring. Specifically, we introduce a lightweight backbone better suited for efficiently extracting low-level features. Besides, we design a feature pyramid network consisting of two MLP structures that balance local and global information thus enhancing detection accuracy. Furthermore, we propose a novel loss function that guides the prediction points to match the instance edge points to achieve clear object boundaries. We have conducted extensive experiments to validate the efficacy of our proposed method. Our approach achieves a remarkable processing speed of over 27 frames per second (FPS) with a model size of only 73 MB. Moreover, our method delivers a consistently high level of accuracy, with impressive performance scores of 60.4 and 48.9 in~$AP_{50}^{box}$ and~$AP_{50}^{mask}$ respectively, as compared to the currently available state-of-the-art techniques, when tested on the ore image dataset. The source code will be released at \url{https://github.com/MVME-HBUT/ORENEXT}.
format Preprint
id arxiv_https___arxiv_org_abs_2402_17370
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle An Efficient MLP-based Point-guided Segmentation Network for Ore Images with Ambiguous Boundary
Sun, Guodong
Peng, Yuting
Cheng, Le
Xu, Mengya
Wang, An
Wu, Bo
Ren, Hongliang
Zhang, Yang
Computer Vision and Pattern Recognition
The precise segmentation of ore images is critical to the successful execution of the beneficiation process. Due to the homogeneous appearance of the ores, which leads to low contrast and unclear boundaries, accurate segmentation becomes challenging, and recognition becomes problematic. This paper proposes a lightweight framework based on Multi-Layer Perceptron (MLP), which focuses on solving the problem of edge burring. Specifically, we introduce a lightweight backbone better suited for efficiently extracting low-level features. Besides, we design a feature pyramid network consisting of two MLP structures that balance local and global information thus enhancing detection accuracy. Furthermore, we propose a novel loss function that guides the prediction points to match the instance edge points to achieve clear object boundaries. We have conducted extensive experiments to validate the efficacy of our proposed method. Our approach achieves a remarkable processing speed of over 27 frames per second (FPS) with a model size of only 73 MB. Moreover, our method delivers a consistently high level of accuracy, with impressive performance scores of 60.4 and 48.9 in~$AP_{50}^{box}$ and~$AP_{50}^{mask}$ respectively, as compared to the currently available state-of-the-art techniques, when tested on the ore image dataset. The source code will be released at \url{https://github.com/MVME-HBUT/ORENEXT}.
title An Efficient MLP-based Point-guided Segmentation Network for Ore Images with Ambiguous Boundary
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2402.17370