Saved in:
Bibliographic Details
Main Authors: Gao, Fang, Li, Xuetao, Wang, Jiabao, Ma, Shengheng, Yu, Jun
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.05762
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915789266747392
author Gao, Fang
Li, Xuetao
Wang, Jiabao
Ma, Shengheng
Yu, Jun
author_facet Gao, Fang
Li, Xuetao
Wang, Jiabao
Ma, Shengheng
Yu, Jun
contents With the development of steel materials, metallographic analysis has become increasingly important. Unfortunately, grain size analysis is a manual process that requires experts to evaluate metallographic photographs, which is unreliable and time-consuming. To resolve this problem, we propose a novel classifi-cation method based on deep learning, namely GSNets, a family of hybrid models which can effectively introduce guided self-attention for classifying grain size. Concretely, we build our models from three insights:(1) Introducing our novel guided self-attention module can assist the model in finding the generalized necessarily distinct vectors capable of retaining intricate rela-tional connections and rich local feature information; (2) By improving the pixel-wise linear independence of the feature map, the highly condensed semantic representation will be captured by the model; (3) Our novel triple-stream merging module can significantly improve the generalization capability and efficiency of the model. Experiments show that our GSNet yields a classifi-cation accuracy of 90.1%, surpassing the state-of-the-art Swin Transformer V2 by 1.9% on the steel grain size dataset, which comprises 3,599 images with 14 grain size levels. Furthermore, we intuitively believe our approach is applicable to broader ap-plications like object detection and semantic segmentation.
format Preprint
id arxiv_https___arxiv_org_abs_2410_05762
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading
Gao, Fang
Li, Xuetao
Wang, Jiabao
Ma, Shengheng
Yu, Jun
Computer Vision and Pattern Recognition
With the development of steel materials, metallographic analysis has become increasingly important. Unfortunately, grain size analysis is a manual process that requires experts to evaluate metallographic photographs, which is unreliable and time-consuming. To resolve this problem, we propose a novel classifi-cation method based on deep learning, namely GSNets, a family of hybrid models which can effectively introduce guided self-attention for classifying grain size. Concretely, we build our models from three insights:(1) Introducing our novel guided self-attention module can assist the model in finding the generalized necessarily distinct vectors capable of retaining intricate rela-tional connections and rich local feature information; (2) By improving the pixel-wise linear independence of the feature map, the highly condensed semantic representation will be captured by the model; (3) Our novel triple-stream merging module can significantly improve the generalization capability and efficiency of the model. Experiments show that our GSNet yields a classifi-cation accuracy of 90.1%, surpassing the state-of-the-art Swin Transformer V2 by 1.9% on the steel grain size dataset, which comprises 3,599 images with 14 grain size levels. Furthermore, we intuitively believe our approach is applicable to broader ap-plications like object detection and semantic segmentation.
title Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2410.05762