Enregistré dans:
Détails bibliographiques
Auteurs principaux: Hong, Yuxin, Zhang, Xiao, Zhang, Xin, Zhou, Joey Tianyi
Format: Preprint
Publié: 2024
Sujets:
Accès en ligne:https://arxiv.org/abs/2406.05677
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866917765042929664
author Hong, Yuxin
Zhang, Xiao
Zhang, Xin
Zhou, Joey Tianyi
author_facet Hong, Yuxin
Zhang, Xiao
Zhang, Xin
Zhou, Joey Tianyi
contents In the medical field, managing high-dimensional massive medical imaging data and performing reliable medical analysis from it is a critical challenge, especially in resource-limited environments such as remote medical facilities and mobile devices. This necessitates effective dataset compression techniques to reduce storage, transmission, and computational cost. However, existing coreset selection methods are primarily designed for natural image datasets, and exhibit doubtful effectiveness when applied to medical image datasets due to challenges such as intra-class variation and inter-class similarity. In this paper, we propose a novel coreset selection strategy termed as Evolution-aware VAriance (EVA), which captures the evolutionary process of model training through a dual-window approach and reflects the fluctuation of sample importance more precisely through variance measurement. Extensive experiments on medical image datasets demonstrate the effectiveness of our strategy over previous SOTA methods, especially at high compression rates. EVA achieves 98.27% accuracy with only 10% training data, compared to 97.20% for the full training set. None of the compared baseline methods can exceed Random at 5% selection rate, while EVA outperforms Random by 5.61%, showcasing its potential for efficient medical image analysis.
format Preprint
id arxiv_https___arxiv_org_abs_2406_05677
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification
Hong, Yuxin
Zhang, Xiao
Zhang, Xin
Zhou, Joey Tianyi
Computer Vision and Pattern Recognition
In the medical field, managing high-dimensional massive medical imaging data and performing reliable medical analysis from it is a critical challenge, especially in resource-limited environments such as remote medical facilities and mobile devices. This necessitates effective dataset compression techniques to reduce storage, transmission, and computational cost. However, existing coreset selection methods are primarily designed for natural image datasets, and exhibit doubtful effectiveness when applied to medical image datasets due to challenges such as intra-class variation and inter-class similarity. In this paper, we propose a novel coreset selection strategy termed as Evolution-aware VAriance (EVA), which captures the evolutionary process of model training through a dual-window approach and reflects the fluctuation of sample importance more precisely through variance measurement. Extensive experiments on medical image datasets demonstrate the effectiveness of our strategy over previous SOTA methods, especially at high compression rates. EVA achieves 98.27% accuracy with only 10% training data, compared to 97.20% for the full training set. None of the compared baseline methods can exceed Random at 5% selection rate, while EVA outperforms Random by 5.61%, showcasing its potential for efficient medical image analysis.
title Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2406.05677