Saved in:
Bibliographic Details
Main Authors: Di Via, Roberto, Santacesaria, Matteo, Odone, Francesca, Pastore, Vito Paolo
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.01470
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908417549926400
author Di Via, Roberto
Santacesaria, Matteo
Odone, Francesca
Pastore, Vito Paolo
author_facet Di Via, Roberto
Santacesaria, Matteo
Odone, Francesca
Pastore, Vito Paolo
contents In recent years, deep learning has emerged as a promising technique for medical image analysis. However, this application domain is likely to suffer from a limited availability of large public datasets and annotations. A common solution to these challenges in deep learning is the usage of a transfer learning framework, typically with a fine-tuning protocol, where a large-scale source dataset is used to pre-train a model, further fine-tuned on the target dataset. In this paper, we present a systematic study analyzing whether the usage of small-scale in-domain x-ray image datasets may provide any improvement for landmark detection over models pre-trained on large natural image datasets only. We focus on the multi-landmark localization task for three datasets, including chest, head, and hand x-ray images. Our results show that using in-domain source datasets brings marginal or no benefit with respect to an ImageNet out-of-domain pre-training. Our findings can provide an indication for the development of robust landmark detection systems in medical images when no large annotated dataset is available.
format Preprint
id arxiv_https___arxiv_org_abs_2403_01470
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Is in-domain data beneficial in transfer learning for landmarks detection in x-ray images?
Di Via, Roberto
Santacesaria, Matteo
Odone, Francesca
Pastore, Vito Paolo
Computer Vision and Pattern Recognition
In recent years, deep learning has emerged as a promising technique for medical image analysis. However, this application domain is likely to suffer from a limited availability of large public datasets and annotations. A common solution to these challenges in deep learning is the usage of a transfer learning framework, typically with a fine-tuning protocol, where a large-scale source dataset is used to pre-train a model, further fine-tuned on the target dataset. In this paper, we present a systematic study analyzing whether the usage of small-scale in-domain x-ray image datasets may provide any improvement for landmark detection over models pre-trained on large natural image datasets only. We focus on the multi-landmark localization task for three datasets, including chest, head, and hand x-ray images. Our results show that using in-domain source datasets brings marginal or no benefit with respect to an ImageNet out-of-domain pre-training. Our findings can provide an indication for the development of robust landmark detection systems in medical images when no large annotated dataset is available.
title Is in-domain data beneficial in transfer learning for landmarks detection in x-ray images?
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2403.01470