Bibliographic Details
Main Authors: Bergström, Herman, Yue, Zhongqi, Johansson, Fredrik D.
Format: Preprint
Published: 2025
Subjects: Computer Vision and Pattern Recognition
Online Access: https://arxiv.org/abs/2510.24385
_version_ 1866914120925708288
author Bergström, Herman
Yue, Zhongqi
Johansson, Fredrik D.
contents Medical images used to train machine learning models are often accompanied by radiology reports containing rich expert annotations. However, relying on these reports as inputs for clinical prediction requires the timely manual work of a trained radiologist. This raises a natural question: when can radiology reports be leveraged during training to improve image-only classification? Prior work is limited to evaluating pre-trained image representations by fine-tuning them to predict diagnostic labels, often extracted from the reports themselves, and ignores tasks whose labels are only weakly associated with the text. To address this gap, we conduct a systematic study of how radiology reports can be used during both pre-training and fine-tuning, across diagnostic and prognostic tasks (e.g., 12-month readmission), and under varying training set sizes. Our findings reveal that: (1) leveraging reports during pre-training is beneficial for downstream classification tasks where the label is well represented in the text; however, pre-training through explicit image-text alignment can be detrimental in settings where it is not; (2) fine-tuning with reports can lead to significant improvements and, in certain settings, can even have a larger impact than the choice of pre-training method. These results provide actionable insights into when and how to leverage privileged text data to train medical image classifiers, while highlighting gaps in current research.
format Preprint
id arxiv_https___arxiv_org_abs_2510_24385
institution arXiv
publishDate 2025
record_format arxiv
title When are radiology reports useful for training medical image classifiers?
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2510.24385