Saved in:
Bibliographic Details
Main Author: Singh, Sonit
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2408.11344
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911997502685184
author Singh, Sonit
author_facet Singh, Sonit
contents Recent developments in the field of Natural Language Processing, especially language models such as the transformer have brought state-of-the-art results in language understanding and language generation. In this work, we investigate the use of the transformer model for radiology report generation from chest X-rays. We also highlight limitations in evaluating radiology report generation using only the standard language generation metrics. We then applied a transformer based radiology report generation architecture, and also compare the performance of a transformer based decoder with the recurrence based decoder. Experiments were performed using the IU-CXR dataset, showing superior results to its LSTM counterpart and being significantly faster. Finally, we identify the need of evaluating radiology report generation system using both language generation metrics and classification metrics, which helps to provide robust measure of generated reports in terms of their coherence and diagnostic value.
format Preprint
id arxiv_https___arxiv_org_abs_2408_11344
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Clinical Context-aware Radiology Report Generation from Medical Images using Transformers
Singh, Sonit
Computation and Language
Machine Learning
Recent developments in the field of Natural Language Processing, especially language models such as the transformer have brought state-of-the-art results in language understanding and language generation. In this work, we investigate the use of the transformer model for radiology report generation from chest X-rays. We also highlight limitations in evaluating radiology report generation using only the standard language generation metrics. We then applied a transformer based radiology report generation architecture, and also compare the performance of a transformer based decoder with the recurrence based decoder. Experiments were performed using the IU-CXR dataset, showing superior results to its LSTM counterpart and being significantly faster. Finally, we identify the need of evaluating radiology report generation system using both language generation metrics and classification metrics, which helps to provide robust measure of generated reports in terms of their coherence and diagnostic value.
title Clinical Context-aware Radiology Report Generation from Medical Images using Transformers
topic Computation and Language
Machine Learning
url https://arxiv.org/abs/2408.11344