Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Wang, Xiao, Wang, Fuling, Wang, Haowen, Jiang, Bo, Li, Chuanfu, Wang, Yaowei, Tian, Yonghong, Tang, Jin
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Image and Video Processing Artificial Intelligence Computer Vision and Pattern Recognition
Online-Zugang:	https://arxiv.org/abs/2501.03458
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866913637860376576
author	Wang, Xiao Wang, Fuling Wang, Haowen Jiang, Bo Li, Chuanfu Wang, Yaowei Tian, Yonghong Tang, Jin
author_facet	Wang, Xiao Wang, Fuling Wang, Haowen Jiang, Bo Li, Chuanfu Wang, Yaowei Tian, Yonghong Tang, Jin
contents	X-ray image based medical report generation achieves significant progress in recent years with the help of the large language model, however, these models have not fully exploited the effective information in visual image regions, resulting in reports that are linguistically sound but insufficient in describing key diseases. In this paper, we propose a novel associative memory-enhanced X-ray report generation model that effectively mimics the process of professional doctors writing medical reports. It considers both the mining of global and local visual information and associates historical report information to better complete the writing of the current report. Specifically, given an X-ray image, we first utilize a classification model along with its activation maps to accomplish the mining of visual regions highly associated with diseases and the learning of disease query tokens. Then, we employ a visual Hopfield network to establish memory associations for disease-related tokens, and a report Hopfield network to retrieve report memory information. This process facilitates the generation of high-quality reports based on a large language model and achieves state-of-the-art performance on multiple benchmark datasets, including the IU X-ray, MIMIC-CXR, and Chexpert Plus. The source code of this work is released on \url{https://github.com/Event-AHU/Medical_Image_Analysis}.
format	Preprint
id	arxiv_https___arxiv_org_abs_2501_03458
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation Wang, Xiao Wang, Fuling Wang, Haowen Jiang, Bo Li, Chuanfu Wang, Yaowei Tian, Yonghong Tang, Jin Image and Video Processing Artificial Intelligence Computer Vision and Pattern Recognition X-ray image based medical report generation achieves significant progress in recent years with the help of the large language model, however, these models have not fully exploited the effective information in visual image regions, resulting in reports that are linguistically sound but insufficient in describing key diseases. In this paper, we propose a novel associative memory-enhanced X-ray report generation model that effectively mimics the process of professional doctors writing medical reports. It considers both the mining of global and local visual information and associates historical report information to better complete the writing of the current report. Specifically, given an X-ray image, we first utilize a classification model along with its activation maps to accomplish the mining of visual regions highly associated with diseases and the learning of disease query tokens. Then, we employ a visual Hopfield network to establish memory associations for disease-related tokens, and a report Hopfield network to retrieve report memory information. This process facilitates the generation of high-quality reports based on a large language model and achieves state-of-the-art performance on multiple benchmark datasets, including the IU X-ray, MIMIC-CXR, and Chexpert Plus. The source code of this work is released on \url{https://github.com/Event-AHU/Medical_Image_Analysis}.
title	Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation
topic	Image and Video Processing Artificial Intelligence Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2501.03458

Ähnliche Einträge