Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Tan, Jing Wei, Kim, SeungKyu, Kim, Eunsu, Lee, Sung Hak, Ahn, Sangjeong, Jeong, Won-Ki
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2409.15574
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929512378269696
author	Tan, Jing Wei Kim, SeungKyu Kim, Eunsu Lee, Sung Hak Ahn, Sangjeong Jeong, Won-Ki
author_facet	Tan, Jing Wei Kim, SeungKyu Kim, Eunsu Lee, Sung Hak Ahn, Sangjeong Jeong, Won-Ki
contents	Vision language models (VLM) have achieved success in both natural language comprehension and image recognition tasks. However, their use in pathology report generation for whole slide images (WSIs) is still limited due to the huge size of multi-scale WSIs and the high cost of WSI annotation. Moreover, in most of the existing research on pathology report generation, sufficient validation regarding clinical efficacy has not been conducted. Herein, we propose a novel Patient-level Multi-organ Pathology Report Generation (PMPRG) model, which utilizes the multi-scale WSI features from our proposed multi-scale regional vision transformer (MR-ViT) model and their real pathology reports to guide VLM training for accurate pathology report generation. The model then automatically generates a report based on the provided key features attended regional features. We assessed our model using a WSI dataset consisting of multiple organs, including the colon and kidney. Our model achieved a METEOR score of 0.68, demonstrating the effectiveness of our approach. This model allows pathologists to efficiently generate pathology reports for patients, regardless of the number of WSIs involved.
format	Preprint
id	arxiv_https___arxiv_org_abs_2409_15574
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Clinical-grade Multi-Organ Pathology Report Generation for Multi-scale Whole Slide Images via a Semantically Guided Medical Text Foundation Model Tan, Jing Wei Kim, SeungKyu Kim, Eunsu Lee, Sung Hak Ahn, Sangjeong Jeong, Won-Ki Computer Vision and Pattern Recognition Vision language models (VLM) have achieved success in both natural language comprehension and image recognition tasks. However, their use in pathology report generation for whole slide images (WSIs) is still limited due to the huge size of multi-scale WSIs and the high cost of WSI annotation. Moreover, in most of the existing research on pathology report generation, sufficient validation regarding clinical efficacy has not been conducted. Herein, we propose a novel Patient-level Multi-organ Pathology Report Generation (PMPRG) model, which utilizes the multi-scale WSI features from our proposed multi-scale regional vision transformer (MR-ViT) model and their real pathology reports to guide VLM training for accurate pathology report generation. The model then automatically generates a report based on the provided key features attended regional features. We assessed our model using a WSI dataset consisting of multiple organs, including the colon and kidney. Our model achieved a METEOR score of 0.68, demonstrating the effectiveness of our approach. This model allows pathologists to efficiently generate pathology reports for patients, regardless of the number of WSIs involved.
title	Clinical-grade Multi-Organ Pathology Report Generation for Multi-scale Whole Slide Images via a Semantically Guided Medical Text Foundation Model
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2409.15574

Similar Items