Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Nguyen, Anh Tien, Vuong, Trinh Thi Le, Kwak, Jin Tae
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2407.07360
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929415971143680
author	Nguyen, Anh Tien Vuong, Trinh Thi Le Kwak, Jin Tae
author_facet	Nguyen, Anh Tien Vuong, Trinh Thi Le Kwak, Jin Tae
contents	Recently, vision-language pre-trained models have emerged in computational pathology. Previous works generally focused on the alignment of image-text pairs via the contrastive pre-training paradigm. Such pre-trained models have been applied to pathology image classification in zero-shot learning or transfer learning fashion. Herein, we hypothesize that the pre-trained vision-language models can be utilized for quantitative histopathology image analysis through a simple image-to-text retrieval. To this end, we propose a Text-based Quantitative and Explainable histopathology image analysis, which we call TQx. Given a set of histopathology images, we adopt a pre-trained vision-language model to retrieve a word-of-interest pool. The retrieved words are then used to quantify the histopathology images and generate understandable feature embeddings due to the direct mapping to the text description. To evaluate the proposed method, the text-based embeddings of four histopathology image datasets are utilized to perform clustering and classification tasks. The results demonstrate that TQx is able to quantify and analyze histopathology images that are comparable to the prevalent visual models in computational pathology.
format	Preprint
id	arxiv_https___arxiv_org_abs_2407_07360
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Towards a text-based quantitative and explainable histopathology image analysis Nguyen, Anh Tien Vuong, Trinh Thi Le Kwak, Jin Tae Computer Vision and Pattern Recognition Machine Learning Recently, vision-language pre-trained models have emerged in computational pathology. Previous works generally focused on the alignment of image-text pairs via the contrastive pre-training paradigm. Such pre-trained models have been applied to pathology image classification in zero-shot learning or transfer learning fashion. Herein, we hypothesize that the pre-trained vision-language models can be utilized for quantitative histopathology image analysis through a simple image-to-text retrieval. To this end, we propose a Text-based Quantitative and Explainable histopathology image analysis, which we call TQx. Given a set of histopathology images, we adopt a pre-trained vision-language model to retrieve a word-of-interest pool. The retrieved words are then used to quantify the histopathology images and generate understandable feature embeddings due to the direct mapping to the text description. To evaluate the proposed method, the text-based embeddings of four histopathology image datasets are utilized to perform clustering and classification tasks. The results demonstrate that TQx is able to quantify and analyze histopathology images that are comparable to the prevalent visual models in computational pathology.
title	Towards a text-based quantitative and explainable histopathology image analysis
topic	Computer Vision and Pattern Recognition Machine Learning
url	https://arxiv.org/abs/2407.07360

Similar Items