Saved in:
Bibliographic Details
Main Authors: Van, Minh-Hao, Verma, Prateek, Wu, Xintao
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.14162
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913238712582144
author Van, Minh-Hao
Verma, Prateek
Wu, Xintao
author_facet Van, Minh-Hao
Verma, Prateek
Wu, Xintao
contents Recently, large language models (LLMs) have taken the spotlight in natural language processing. Further, integrating LLMs with vision enables the users to explore emergent abilities with multimodal data. Visual language models (VLMs), such as LLaVA, Flamingo, or CLIP, have demonstrated impressive performance on various visio-linguistic tasks. Consequently, there are enormous applications of large models that could be potentially used in the biomedical imaging field. Along that direction, there is a lack of related work to show the ability of large models to diagnose the diseases. In this work, we study the zero-shot and few-shot robustness of VLMs on the medical imaging analysis tasks. Our comprehensive experiments demonstrate the effectiveness of VLMs in analyzing biomedical images such as brain MRIs, microscopic images of blood cells, and chest X-rays.
format Preprint
id arxiv_https___arxiv_org_abs_2402_14162
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle On Large Visual Language Models for Medical Imaging Analysis: An Empirical Study
Van, Minh-Hao
Verma, Prateek
Wu, Xintao
Computer Vision and Pattern Recognition
Artificial Intelligence
Recently, large language models (LLMs) have taken the spotlight in natural language processing. Further, integrating LLMs with vision enables the users to explore emergent abilities with multimodal data. Visual language models (VLMs), such as LLaVA, Flamingo, or CLIP, have demonstrated impressive performance on various visio-linguistic tasks. Consequently, there are enormous applications of large models that could be potentially used in the biomedical imaging field. Along that direction, there is a lack of related work to show the ability of large models to diagnose the diseases. In this work, we study the zero-shot and few-shot robustness of VLMs on the medical imaging analysis tasks. Our comprehensive experiments demonstrate the effectiveness of VLMs in analyzing biomedical images such as brain MRIs, microscopic images of blood cells, and chest X-rays.
title On Large Visual Language Models for Medical Imaging Analysis: An Empirical Study
topic Computer Vision and Pattern Recognition
Artificial Intelligence
url https://arxiv.org/abs/2402.14162