Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Kim, Sekeun, Ren, Hui, Guo, Peng, Ali, Abder-Rahman, Zhang, Patrick, Kim, Kyungsang, Li, Xiang, Li, Quanzheng
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2404.05916
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866909163828805632
author Kim, Sekeun
Ren, Hui
Guo, Peng
Ali, Abder-Rahman
Zhang, Patrick
Kim, Kyungsang
Li, Xiang
Li, Quanzheng
author_facet Kim, Sekeun
Ren, Hui
Guo, Peng
Ali, Abder-Rahman
Zhang, Patrick
Kim, Kyungsang
Li, Xiang
Li, Quanzheng
contents Echocardiography segmentation for cardiac analysis is time-consuming and resource-intensive due to the variability in image quality and the necessity to process scans from various standard views. While current automated segmentation methods in echocardiography show promising performance, they are trained on specific scan views to analyze corresponding data. However, this solution has a limitation as the number of required models increases with the number of standard views. To address this, in this paper, we present a prompt-driven universal method for view-agnostic echocardiography analysis. Considering the domain shift between standard views, we first introduce a method called prompt matching, aimed at learning prompts specific to different views by matching prompts and querying input embeddings using a pre-trained vision model. Then, we utilized a pre-trained medical language model to align textual information with pixel data for accurate segmentation. Extensive experiments on three standard views showed that our approach significantly outperforms the state-of-the-art universal methods and achieves comparable or even better performances over the segmentation model trained and tested on same views.
format Preprint
id arxiv_https___arxiv_org_abs_2404_05916
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Prompt-driven Universal Model for View-Agnostic Echocardiography Analysis
Kim, Sekeun
Ren, Hui
Guo, Peng
Ali, Abder-Rahman
Zhang, Patrick
Kim, Kyungsang
Li, Xiang
Li, Quanzheng
Computer Vision and Pattern Recognition
Echocardiography segmentation for cardiac analysis is time-consuming and resource-intensive due to the variability in image quality and the necessity to process scans from various standard views. While current automated segmentation methods in echocardiography show promising performance, they are trained on specific scan views to analyze corresponding data. However, this solution has a limitation as the number of required models increases with the number of standard views. To address this, in this paper, we present a prompt-driven universal method for view-agnostic echocardiography analysis. Considering the domain shift between standard views, we first introduce a method called prompt matching, aimed at learning prompts specific to different views by matching prompts and querying input embeddings using a pre-trained vision model. Then, we utilized a pre-trained medical language model to align textual information with pixel data for accurate segmentation. Extensive experiments on three standard views showed that our approach significantly outperforms the state-of-the-art universal methods and achieves comparable or even better performances over the segmentation model trained and tested on same views.
title Prompt-driven Universal Model for View-Agnostic Echocardiography Analysis
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2404.05916