Saved in:
Bibliographic Details
Main Authors: Liang, Peixian, Li, Songhao, Koga, Shunsuke, Li, Yutong, Alipour, Zahra, Tang, Yucheng, Xu, Daguang, Huang, Zhi
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.16451
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917219374465024
author Liang, Peixian
Li, Songhao
Koga, Shunsuke
Li, Yutong
Alipour, Zahra
Tang, Yucheng
Xu, Daguang
Huang, Zhi
author_facet Liang, Peixian
Li, Songhao
Koga, Shunsuke
Li, Yutong
Alipour, Zahra
Tang, Yucheng
Xu, Daguang
Huang, Zhi
contents Accurate semantic segmentation for histopathology image is crucial for quantitative tissue analysis and downstream clinical modeling. Recent segmentation foundation models have improved generalization through large-scale pretraining, yet remain poorly aligned with pathology because they treat segmentation as a static visual prediction task. Here we present VISTA-PATH, an interactive, class-aware pathology segmentation foundation model designed to resolve heterogeneous structures, incorporate expert feedback, and produce pixel-level segmentation that are directly meaningful for clinical interpretation. VISTA-PATH jointly conditions segmentation on visual context, semantic tissue descriptions, and optional expert-provided spatial prompts, enabling precise multi-class segmentation across heterogeneous pathology images. To support this paradigm, we curate VISTA-PATH Data, a large-scale pathology segmentation corpus comprising over 1.6 million image-mask-text triplets spanning 9 organs and 93 tissue classes. Across extensive held-out and external benchmarks, VISTA-PATH consistently outperforms existing segmentation foundation models. Importantly, VISTA-PATH supports dynamic human-in-the-loop refinement by propagating sparse, patch-level bounding-box annotation feedback into whole-slide segmentation. Finally, we show that the high-fidelity, class-aware segmentation produced by VISTA-PATH is a preferred model for computational pathology. It improve tissue microenvironment analysis through proposed Tumor Interaction Score (TIS), which exhibits strong and significant associations with patient survival. Together, these results establish VISTA-PATH as a foundation model that elevates pathology image segmentation from a static prediction to an interactive and clinically grounded representation for digital pathology. Source code and demo can be found at https://github.com/zhihuanglab/VISTA-PATH.
format Preprint
id arxiv_https___arxiv_org_abs_2601_16451
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle VISTA-PATH: An interactive foundation model for pathology image segmentation and quantitative analysis in computational pathology
Liang, Peixian
Li, Songhao
Koga, Shunsuke
Li, Yutong
Alipour, Zahra
Tang, Yucheng
Xu, Daguang
Huang, Zhi
Computer Vision and Pattern Recognition
Accurate semantic segmentation for histopathology image is crucial for quantitative tissue analysis and downstream clinical modeling. Recent segmentation foundation models have improved generalization through large-scale pretraining, yet remain poorly aligned with pathology because they treat segmentation as a static visual prediction task. Here we present VISTA-PATH, an interactive, class-aware pathology segmentation foundation model designed to resolve heterogeneous structures, incorporate expert feedback, and produce pixel-level segmentation that are directly meaningful for clinical interpretation. VISTA-PATH jointly conditions segmentation on visual context, semantic tissue descriptions, and optional expert-provided spatial prompts, enabling precise multi-class segmentation across heterogeneous pathology images. To support this paradigm, we curate VISTA-PATH Data, a large-scale pathology segmentation corpus comprising over 1.6 million image-mask-text triplets spanning 9 organs and 93 tissue classes. Across extensive held-out and external benchmarks, VISTA-PATH consistently outperforms existing segmentation foundation models. Importantly, VISTA-PATH supports dynamic human-in-the-loop refinement by propagating sparse, patch-level bounding-box annotation feedback into whole-slide segmentation. Finally, we show that the high-fidelity, class-aware segmentation produced by VISTA-PATH is a preferred model for computational pathology. It improve tissue microenvironment analysis through proposed Tumor Interaction Score (TIS), which exhibits strong and significant associations with patient survival. Together, these results establish VISTA-PATH as a foundation model that elevates pathology image segmentation from a static prediction to an interactive and clinically grounded representation for digital pathology. Source code and demo can be found at https://github.com/zhihuanglab/VISTA-PATH.
title VISTA-PATH: An interactive foundation model for pathology image segmentation and quantitative analysis in computational pathology
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2601.16451