Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Lu, Yujie, Li, Jingwen, Ju, Sibo, Su, Yanzhou, yao, he, Liu, Yisong, Zhu, Min, Cheng, Junlong
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.19213
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912917918580736
author	Lu, Yujie Li, Jingwen Ju, Sibo Su, Yanzhou yao, he Liu, Yisong Zhu, Min Cheng, Junlong
author_facet	Lu, Yujie Li, Jingwen Ju, Sibo Su, Yanzhou yao, he Liu, Yisong Zhu, Min Cheng, Junlong
contents	Medical image segmentation is vital for clinical diagnosis and quantitative analysis, yet remains challenging due to the heterogeneity of imaging modalities and the high cost of pixel-level annotations. Although general interactive segmentation models like SAM have achieved remarkable progress, their transfer to medical imaging still faces two key bottlenecks: (i) the lack of adaptive mechanisms for modality- and anatomy-specific tasks, which limits generalization in out-of-distribution medical scenarios; and (ii) current medical adaptation methods fine-tune on large, heterogeneous datasets without selection, leading to noisy supervision, higher cost, and negative transfer. To address these issues, we propose SegMoTE, an efficient and adaptive framework for medical image segmentation. SegMoTE preserves SAM's original prompt interface, efficient inference, and zero-shot generalization while introducing only a small number of learnable parameters to dynamically adapt across modalities and tasks. In addition, we design a progressive prompt tokenization mechanism that enables fully automatic segmentation, significantly reducing annotation dependence. Trained on MedSeg-HQ, a curated dataset less than 1% of existing large-scale datasets, SegMoTE achieves SOTA performance across diverse imaging modalities and anatomical tasks. It represents the first efficient, robust, and scalable adaptation of general segmentation models to the medical domain under extremely low annotation cost, advancing the practical deployment of foundation vision models in clinical applications.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_19213
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	SegMoTE: Token-Level Mixture of Experts for Medical Image Segmentation Lu, Yujie Li, Jingwen Ju, Sibo Su, Yanzhou yao, he Liu, Yisong Zhu, Min Cheng, Junlong Computer Vision and Pattern Recognition Medical image segmentation is vital for clinical diagnosis and quantitative analysis, yet remains challenging due to the heterogeneity of imaging modalities and the high cost of pixel-level annotations. Although general interactive segmentation models like SAM have achieved remarkable progress, their transfer to medical imaging still faces two key bottlenecks: (i) the lack of adaptive mechanisms for modality- and anatomy-specific tasks, which limits generalization in out-of-distribution medical scenarios; and (ii) current medical adaptation methods fine-tune on large, heterogeneous datasets without selection, leading to noisy supervision, higher cost, and negative transfer. To address these issues, we propose SegMoTE, an efficient and adaptive framework for medical image segmentation. SegMoTE preserves SAM's original prompt interface, efficient inference, and zero-shot generalization while introducing only a small number of learnable parameters to dynamically adapt across modalities and tasks. In addition, we design a progressive prompt tokenization mechanism that enables fully automatic segmentation, significantly reducing annotation dependence. Trained on MedSeg-HQ, a curated dataset less than 1% of existing large-scale datasets, SegMoTE achieves SOTA performance across diverse imaging modalities and anatomical tasks. It represents the first efficient, robust, and scalable adaptation of general segmentation models to the medical domain under extremely low annotation cost, advancing the practical deployment of foundation vision models in clinical applications.
title	SegMoTE: Token-Level Mixture of Experts for Medical Image Segmentation
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2602.19213

Similar Items