Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Lin, Yuqi, Li, Hengjia, Shao, Wenqi, Yang, Zheng, Zhao, Jun, He, Xiaofei, Luo, Ping, Zhang, Kaipeng
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2502.06756
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913737524379648
author	Lin, Yuqi Li, Hengjia Shao, Wenqi Yang, Zheng Zhao, Jun He, Xiaofei Luo, Ping Zhang, Kaipeng
author_facet	Lin, Yuqi Li, Hengjia Shao, Wenqi Yang, Zheng Zhao, Jun He, Xiaofei Luo, Ping Zhang, Kaipeng
contents	In this paper, we explore a principal way to enhance the quality of widely pre-existing coarse masks, enabling them to serve as reliable training data for segmentation models to reduce the annotation cost. In contrast to prior refinement techniques that are tailored to specific models or tasks in a close-world manner, we propose SAMRefiner, a universal and efficient approach by adapting SAM to the mask refinement task. The core technique of our model is the noise-tolerant prompting scheme. Specifically, we introduce a multi-prompt excavation strategy to mine diverse input prompts for SAM (i.e., distance-guided points, context-aware elastic bounding boxes, and Gaussian-style masks) from initial coarse masks. These prompts can collaborate with each other to mitigate the effect of defects in coarse masks. In particular, considering the difficulty of SAM to handle the multi-object case in semantic segmentation, we introduce a split-then-merge (STM) pipeline. Additionally, we extend our method to SAMRefiner++ by introducing an additional IoU adaption step to further boost the performance of the generic SAMRefiner on the target dataset. This step is self-boosted and requires no additional annotation. The proposed framework is versatile and can flexibly cooperate with existing segmentation methods. We evaluate our mask framework on a wide range of benchmarks under different settings, demonstrating better accuracy and efficiency. SAMRefiner holds significant potential to expedite the evolution of refinement tools. Our code is available at https://github.com/linyq2117/SAMRefiner.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_06756
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement Lin, Yuqi Li, Hengjia Shao, Wenqi Yang, Zheng Zhao, Jun He, Xiaofei Luo, Ping Zhang, Kaipeng Computer Vision and Pattern Recognition In this paper, we explore a principal way to enhance the quality of widely pre-existing coarse masks, enabling them to serve as reliable training data for segmentation models to reduce the annotation cost. In contrast to prior refinement techniques that are tailored to specific models or tasks in a close-world manner, we propose SAMRefiner, a universal and efficient approach by adapting SAM to the mask refinement task. The core technique of our model is the noise-tolerant prompting scheme. Specifically, we introduce a multi-prompt excavation strategy to mine diverse input prompts for SAM (i.e., distance-guided points, context-aware elastic bounding boxes, and Gaussian-style masks) from initial coarse masks. These prompts can collaborate with each other to mitigate the effect of defects in coarse masks. In particular, considering the difficulty of SAM to handle the multi-object case in semantic segmentation, we introduce a split-then-merge (STM) pipeline. Additionally, we extend our method to SAMRefiner++ by introducing an additional IoU adaption step to further boost the performance of the generic SAMRefiner on the target dataset. This step is self-boosted and requires no additional annotation. The proposed framework is versatile and can flexibly cooperate with existing segmentation methods. We evaluate our mask framework on a wide range of benchmarks under different settings, demonstrating better accuracy and efficiency. SAMRefiner holds significant potential to expedite the evolution of refinement tools. Our code is available at https://github.com/linyq2117/SAMRefiner.
title	SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2502.06756

Similar Items