Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Herzog, Jonas
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2402.17614
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909206154575872
author	Herzog, Jonas
author_facet	Herzog, Jonas
contents	Few-shot segmentation performance declines substantially when facing images from a domain different than the training domain, effectively limiting real-world use cases. To alleviate this, recently cross-domain few-shot segmentation (CD-FSS) has emerged. Works that address this task mainly attempted to learn segmentation on a source domain in a manner that generalizes across domains. Surprisingly, we can outperform these approaches while eliminating the training stage and removing their main segmentation network. We show test-time task-adaption is the key for successful CD-FSS instead. Task-adaption is achieved by appending small networks to the feature pyramid of a conventionally classification-pretrained backbone. To avoid overfitting to the few labeled samples in supervised fine-tuning, consistency across augmented views of input images serves as guidance while learning the parameters of the attached layers. Despite our self-restriction not to use any images other than the few labeled samples at test time, we achieve new state-of-the-art performance in CD-FSS, evidencing the need to rethink approaches for the task.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_17614
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Adapt Before Comparison: A New Perspective on Cross-Domain Few-Shot Segmentation Herzog, Jonas Computer Vision and Pattern Recognition Few-shot segmentation performance declines substantially when facing images from a domain different than the training domain, effectively limiting real-world use cases. To alleviate this, recently cross-domain few-shot segmentation (CD-FSS) has emerged. Works that address this task mainly attempted to learn segmentation on a source domain in a manner that generalizes across domains. Surprisingly, we can outperform these approaches while eliminating the training stage and removing their main segmentation network. We show test-time task-adaption is the key for successful CD-FSS instead. Task-adaption is achieved by appending small networks to the feature pyramid of a conventionally classification-pretrained backbone. To avoid overfitting to the few labeled samples in supervised fine-tuning, consistency across augmented views of input images serves as guidance while learning the parameters of the attached layers. Despite our self-restriction not to use any images other than the few labeled samples at test time, we achieve new state-of-the-art performance in CD-FSS, evidencing the need to rethink approaches for the task.
title	Adapt Before Comparison: A New Perspective on Cross-Domain Few-Shot Segmentation
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2402.17614

Similar Items