Saved in:
Bibliographic Details
Main Author: Herzog, Jonas
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.17614
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909206154575872
author Herzog, Jonas
author_facet Herzog, Jonas
contents Few-shot segmentation performance declines substantially when facing images from a domain different than the training domain, effectively limiting real-world use cases. To alleviate this, recently cross-domain few-shot segmentation (CD-FSS) has emerged. Works that address this task mainly attempted to learn segmentation on a source domain in a manner that generalizes across domains. Surprisingly, we can outperform these approaches while eliminating the training stage and removing their main segmentation network. We show test-time task-adaption is the key for successful CD-FSS instead. Task-adaption is achieved by appending small networks to the feature pyramid of a conventionally classification-pretrained backbone. To avoid overfitting to the few labeled samples in supervised fine-tuning, consistency across augmented views of input images serves as guidance while learning the parameters of the attached layers. Despite our self-restriction not to use any images other than the few labeled samples at test time, we achieve new state-of-the-art performance in CD-FSS, evidencing the need to rethink approaches for the task.
format Preprint
id arxiv_https___arxiv_org_abs_2402_17614
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Adapt Before Comparison: A New Perspective on Cross-Domain Few-Shot Segmentation
Herzog, Jonas
Computer Vision and Pattern Recognition
Few-shot segmentation performance declines substantially when facing images from a domain different than the training domain, effectively limiting real-world use cases. To alleviate this, recently cross-domain few-shot segmentation (CD-FSS) has emerged. Works that address this task mainly attempted to learn segmentation on a source domain in a manner that generalizes across domains. Surprisingly, we can outperform these approaches while eliminating the training stage and removing their main segmentation network. We show test-time task-adaption is the key for successful CD-FSS instead. Task-adaption is achieved by appending small networks to the feature pyramid of a conventionally classification-pretrained backbone. To avoid overfitting to the few labeled samples in supervised fine-tuning, consistency across augmented views of input images serves as guidance while learning the parameters of the attached layers. Despite our self-restriction not to use any images other than the few labeled samples at test time, we achieve new state-of-the-art performance in CD-FSS, evidencing the need to rethink approaches for the task.
title Adapt Before Comparison: A New Perspective on Cross-Domain Few-Shot Segmentation
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2402.17614