Salvato in:
Dettagli Bibliografici
Autori principali: Jiao, Guanlong, Zhang, Chenyangguang, Yin, Haonan, Mo, Yu, Huang, Biqing, Pan, Hui, Luo, Yi, Liu, Jingxian
Natura: Preprint
Pubblicazione: 2024
Soggetti:
Accesso online:https://arxiv.org/abs/2404.13701
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
_version_ 1866912517585895424
author Jiao, Guanlong
Zhang, Chenyangguang
Yin, Haonan
Mo, Yu
Huang, Biqing
Pan, Hui
Luo, Yi
Liu, Jingxian
author_facet Jiao, Guanlong
Zhang, Chenyangguang
Yin, Haonan
Mo, Yu
Huang, Biqing
Pan, Hui
Luo, Yi
Liu, Jingxian
contents Domain generalized semantic segmentation is an essential computer vision task, for which models only leverage source data to learn the capability of generalized semantic segmentation towards the unseen target domains. Previous works typically address this challenge by global style randomization or feature regularization. In this paper, we argue that given the observation that different local semantic regions perform different visual characteristics from the source domain to the target domain, methods focusing on global operations are hard to capture such regional discrepancies, thus failing to construct domain-invariant representations with the consistency from local to global level. Therefore, we propose the Semantic-Rearrangement-based Multi-Level Alignment (SRMA) to overcome this problem. SRMA first incorporates a Semantic Rearrangement Module (SRM), which conducts semantic region randomization to enhance the diversity of the source domain sufficiently. A Multi-Level Alignment module (MLA) is subsequently proposed with the help of such diversity to establish the global-regional-local consistent domain-invariant representations. By aligning features across randomized samples with domain-neutral knowledge at multiple levels, SRMA provides a more robust way to handle the source-target domain gap. Extensive experiments demonstrate the superiority of SRMA over the current state-of-the-art works on various benchmarks.
format Preprint
id arxiv_https___arxiv_org_abs_2404_13701
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Semantic-Rearrangement-Based Multi-Level Alignment for Domain Generalized Segmentation
Jiao, Guanlong
Zhang, Chenyangguang
Yin, Haonan
Mo, Yu
Huang, Biqing
Pan, Hui
Luo, Yi
Liu, Jingxian
Computer Vision and Pattern Recognition
Machine Learning
Domain generalized semantic segmentation is an essential computer vision task, for which models only leverage source data to learn the capability of generalized semantic segmentation towards the unseen target domains. Previous works typically address this challenge by global style randomization or feature regularization. In this paper, we argue that given the observation that different local semantic regions perform different visual characteristics from the source domain to the target domain, methods focusing on global operations are hard to capture such regional discrepancies, thus failing to construct domain-invariant representations with the consistency from local to global level. Therefore, we propose the Semantic-Rearrangement-based Multi-Level Alignment (SRMA) to overcome this problem. SRMA first incorporates a Semantic Rearrangement Module (SRM), which conducts semantic region randomization to enhance the diversity of the source domain sufficiently. A Multi-Level Alignment module (MLA) is subsequently proposed with the help of such diversity to establish the global-regional-local consistent domain-invariant representations. By aligning features across randomized samples with domain-neutral knowledge at multiple levels, SRMA provides a more robust way to handle the source-target domain gap. Extensive experiments demonstrate the superiority of SRMA over the current state-of-the-art works on various benchmarks.
title Semantic-Rearrangement-Based Multi-Level Alignment for Domain Generalized Segmentation
topic Computer Vision and Pattern Recognition
Machine Learning
url https://arxiv.org/abs/2404.13701