Saved in:
Bibliographic Details
Main Authors: Tarasiou, Michail, Guler, Riza Alp, Zafeiriou, Stefanos
Format: Preprint
Published: 2021
Subjects:
Online Access:https://arxiv.org/abs/2104.04310
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911770273120256
author Tarasiou, Michail
Guler, Riza Alp
Zafeiriou, Stefanos
author_facet Tarasiou, Michail
Guler, Riza Alp
Zafeiriou, Stefanos
contents In this paper, we propose a fully supervised pre-training scheme based on contrastive learning particularly tailored to dense classification tasks. The proposed Context-Self Contrastive Loss (CSCL) learns an embedding space that makes semantic boundaries pop-up by use of a similarity metric between every location in a training sample and its local context. For crop type semantic segmentation from Satellite Image Time Series (SITS) we find performance at parcel boundaries to be a critical bottleneck and explain how CSCL tackles the underlying cause of that problem, improving the state-of-the-art performance in this task. Additionally, using images from the Sentinel-2 (S2) satellite missions we compile the largest, to our knowledge, SITS dataset densely annotated by crop type and parcel identities, which we make publicly available together with the data generation pipeline. Using that data we find CSCL, even with minimal pre-training, to improve all respective baselines and present a process for semantic segmentation at super-resolution for obtaining crop classes at a more granular level. The code and instructions to download the data can be found in https://github.com/michaeltrs/DeepSatModels.
format Preprint
id arxiv_https___arxiv_org_abs_2104_04310
institution arXiv
publishDate 2021
record_format arxiv
spellingShingle Context-self contrastive pretraining for crop type semantic segmentation
Tarasiou, Michail
Guler, Riza Alp
Zafeiriou, Stefanos
Computer Vision and Pattern Recognition
Machine Learning
In this paper, we propose a fully supervised pre-training scheme based on contrastive learning particularly tailored to dense classification tasks. The proposed Context-Self Contrastive Loss (CSCL) learns an embedding space that makes semantic boundaries pop-up by use of a similarity metric between every location in a training sample and its local context. For crop type semantic segmentation from Satellite Image Time Series (SITS) we find performance at parcel boundaries to be a critical bottleneck and explain how CSCL tackles the underlying cause of that problem, improving the state-of-the-art performance in this task. Additionally, using images from the Sentinel-2 (S2) satellite missions we compile the largest, to our knowledge, SITS dataset densely annotated by crop type and parcel identities, which we make publicly available together with the data generation pipeline. Using that data we find CSCL, even with minimal pre-training, to improve all respective baselines and present a process for semantic segmentation at super-resolution for obtaining crop classes at a more granular level. The code and instructions to download the data can be found in https://github.com/michaeltrs/DeepSatModels.
title Context-self contrastive pretraining for crop type semantic segmentation
topic Computer Vision and Pattern Recognition
Machine Learning
url https://arxiv.org/abs/2104.04310