Saved in:
Bibliographic Details
Main Authors: Li, Yunxiang, Chen, Meixu, Wang, Kai, Ma, Jun, Bovik, Alan C., Zhang, You
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2305.15367
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913664843382784
author Li, Yunxiang
Chen, Meixu
Wang, Kai
Ma, Jun
Bovik, Alan C.
Zhang, You
author_facet Li, Yunxiang
Chen, Meixu
Wang, Kai
Ma, Jun
Bovik, Alan C.
Zhang, You
contents Image translation has wide applications, such as style transfer and modality conversion, usually aiming to generate images having both high degrees of realism and faithfulness. These problems remain difficult, especially when it is important to preserve content structures. Traditional image-level similarity metrics are of limited use, since the content structures of an image are high-level, and not strongly governed by pixel-wise faithfulness to an original image. To fill this gap, we introduce SAMScore, a generic content structural similarity metric for evaluating the faithfulness of image translation models. SAMScore is based on the recent high-performance Segment Anything Model (SAM), which allows content similarity comparisons with standout accuracy. We applied SAMScore on 19 image translation tasks, and found that it is able to outperform all other competitive metrics on all tasks. We envision that SAMScore will prove to be a valuable tool that will help to drive the vibrant field of image translation, by allowing for more precise evaluations of new and evolving translation models. The code is available at https://github.com/Kent0n-Li/SAMScore.
format Preprint
id arxiv_https___arxiv_org_abs_2305_15367
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle SAMScore: A Content Structural Similarity Metric for Image Translation Evaluation
Li, Yunxiang
Chen, Meixu
Wang, Kai
Ma, Jun
Bovik, Alan C.
Zhang, You
Computer Vision and Pattern Recognition
Artificial Intelligence
Image translation has wide applications, such as style transfer and modality conversion, usually aiming to generate images having both high degrees of realism and faithfulness. These problems remain difficult, especially when it is important to preserve content structures. Traditional image-level similarity metrics are of limited use, since the content structures of an image are high-level, and not strongly governed by pixel-wise faithfulness to an original image. To fill this gap, we introduce SAMScore, a generic content structural similarity metric for evaluating the faithfulness of image translation models. SAMScore is based on the recent high-performance Segment Anything Model (SAM), which allows content similarity comparisons with standout accuracy. We applied SAMScore on 19 image translation tasks, and found that it is able to outperform all other competitive metrics on all tasks. We envision that SAMScore will prove to be a valuable tool that will help to drive the vibrant field of image translation, by allowing for more precise evaluations of new and evolving translation models. The code is available at https://github.com/Kent0n-Li/SAMScore.
title SAMScore: A Content Structural Similarity Metric for Image Translation Evaluation
topic Computer Vision and Pattern Recognition
Artificial Intelligence
url https://arxiv.org/abs/2305.15367