Saved in:
Bibliographic Details
Main Authors: Mathian, E., Liu, H., Fernandez-Cuesta, L., Samaras, D., Foll, M., Chen, L.
Format: Preprint
Published: 2022
Subjects:
Online Access:https://arxiv.org/abs/2208.03486
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910901029830656
author Mathian, E.
Liu, H.
Fernandez-Cuesta, L.
Samaras, D.
Foll, M.
Chen, L.
author_facet Mathian, E.
Liu, H.
Fernandez-Cuesta, L.
Samaras, D.
Foll, M.
Chen, L.
contents Unsupervised anomaly detection and localization is a crucial task as it is impossible to collect and label all possible anomalies. Many studies have emphasized the importance of integrating local and global information to achieve accurate segmentation of anomalies. To this end, there has been a growing interest in Transformer, which allows modeling long-range content interactions. However, global interactions through self attention are generally too expensive for most image scales. In this study, we introduce HaloAE, the first auto-encoder based on a local 2D version of Transformer with HaloNet. With HaloAE, we have created a hybrid model that combines convolution and local 2D block-wise self-attention layers and jointly performs anomaly detection and segmentation through a single model. We achieved competitive results on the MVTec dataset, suggesting that vision models incorporating Transformer could benefit from a local computation of the self-attention operation, and pave the way for other applications.
format Preprint
id arxiv_https___arxiv_org_abs_2208_03486
institution arXiv
publishDate 2022
record_format arxiv
spellingShingle HaloAE: An HaloNet based Local Transformer Auto-Encoder for Anomaly Detection and Localization
Mathian, E.
Liu, H.
Fernandez-Cuesta, L.
Samaras, D.
Foll, M.
Chen, L.
Computer Vision and Pattern Recognition
Artificial Intelligence
Unsupervised anomaly detection and localization is a crucial task as it is impossible to collect and label all possible anomalies. Many studies have emphasized the importance of integrating local and global information to achieve accurate segmentation of anomalies. To this end, there has been a growing interest in Transformer, which allows modeling long-range content interactions. However, global interactions through self attention are generally too expensive for most image scales. In this study, we introduce HaloAE, the first auto-encoder based on a local 2D version of Transformer with HaloNet. With HaloAE, we have created a hybrid model that combines convolution and local 2D block-wise self-attention layers and jointly performs anomaly detection and segmentation through a single model. We achieved competitive results on the MVTec dataset, suggesting that vision models incorporating Transformer could benefit from a local computation of the self-attention operation, and pave the way for other applications.
title HaloAE: An HaloNet based Local Transformer Auto-Encoder for Anomaly Detection and Localization
topic Computer Vision and Pattern Recognition
Artificial Intelligence
url https://arxiv.org/abs/2208.03486