Saved in:
Bibliographic Details
Main Authors: Sun, Zhongbin, Li, Xiaolong, Li, Yiran, Ma, Yue
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2409.05378
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913493996797952
author Sun, Zhongbin
Li, Xiaolong
Li, Yiran
Ma, Yue
author_facet Sun, Zhongbin
Li, Xiaolong
Li, Yiran
Ma, Yue
contents Unsupervised anomaly detection is a challenging computer vision task, in which 2D-based anomaly detection methods have been extensively studied. However, multimodal anomaly detection based on RGB images and 3D point clouds requires further investigation. The existing methods are mainly inspired by memory bank based methods commonly used in 2D-based anomaly detection, which may cost extra memory for storing mutimodal features. In present study, a novel memoryless method MDSS is proposed for multimodal anomaly detection, which employs a light-weighted student-teacher network and a signed distance function to learn from RGB images and 3D point clouds respectively, and complements the anomaly information from the two modalities. Specifically, a student-teacher network is trained with normal RGB images and masks generated from point clouds by a dynamic loss, and the anomaly score map could be obtained from the discrepancy between the output of student and teacher. Furthermore, the signed distance function learns from normal point clouds to predict the signed distances between points and surface, and the obtained signed distances are used to generate anomaly score map. Subsequently, the anomaly score maps are aligned to generate the final anomaly score map for detection. The experimental results indicate that MDSS is comparable but more stable than the SOTA memory bank based method Shape-guided, and furthermore performs better than other baseline methods.
format Preprint
id arxiv_https___arxiv_org_abs_2409_05378
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Memoryless Multimodal Anomaly Detection via Student-Teacher Network and Signed Distance Learning
Sun, Zhongbin
Li, Xiaolong
Li, Yiran
Ma, Yue
Computer Vision and Pattern Recognition
Unsupervised anomaly detection is a challenging computer vision task, in which 2D-based anomaly detection methods have been extensively studied. However, multimodal anomaly detection based on RGB images and 3D point clouds requires further investigation. The existing methods are mainly inspired by memory bank based methods commonly used in 2D-based anomaly detection, which may cost extra memory for storing mutimodal features. In present study, a novel memoryless method MDSS is proposed for multimodal anomaly detection, which employs a light-weighted student-teacher network and a signed distance function to learn from RGB images and 3D point clouds respectively, and complements the anomaly information from the two modalities. Specifically, a student-teacher network is trained with normal RGB images and masks generated from point clouds by a dynamic loss, and the anomaly score map could be obtained from the discrepancy between the output of student and teacher. Furthermore, the signed distance function learns from normal point clouds to predict the signed distances between points and surface, and the obtained signed distances are used to generate anomaly score map. Subsequently, the anomaly score maps are aligned to generate the final anomaly score map for detection. The experimental results indicate that MDSS is comparable but more stable than the SOTA memory bank based method Shape-guided, and furthermore performs better than other baseline methods.
title Memoryless Multimodal Anomaly Detection via Student-Teacher Network and Signed Distance Learning
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2409.05378