Saved in:
Bibliographic Details
Main Authors: Gao, Yuan, Wang, Xiang, He, Xiangnan, Liu, Zhenguang, Feng, Huamin, Zhang, Yongdong
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2401.14155
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909082868252672
author Gao, Yuan
Wang, Xiang
He, Xiangnan
Liu, Zhenguang
Feng, Huamin
Zhang, Yongdong
author_facet Gao, Yuan
Wang, Xiang
He, Xiangnan
Liu, Zhenguang
Feng, Huamin
Zhang, Yongdong
contents Graph anomaly detection (GAD) is a challenging binary classification problem due to its different structural distribution between anomalies and normal nodes -- abnormal nodes are a minority, therefore holding high heterophily and low homophily compared to normal nodes. Furthermore, due to various time factors and the annotation preferences of human experts, the heterophily and homophily can change across training and testing data, which is called structural distribution shift (SDS) in this paper. The mainstream methods are built on graph neural networks (GNNs), benefiting the classification of normals from aggregating homophilous neighbors, yet ignoring the SDS issue for anomalies and suffering from poor generalization. This work solves the problem from a feature view. We observe that the degree of SDS varies between anomalies and normal nodes. Hence to address the issue, the key lies in resisting high heterophily for anomalies meanwhile benefiting the learning of normals from homophily. We tease out the anomaly features on which we constrain to mitigate the effect of heterophilous neighbors and make them invariant. We term our proposed framework as Graph Decomposition Network (GDN). Extensive experiments are conducted on two benchmark datasets, and the proposed framework achieves a remarkable performance boost in GAD, especially in an SDS environment where anomalies have largely different structural distribution across training and testing environments. Codes are open-sourced in https://github.com/blacksingular/wsdm_GDN.
format Preprint
id arxiv_https___arxiv_org_abs_2401_14155
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Alleviating Structural Distribution Shift in Graph Anomaly Detection
Gao, Yuan
Wang, Xiang
He, Xiangnan
Liu, Zhenguang
Feng, Huamin
Zhang, Yongdong
Machine Learning
Artificial Intelligence
Graph anomaly detection (GAD) is a challenging binary classification problem due to its different structural distribution between anomalies and normal nodes -- abnormal nodes are a minority, therefore holding high heterophily and low homophily compared to normal nodes. Furthermore, due to various time factors and the annotation preferences of human experts, the heterophily and homophily can change across training and testing data, which is called structural distribution shift (SDS) in this paper. The mainstream methods are built on graph neural networks (GNNs), benefiting the classification of normals from aggregating homophilous neighbors, yet ignoring the SDS issue for anomalies and suffering from poor generalization. This work solves the problem from a feature view. We observe that the degree of SDS varies between anomalies and normal nodes. Hence to address the issue, the key lies in resisting high heterophily for anomalies meanwhile benefiting the learning of normals from homophily. We tease out the anomaly features on which we constrain to mitigate the effect of heterophilous neighbors and make them invariant. We term our proposed framework as Graph Decomposition Network (GDN). Extensive experiments are conducted on two benchmark datasets, and the proposed framework achieves a remarkable performance boost in GAD, especially in an SDS environment where anomalies have largely different structural distribution across training and testing environments. Codes are open-sourced in https://github.com/blacksingular/wsdm_GDN.
title Alleviating Structural Distribution Shift in Graph Anomaly Detection
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2401.14155