Saved in:
Bibliographic Details
Main Authors: Zhong, Jiajun, Ye, Weiwei, Gui, Ning
Format: Preprint
Published: 2022
Subjects:
Online Access:https://arxiv.org/abs/2212.02810
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909168207659008
author Zhong, Jiajun
Ye, Weiwei
Gui, Ning
author_facet Zhong, Jiajun
Ye, Weiwei
Gui, Ning
contents Effective data imputation demands rich latent ``structure" discovery capabilities from ``plain" tabular data. Recent advances in graph neural networks-based data imputation solutions show their strong structure learning potential by directly translating tabular data as bipartite graphs. However, due to a lack of relations between samples, those solutions treat all samples equally which is against one important observation: ``similar sample should give more information about missing values." This paper presents a novel Iterative graph Generation and Reconstruction framework for Missing data imputation(IGRM). Instead of treating all samples equally, we introduce the concept: ``friend networks" to represent different relations among samples. To generate an accurate friend network with missing data, an end-to-end friend network reconstruction solution is designed to allow for continuous friend network optimization during imputation learning. The representation of the optimized friend network, in turn, is used to further optimize the data imputation process with differentiated message passing. Experiment results on eight benchmark datasets show that IGRM yields 39.13% lower mean absolute error compared with nine baselines and 9.04% lower than the second-best. Our code is available at https://github.com/G-AILab/IGRM.
format Preprint
id arxiv_https___arxiv_org_abs_2212_02810
institution arXiv
publishDate 2022
record_format arxiv
spellingShingle Data Imputation with Iterative Graph Reconstruction
Zhong, Jiajun
Ye, Weiwei
Gui, Ning
Machine Learning
Effective data imputation demands rich latent ``structure" discovery capabilities from ``plain" tabular data. Recent advances in graph neural networks-based data imputation solutions show their strong structure learning potential by directly translating tabular data as bipartite graphs. However, due to a lack of relations between samples, those solutions treat all samples equally which is against one important observation: ``similar sample should give more information about missing values." This paper presents a novel Iterative graph Generation and Reconstruction framework for Missing data imputation(IGRM). Instead of treating all samples equally, we introduce the concept: ``friend networks" to represent different relations among samples. To generate an accurate friend network with missing data, an end-to-end friend network reconstruction solution is designed to allow for continuous friend network optimization during imputation learning. The representation of the optimized friend network, in turn, is used to further optimize the data imputation process with differentiated message passing. Experiment results on eight benchmark datasets show that IGRM yields 39.13% lower mean absolute error compared with nine baselines and 9.04% lower than the second-best. Our code is available at https://github.com/G-AILab/IGRM.
title Data Imputation with Iterative Graph Reconstruction
topic Machine Learning
url https://arxiv.org/abs/2212.02810