Saved in:
Bibliographic Details
Main Authors: Shen, Jiajun, Jin, Yufei, He, Yi, Zhu, Xingquan
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.03432
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912626855903232
author Shen, Jiajun
Jin, Yufei
He, Yi
Zhu, Xingquan
author_facet Shen, Jiajun
Jin, Yufei
He, Yi
Zhu, Xingquan
contents Learning from large heterogeneous graphs presents significant challenges due to the scale of networks, heterogeneity in node and edge types, variations in nodal features, and complex local neighborhood structures. This paper advocates for ensemble learning as a natural solution to this problem, whereby training multiple graph learners under distinct sampling conditions, the ensemble inherently captures different aspects of graph heterogeneity. Yet, the crux lies in combining these learners to meet global optimization objective while maintaining computational efficiency on large-scale graphs. In response, we propose LHGEL, an ensemble framework that addresses these challenges through batch sampling with three key components, namely batch view aggregation, residual attention, and diversity regularization. Specifically, batch view aggregation samples subgraphs and forms multiple graph views, while residual attention adaptively weights the contributions of these views to guide node embeddings toward informative subgraphs, thereby improving the accuracy of base learners. Diversity regularization encourages representational disparity across embedding matrices derived from different views, promoting model diversity and ensemble robustness. Our theoretical study demonstrates that residual attention mitigates gradient vanishing issues commonly faced in ensemble learning. Empirical results on five real heterogeneous networks validate that our LHGEL approach consistently outperforms its state-of-the-art competitors by substantial margin. Codes and datasets are available at https://github.com/Chrisshen12/LHGEL.
format Preprint
id arxiv_https___arxiv_org_abs_2510_03432
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle LHGEL: Large Heterogeneous Graph Ensemble Learning using Batch View Aggregation
Shen, Jiajun
Jin, Yufei
He, Yi
Zhu, Xingquan
Machine Learning
Learning from large heterogeneous graphs presents significant challenges due to the scale of networks, heterogeneity in node and edge types, variations in nodal features, and complex local neighborhood structures. This paper advocates for ensemble learning as a natural solution to this problem, whereby training multiple graph learners under distinct sampling conditions, the ensemble inherently captures different aspects of graph heterogeneity. Yet, the crux lies in combining these learners to meet global optimization objective while maintaining computational efficiency on large-scale graphs. In response, we propose LHGEL, an ensemble framework that addresses these challenges through batch sampling with three key components, namely batch view aggregation, residual attention, and diversity regularization. Specifically, batch view aggregation samples subgraphs and forms multiple graph views, while residual attention adaptively weights the contributions of these views to guide node embeddings toward informative subgraphs, thereby improving the accuracy of base learners. Diversity regularization encourages representational disparity across embedding matrices derived from different views, promoting model diversity and ensemble robustness. Our theoretical study demonstrates that residual attention mitigates gradient vanishing issues commonly faced in ensemble learning. Empirical results on five real heterogeneous networks validate that our LHGEL approach consistently outperforms its state-of-the-art competitors by substantial margin. Codes and datasets are available at https://github.com/Chrisshen12/LHGEL.
title LHGEL: Large Heterogeneous Graph Ensemble Learning using Batch View Aggregation
topic Machine Learning
url https://arxiv.org/abs/2510.03432