Saved in:
Bibliographic Details
Main Authors: Singh, Arshdeep, Santra, Abhishek, Chakravarthy, Sharma
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2504.19328
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910920485109760
author Singh, Arshdeep
Santra, Abhishek
Chakravarthy, Sharma
author_facet Singh, Arshdeep
Santra, Abhishek
Chakravarthy, Sharma
contents Graph mining analyzes real-world graphs to find core substructures (connected subgraphs) in applications modeled as graphs. Substructure discovery is a process that involves identifying meaningful patterns, structures, or components within a large data set. These substructures can be of various types, such as frequent patterns, motifs, or other relevant features within the data. To model complex data sets -- with multiple types of entities and relationships -- multilayer networks (or MLNs) have been shown to be more effective as compared to simple and attributed graphs. Analysis algorithms on MLNs using the decoupling approach have been shown to be both efficient and accurate. Hence, this paper focuses on substructure discovery in homogeneous multilayer networks (one type of MLN) using a novel decoupling-based approach. In this approach, each layer is processed independently, and then the results from two or more layers are composed to identify substructures in the entire MLN. The algorithm is designed and implemented, including the composition part, using one of the distributed processing frameworks (the Map/Reduce paradigm) to provide scalability. After establishing the correctness, we analyze the speedup and response time of the proposed algorithm and approach through extensive experimental analysis on large synthetic and real-world data sets with diverse graph characteristics.
format Preprint
id arxiv_https___arxiv_org_abs_2504_19328
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Scalable Substructure Discovery Algorithm For Homogeneous Multilayer Networks
Singh, Arshdeep
Santra, Abhishek
Chakravarthy, Sharma
Social and Information Networks
Databases
Information Retrieval
Graph mining analyzes real-world graphs to find core substructures (connected subgraphs) in applications modeled as graphs. Substructure discovery is a process that involves identifying meaningful patterns, structures, or components within a large data set. These substructures can be of various types, such as frequent patterns, motifs, or other relevant features within the data. To model complex data sets -- with multiple types of entities and relationships -- multilayer networks (or MLNs) have been shown to be more effective as compared to simple and attributed graphs. Analysis algorithms on MLNs using the decoupling approach have been shown to be both efficient and accurate. Hence, this paper focuses on substructure discovery in homogeneous multilayer networks (one type of MLN) using a novel decoupling-based approach. In this approach, each layer is processed independently, and then the results from two or more layers are composed to identify substructures in the entire MLN. The algorithm is designed and implemented, including the composition part, using one of the distributed processing frameworks (the Map/Reduce paradigm) to provide scalability. After establishing the correctness, we analyze the speedup and response time of the proposed algorithm and approach through extensive experimental analysis on large synthetic and real-world data sets with diverse graph characteristics.
title Scalable Substructure Discovery Algorithm For Homogeneous Multilayer Networks
topic Social and Information Networks
Databases
Information Retrieval
url https://arxiv.org/abs/2504.19328