Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Xenos, Alexandros, Dognin, Noel-Malod, Przulj, Natasa
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2410.01865
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866916420882792448
author Xenos, Alexandros
Dognin, Noel-Malod
Przulj, Natasa
author_facet Xenos, Alexandros
Dognin, Noel-Malod
Przulj, Natasa
contents Low-dimensional embeddings are a cornerstone in the modelling and analysis of complex networks. However, most existing approaches for mining network embedding spaces rely on computationally intensive machine learning systems to facilitate downstream tasks. In the field of NLP, word embedding spaces capture semantic relationships \textit{linearly}, allowing for information retrieval using \textit{simple linear operations} on word embedding vectors. Here, we demonstrate that there are structural properties of network data that yields this linearity. We show that the more homophilic the network representation, the more linearly separable the corresponding network embedding space, yielding better downstream analysis results. Hence, we introduce novel graphlet-based methods enabling embedding of networks into more linearly separable spaces, allowing for their better mining. Our fundamental insights into the structure of network data that enable their \textit{\textbf{linear}} mining and exploitation enable the ML community to build upon, towards efficiently and explainably mining of the complex network data.
format Preprint
id arxiv_https___arxiv_org_abs_2410_01865
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Simplifying complex machine learning by linearly separable network embedding spaces
Xenos, Alexandros
Dognin, Noel-Malod
Przulj, Natasa
Social and Information Networks
Artificial Intelligence
Machine Learning
I.2; J.3
Low-dimensional embeddings are a cornerstone in the modelling and analysis of complex networks. However, most existing approaches for mining network embedding spaces rely on computationally intensive machine learning systems to facilitate downstream tasks. In the field of NLP, word embedding spaces capture semantic relationships \textit{linearly}, allowing for information retrieval using \textit{simple linear operations} on word embedding vectors. Here, we demonstrate that there are structural properties of network data that yields this linearity. We show that the more homophilic the network representation, the more linearly separable the corresponding network embedding space, yielding better downstream analysis results. Hence, we introduce novel graphlet-based methods enabling embedding of networks into more linearly separable spaces, allowing for their better mining. Our fundamental insights into the structure of network data that enable their \textit{\textbf{linear}} mining and exploitation enable the ML community to build upon, towards efficiently and explainably mining of the complex network data.
title Simplifying complex machine learning by linearly separable network embedding spaces
topic Social and Information Networks
Artificial Intelligence
Machine Learning
I.2; J.3
url https://arxiv.org/abs/2410.01865