Saved in:
Bibliographic Details
Main Authors: Sadlek, Lukáš, Husák, Martin, Čeleda, Pavel
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.03019
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916310542188544
author Sadlek, Lukáš
Husák, Martin
Čeleda, Pavel
author_facet Sadlek, Lukáš
Husák, Martin
Čeleda, Pavel
contents Devices in computer networks cannot work without essential network services provided by a limited count of devices. Identification of device dependencies determines whether a pair of IP addresses is a dependency, i.e., the host with the first IP address is dependent on the second one. These dependencies cannot be identified manually in large and dynamically changing networks. Nevertheless, they are important due to possible unexpected failures, performance issues, and cascading effects. We address the identification of dependencies using a new approach based on graph-based machine learning. The approach belongs to link prediction based on a latent representation of the computer network's communication graph. It samples random walks over IP addresses that fulfill time conditions imposed on network dependencies. The constrained random walks are used by a neural network to construct IP address embedding, which is a space that contains IP addresses that often appear close together in the same communication chain (i.e., random walk). Dependency embedding is constructed by combining values for IP addresses from their embedding and used for training the resulting dependency classifier. We evaluated the approach using IP flow datasets from a controlled environment and university campus network that contain evidence about dependencies. Evaluation concerning the correctness and relationship to other approaches shows that the approach achieves acceptable performance. It can simultaneously consider all types of dependencies and is applicable for batch processing in operational conditions.
format Preprint
id arxiv_https___arxiv_org_abs_2407_03019
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Identification of Device Dependencies Using Link Prediction
Sadlek, Lukáš
Husák, Martin
Čeleda, Pavel
Cryptography and Security
Devices in computer networks cannot work without essential network services provided by a limited count of devices. Identification of device dependencies determines whether a pair of IP addresses is a dependency, i.e., the host with the first IP address is dependent on the second one. These dependencies cannot be identified manually in large and dynamically changing networks. Nevertheless, they are important due to possible unexpected failures, performance issues, and cascading effects. We address the identification of dependencies using a new approach based on graph-based machine learning. The approach belongs to link prediction based on a latent representation of the computer network's communication graph. It samples random walks over IP addresses that fulfill time conditions imposed on network dependencies. The constrained random walks are used by a neural network to construct IP address embedding, which is a space that contains IP addresses that often appear close together in the same communication chain (i.e., random walk). Dependency embedding is constructed by combining values for IP addresses from their embedding and used for training the resulting dependency classifier. We evaluated the approach using IP flow datasets from a controlled environment and university campus network that contain evidence about dependencies. Evaluation concerning the correctness and relationship to other approaches shows that the approach achieves acceptable performance. It can simultaneously consider all types of dependencies and is applicable for batch processing in operational conditions.
title Identification of Device Dependencies Using Link Prediction
topic Cryptography and Security
url https://arxiv.org/abs/2407.03019