Saved in:
Bibliographic Details
Main Authors: D'Inverno, Giuseppe Alessio, Bianchini, Monica, Scarselli, Franco
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2401.12362
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911822923169792
author D'Inverno, Giuseppe Alessio
Bianchini, Monica
Scarselli, Franco
author_facet D'Inverno, Giuseppe Alessio
Bianchini, Monica
Scarselli, Franco
contents Graph Neural Networks (GNNs) have emerged in recent years as a powerful tool to learn tasks across a wide range of graph domains in a data-driven fashion; based on a message passing mechanism, GNNs have gained increasing popularity due to their intuitive formulation, closely linked with the Weisfeiler-Lehman (WL) test for graph isomorphism, to which they have proven equivalent. From a theoretical point of view, GNNs have been shown to be universal approximators, and their generalization capability (namely, bounds on the Vapnik Chervonekis (VC) dimension) has recently been investigated for GNNs with piecewise polynomial activation functions. The aim of our work is to extend this analysis on the VC dimension of GNNs to other commonly used activation functions, such as sigmoid and hyperbolic tangent, using the framework of Pfaffian function theory. Bounds are provided with respect to architecture parameters (depth, number of neurons, input size) as well as with respect to the number of colors resulting from the 1-WL test applied on the graph domain. The theoretical analysis is supported by a preliminary experimental study.
format Preprint
id arxiv_https___arxiv_org_abs_2401_12362
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle VC dimension of Graph Neural Networks with Pfaffian activation functions
D'Inverno, Giuseppe Alessio
Bianchini, Monica
Scarselli, Franco
Machine Learning
Graph Neural Networks (GNNs) have emerged in recent years as a powerful tool to learn tasks across a wide range of graph domains in a data-driven fashion; based on a message passing mechanism, GNNs have gained increasing popularity due to their intuitive formulation, closely linked with the Weisfeiler-Lehman (WL) test for graph isomorphism, to which they have proven equivalent. From a theoretical point of view, GNNs have been shown to be universal approximators, and their generalization capability (namely, bounds on the Vapnik Chervonekis (VC) dimension) has recently been investigated for GNNs with piecewise polynomial activation functions. The aim of our work is to extend this analysis on the VC dimension of GNNs to other commonly used activation functions, such as sigmoid and hyperbolic tangent, using the framework of Pfaffian function theory. Bounds are provided with respect to architecture parameters (depth, number of neurons, input size) as well as with respect to the number of colors resulting from the 1-WL test applied on the graph domain. The theoretical analysis is supported by a preliminary experimental study.
title VC dimension of Graph Neural Networks with Pfaffian activation functions
topic Machine Learning
url https://arxiv.org/abs/2401.12362