Saved in:
Bibliographic Details
Main Authors: Dubbeldam, Elias, Mohammadi, Reza, Schoonhoven, Marit, Birbil, S. Ilker
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.04543
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914379151179776
author Dubbeldam, Elias
Mohammadi, Reza
Schoonhoven, Marit
Birbil, S. Ilker
author_facet Dubbeldam, Elias
Mohammadi, Reza
Schoonhoven, Marit
Birbil, S. Ilker
contents Accurate predictions on tabular data rely on capturing complex, dataset-specific feature interactions. Attention-based methods and graph neural networks, referred to as graph-based tabular deep learning (GTDL), aim to improve predictions by modeling these interactions as a graph. In this work, we analyze how these methods model the feature interactions. Current GTDL approaches primarily focus on optimizing predictive accuracy, often neglecting the accurate modeling of the underlying graph structure. Using synthetic datasets with known ground-truth graph structures, we find that current GTDL methods fail to recover meaningful feature interactions, as their edge recovery is close to random. This suggests that the attention mechanism and message-passing schemes used in GTDL do not effectively capture feature interactions. Furthermore, when we impose the true interaction structure, we find that the predictive accuracy improves. This highlights the need for GTDL methods to prioritize accurate modeling of the graph structure, as it leads to better predictions.
format Preprint
id arxiv_https___arxiv_org_abs_2510_04543
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle The Role of Feature Interactions in Graph-based Tabular Deep Learning
Dubbeldam, Elias
Mohammadi, Reza
Schoonhoven, Marit
Birbil, S. Ilker
Machine Learning
Accurate predictions on tabular data rely on capturing complex, dataset-specific feature interactions. Attention-based methods and graph neural networks, referred to as graph-based tabular deep learning (GTDL), aim to improve predictions by modeling these interactions as a graph. In this work, we analyze how these methods model the feature interactions. Current GTDL approaches primarily focus on optimizing predictive accuracy, often neglecting the accurate modeling of the underlying graph structure. Using synthetic datasets with known ground-truth graph structures, we find that current GTDL methods fail to recover meaningful feature interactions, as their edge recovery is close to random. This suggests that the attention mechanism and message-passing schemes used in GTDL do not effectively capture feature interactions. Furthermore, when we impose the true interaction structure, we find that the predictive accuracy improves. This highlights the need for GTDL methods to prioritize accurate modeling of the graph structure, as it leads to better predictions.
title The Role of Feature Interactions in Graph-based Tabular Deep Learning
topic Machine Learning
url https://arxiv.org/abs/2510.04543