Saved in:
Bibliographic Details
Main Authors: Gerlach, Benedict, Anastacio, Marie, Hoos, Holger H.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.10048
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912481362837504
author Gerlach, Benedict
Anastacio, Marie
Hoos, Holger H.
author_facet Gerlach, Benedict
Anastacio, Marie
Hoos, Holger H.
contents As machine learning gets adopted into the industry quickly, trustworthiness is increasingly in focus. Yet, efficiency and sustainability of robust training pipelines still have to be established. In this work, we consider a simple pipeline for training adversarially robust decision trees and investigate the efficiency of each step. Our pipeline consists of three stages. Firstly, we choose the perturbation size automatically for each dataset. For that, we introduce a simple algorithm, instead of relying on intuition or prior work. Moreover, we show that the perturbation size can be estimated from smaller models than the one intended for full training, and thus significant gains in efficiency can be achieved. Secondly, we train state-of-the-art adversarial training methods and evaluate them regarding both their training time and adversarial accuracy. Thirdly, we certify the robustness of each of the models thus obtained and investigate the time required for this. We find that verification time, which is critical to the efficiency of the full pipeline, is not correlated with training time.
format Preprint
id arxiv_https___arxiv_org_abs_2507_10048
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle On the Efficiency of Training Robust Decision Trees
Gerlach, Benedict
Anastacio, Marie
Hoos, Holger H.
Machine Learning
As machine learning gets adopted into the industry quickly, trustworthiness is increasingly in focus. Yet, efficiency and sustainability of robust training pipelines still have to be established. In this work, we consider a simple pipeline for training adversarially robust decision trees and investigate the efficiency of each step. Our pipeline consists of three stages. Firstly, we choose the perturbation size automatically for each dataset. For that, we introduce a simple algorithm, instead of relying on intuition or prior work. Moreover, we show that the perturbation size can be estimated from smaller models than the one intended for full training, and thus significant gains in efficiency can be achieved. Secondly, we train state-of-the-art adversarial training methods and evaluate them regarding both their training time and adversarial accuracy. Thirdly, we certify the robustness of each of the models thus obtained and investigate the time required for this. We find that verification time, which is critical to the efficiency of the full pipeline, is not correlated with training time.
title On the Efficiency of Training Robust Decision Trees
topic Machine Learning
url https://arxiv.org/abs/2507.10048