Saved in:
Bibliographic Details
Main Authors: Mu, Wenchuan, Lim, Kwan Hui
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.16457
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910423055335424
author Mu, Wenchuan
Lim, Kwan Hui
author_facet Mu, Wenchuan
Lim, Kwan Hui
contents In deep learning applications, robustness measures the ability of neural models that handle slight changes in input data, which could lead to potential safety hazards, especially in safety-critical applications. Pre-deployment assessment of model robustness is essential, but existing methods often suffer from either high costs or imprecise results. To enhance safety in real-world scenarios, metrics that effectively capture the model's robustness are needed. To address this issue, we compare the rigour and usage conditions of various assessment methods based on different definitions. Then, we propose a straightforward and practical metric utilizing hypothesis testing for probabilistic robustness and have integrated it into the TorchAttacks library. Through a comparative analysis of diverse robustness assessment methods, our approach contributes to a deeper understanding of model robustness in safety-critical applications.
format Preprint
id arxiv_https___arxiv_org_abs_2404_16457
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Towards Precise Observations of Neural Model Robustness in Classification
Mu, Wenchuan
Lim, Kwan Hui
Software Engineering
Artificial Intelligence
In deep learning applications, robustness measures the ability of neural models that handle slight changes in input data, which could lead to potential safety hazards, especially in safety-critical applications. Pre-deployment assessment of model robustness is essential, but existing methods often suffer from either high costs or imprecise results. To enhance safety in real-world scenarios, metrics that effectively capture the model's robustness are needed. To address this issue, we compare the rigour and usage conditions of various assessment methods based on different definitions. Then, we propose a straightforward and practical metric utilizing hypothesis testing for probabilistic robustness and have integrated it into the TorchAttacks library. Through a comparative analysis of diverse robustness assessment methods, our approach contributes to a deeper understanding of model robustness in safety-critical applications.
title Towards Precise Observations of Neural Model Robustness in Classification
topic Software Engineering
Artificial Intelligence
url https://arxiv.org/abs/2404.16457