Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Mu, Wenchuan, Lim, Kwan Hui
Format:	Preprint
Published:	2024
Subjects:	Software Engineering Artificial Intelligence
Online Access:	https://arxiv.org/abs/2404.16457
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910423055335424
author	Mu, Wenchuan Lim, Kwan Hui
author_facet	Mu, Wenchuan Lim, Kwan Hui
contents	In deep learning applications, robustness measures the ability of neural models that handle slight changes in input data, which could lead to potential safety hazards, especially in safety-critical applications. Pre-deployment assessment of model robustness is essential, but existing methods often suffer from either high costs or imprecise results. To enhance safety in real-world scenarios, metrics that effectively capture the model's robustness are needed. To address this issue, we compare the rigour and usage conditions of various assessment methods based on different definitions. Then, we propose a straightforward and practical metric utilizing hypothesis testing for probabilistic robustness and have integrated it into the TorchAttacks library. Through a comparative analysis of diverse robustness assessment methods, our approach contributes to a deeper understanding of model robustness in safety-critical applications.
format	Preprint
id	arxiv_https___arxiv_org_abs_2404_16457
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Towards Precise Observations of Neural Model Robustness in Classification Mu, Wenchuan Lim, Kwan Hui Software Engineering Artificial Intelligence In deep learning applications, robustness measures the ability of neural models that handle slight changes in input data, which could lead to potential safety hazards, especially in safety-critical applications. Pre-deployment assessment of model robustness is essential, but existing methods often suffer from either high costs or imprecise results. To enhance safety in real-world scenarios, metrics that effectively capture the model's robustness are needed. To address this issue, we compare the rigour and usage conditions of various assessment methods based on different definitions. Then, we propose a straightforward and practical metric utilizing hypothesis testing for probabilistic robustness and have integrated it into the TorchAttacks library. Through a comparative analysis of diverse robustness assessment methods, our approach contributes to a deeper understanding of model robustness in safety-critical applications.
title	Towards Precise Observations of Neural Model Robustness in Classification
topic	Software Engineering Artificial Intelligence
url	https://arxiv.org/abs/2404.16457

Similar Items