Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Ahmadi, Saba, Bhandari, Siddharth, Blum, Avrim, Dan, Chen, Jain, Prabhav
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2406.03458
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918064790962176
author	Ahmadi, Saba Bhandari, Siddharth Blum, Avrim Dan, Chen Jain, Prabhav
author_facet	Ahmadi, Saba Bhandari, Siddharth Blum, Avrim Dan, Chen Jain, Prabhav
contents	We initiate the study of a new notion of adversarial loss which we call distributional adversarial loss. In this notion, we assume for each original example, the allowed adversarial perturbation set is a family of distributions, and the adversarial loss over each example is the maximum loss over all the associated distributions. The goal is to minimize the overall adversarial loss. We show sample complexity bounds in the PAC-learning setting for our notion of adversarial loss. Our notion of adversarial loss contrasts the prior work on robust learning that considers a set of points, not distributions, as the perturbation set of each clean example. As an application of our approach, we show how to unify the two lines of work on randomized smoothing and robust learning in the PAC-learning setting and derive sample complexity bounds for randomized smoothing methods. Furthermore, we investigate the role of randomness in achieving robustness against adversarial attacks. We show a general derandomization technique that preserves the extent of a randomized classifier's robustness against adversarial attacks and show its effectiveness empirically.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_03458
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Distributional Adversarial Loss Ahmadi, Saba Bhandari, Siddharth Blum, Avrim Dan, Chen Jain, Prabhav Machine Learning We initiate the study of a new notion of adversarial loss which we call distributional adversarial loss. In this notion, we assume for each original example, the allowed adversarial perturbation set is a family of distributions, and the adversarial loss over each example is the maximum loss over all the associated distributions. The goal is to minimize the overall adversarial loss. We show sample complexity bounds in the PAC-learning setting for our notion of adversarial loss. Our notion of adversarial loss contrasts the prior work on robust learning that considers a set of points, not distributions, as the perturbation set of each clean example. As an application of our approach, we show how to unify the two lines of work on randomized smoothing and robust learning in the PAC-learning setting and derive sample complexity bounds for randomized smoothing methods. Furthermore, we investigate the role of randomness in achieving robustness against adversarial attacks. We show a general derandomization technique that preserves the extent of a randomized classifier's robustness against adversarial attacks and show its effectiveness empirically.
title	Distributional Adversarial Loss
topic	Machine Learning
url	https://arxiv.org/abs/2406.03458

Similar Items