Saved in:
Bibliographic Details
Main Authors: Ahmadi, Saba, Bhandari, Siddharth, Blum, Avrim, Dan, Chen, Jain, Prabhav
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2406.03458
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918064790962176
author Ahmadi, Saba
Bhandari, Siddharth
Blum, Avrim
Dan, Chen
Jain, Prabhav
author_facet Ahmadi, Saba
Bhandari, Siddharth
Blum, Avrim
Dan, Chen
Jain, Prabhav
contents We initiate the study of a new notion of adversarial loss which we call distributional adversarial loss. In this notion, we assume for each original example, the allowed adversarial perturbation set is a family of distributions, and the adversarial loss over each example is the maximum loss over all the associated distributions. The goal is to minimize the overall adversarial loss. We show sample complexity bounds in the PAC-learning setting for our notion of adversarial loss. Our notion of adversarial loss contrasts the prior work on robust learning that considers a set of points, not distributions, as the perturbation set of each clean example. As an application of our approach, we show how to unify the two lines of work on randomized smoothing and robust learning in the PAC-learning setting and derive sample complexity bounds for randomized smoothing methods. Furthermore, we investigate the role of randomness in achieving robustness against adversarial attacks. We show a general derandomization technique that preserves the extent of a randomized classifier's robustness against adversarial attacks and show its effectiveness empirically.
format Preprint
id arxiv_https___arxiv_org_abs_2406_03458
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Distributional Adversarial Loss
Ahmadi, Saba
Bhandari, Siddharth
Blum, Avrim
Dan, Chen
Jain, Prabhav
Machine Learning
We initiate the study of a new notion of adversarial loss which we call distributional adversarial loss. In this notion, we assume for each original example, the allowed adversarial perturbation set is a family of distributions, and the adversarial loss over each example is the maximum loss over all the associated distributions. The goal is to minimize the overall adversarial loss. We show sample complexity bounds in the PAC-learning setting for our notion of adversarial loss. Our notion of adversarial loss contrasts the prior work on robust learning that considers a set of points, not distributions, as the perturbation set of each clean example. As an application of our approach, we show how to unify the two lines of work on randomized smoothing and robust learning in the PAC-learning setting and derive sample complexity bounds for randomized smoothing methods. Furthermore, we investigate the role of randomness in achieving robustness against adversarial attacks. We show a general derandomization technique that preserves the extent of a randomized classifier's robustness against adversarial attacks and show its effectiveness empirically.
title Distributional Adversarial Loss
topic Machine Learning
url https://arxiv.org/abs/2406.03458