Saved in:
Bibliographic Details
Main Authors: Sulehman, Yusuf, Mu, Tingting
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.18613
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929451017699328
author Sulehman, Yusuf
Mu, Tingting
author_facet Sulehman, Yusuf
Mu, Tingting
contents Estimating the Lipschitz constant of deep neural networks is of growing interest as it is useful for informing on generalisability and adversarial robustness. Convolutional neural networks (CNNs) in particular, underpin much of the recent success in computer vision related applications. However, although existing methods for estimating the Lipschitz constant can be tight, they have limited scalability when applied to CNNs. To tackle this, we propose a novel method to accelerate Lipschitz constant estimation for CNNs. The core idea is to divide a large convolutional block via a joint layer and width-wise partition, into a collection of smaller blocks. We prove an upper-bound on the Lipschitz constant of the larger block in terms of the Lipschitz constants of the smaller blocks. Through varying the partition factor, the resulting method can be adjusted to prioritise either accuracy or scalability and permits parallelisation. We demonstrate an enhanced scalability and comparable accuracy to existing baselines through a range of experiments.
format Preprint
id arxiv_https___arxiv_org_abs_2403_18613
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Scalable Lipschitz Estimation for CNNs
Sulehman, Yusuf
Mu, Tingting
Machine Learning
Estimating the Lipschitz constant of deep neural networks is of growing interest as it is useful for informing on generalisability and adversarial robustness. Convolutional neural networks (CNNs) in particular, underpin much of the recent success in computer vision related applications. However, although existing methods for estimating the Lipschitz constant can be tight, they have limited scalability when applied to CNNs. To tackle this, we propose a novel method to accelerate Lipschitz constant estimation for CNNs. The core idea is to divide a large convolutional block via a joint layer and width-wise partition, into a collection of smaller blocks. We prove an upper-bound on the Lipschitz constant of the larger block in terms of the Lipschitz constants of the smaller blocks. Through varying the partition factor, the resulting method can be adjusted to prioritise either accuracy or scalability and permits parallelisation. We demonstrate an enhanced scalability and comparable accuracy to existing baselines through a range of experiments.
title Scalable Lipschitz Estimation for CNNs
topic Machine Learning
url https://arxiv.org/abs/2403.18613