Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Sulehman, Yusuf, Mu, Tingting
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2403.18613
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929451017699328
author	Sulehman, Yusuf Mu, Tingting
author_facet	Sulehman, Yusuf Mu, Tingting
contents	Estimating the Lipschitz constant of deep neural networks is of growing interest as it is useful for informing on generalisability and adversarial robustness. Convolutional neural networks (CNNs) in particular, underpin much of the recent success in computer vision related applications. However, although existing methods for estimating the Lipschitz constant can be tight, they have limited scalability when applied to CNNs. To tackle this, we propose a novel method to accelerate Lipschitz constant estimation for CNNs. The core idea is to divide a large convolutional block via a joint layer and width-wise partition, into a collection of smaller blocks. We prove an upper-bound on the Lipschitz constant of the larger block in terms of the Lipschitz constants of the smaller blocks. Through varying the partition factor, the resulting method can be adjusted to prioritise either accuracy or scalability and permits parallelisation. We demonstrate an enhanced scalability and comparable accuracy to existing baselines through a range of experiments.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_18613
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Scalable Lipschitz Estimation for CNNs Sulehman, Yusuf Mu, Tingting Machine Learning Estimating the Lipschitz constant of deep neural networks is of growing interest as it is useful for informing on generalisability and adversarial robustness. Convolutional neural networks (CNNs) in particular, underpin much of the recent success in computer vision related applications. However, although existing methods for estimating the Lipschitz constant can be tight, they have limited scalability when applied to CNNs. To tackle this, we propose a novel method to accelerate Lipschitz constant estimation for CNNs. The core idea is to divide a large convolutional block via a joint layer and width-wise partition, into a collection of smaller blocks. We prove an upper-bound on the Lipschitz constant of the larger block in terms of the Lipschitz constants of the smaller blocks. Through varying the partition factor, the resulting method can be adjusted to prioritise either accuracy or scalability and permits parallelisation. We demonstrate an enhanced scalability and comparable accuracy to existing baselines through a range of experiments.
title	Scalable Lipschitz Estimation for CNNs
topic	Machine Learning
url	https://arxiv.org/abs/2403.18613

Similar Items