Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Prach, Bernd, Lampert, Christoph H.
Format:	Preprint
Published:	2023
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2311.06103
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914820001890304
author	Prach, Bernd Lampert, Christoph H.
author_facet	Prach, Bernd Lampert, Christoph H.
contents	A crucial property for achieving secure, trustworthy and interpretable deep learning systems is their robustness: small changes to a system's inputs should not result in large changes to its outputs. Mathematically, this means one strives for networks with a small Lipschitz constant. Several recent works have focused on how to construct such Lipschitz networks, typically by imposing constraints on the weight matrices. In this work, we study an orthogonal aspect, namely the role of the activation function. We show that commonly used activation functions, such as MaxMin, as well as all piece-wise linear ones with two segments unnecessarily restrict the class of representable functions, even in the simplest one-dimensional setting. We furthermore introduce the new N-activation function that is provably more expressive than currently popular activation functions. We provide code at https://github.com/berndprach/NActivation.
format	Preprint
id	arxiv_https___arxiv_org_abs_2311_06103
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	1-Lipschitz Neural Networks are more expressive with N-Activations Prach, Bernd Lampert, Christoph H. Machine Learning A crucial property for achieving secure, trustworthy and interpretable deep learning systems is their robustness: small changes to a system's inputs should not result in large changes to its outputs. Mathematically, this means one strives for networks with a small Lipschitz constant. Several recent works have focused on how to construct such Lipschitz networks, typically by imposing constraints on the weight matrices. In this work, we study an orthogonal aspect, namely the role of the activation function. We show that commonly used activation functions, such as MaxMin, as well as all piece-wise linear ones with two segments unnecessarily restrict the class of representable functions, even in the simplest one-dimensional setting. We furthermore introduce the new N-activation function that is provably more expressive than currently popular activation functions. We provide code at https://github.com/berndprach/NActivation.
title	1-Lipschitz Neural Networks are more expressive with N-Activations
topic	Machine Learning
url	https://arxiv.org/abs/2311.06103

Similar Items