Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Williamson, Måns, Eisenmann, Monika, Stillfjord, Tony
Format:	Preprint
Published:	2024
Subjects:	Optimization and Control
Online Access:	https://arxiv.org/abs/2406.16640
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916298878877696
author	Williamson, Måns Eisenmann, Monika Stillfjord, Tony
author_facet	Williamson, Måns Eisenmann, Monika Stillfjord, Tony
contents	Choosing the optimization algorithm that performs best on a given machine learning problem is often delicate, and there is no guarantee that current state-of-the-art algorithms will perform well across all tasks. Consequently, the more reliable methods that one has at hand, the larger the likelihood of a good end result. To this end, we introduce and analyze a large class of stochastic so-called soft-clipping schemes with a broad range of applications. Despite the wide adoption of clipping techniques in practice, soft-clipping methods have not been analyzed to a large extent in the literature. In particular, a rigorous mathematical analysis is lacking in the general, nonlinear case. Our analysis lays a theoretical foundation for a large class of such schemes, and motivates their usage. In particular, under standard assumptions such as Lipschitz continuous gradients of the objective function, we give rigorous proofs of convergence in expectation. These include rates in both the convex and the non-convex case, as well as almost sure convergence to a stationary point in the non-convex case. The computational cost of the analyzed schemes is essentially the same as that of stochastic gradient descent.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_16640
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Analysis of a Class of Stochastic Component-Wise Soft-Clipping Schemes Williamson, Måns Eisenmann, Monika Stillfjord, Tony Optimization and Control Choosing the optimization algorithm that performs best on a given machine learning problem is often delicate, and there is no guarantee that current state-of-the-art algorithms will perform well across all tasks. Consequently, the more reliable methods that one has at hand, the larger the likelihood of a good end result. To this end, we introduce and analyze a large class of stochastic so-called soft-clipping schemes with a broad range of applications. Despite the wide adoption of clipping techniques in practice, soft-clipping methods have not been analyzed to a large extent in the literature. In particular, a rigorous mathematical analysis is lacking in the general, nonlinear case. Our analysis lays a theoretical foundation for a large class of such schemes, and motivates their usage. In particular, under standard assumptions such as Lipschitz continuous gradients of the objective function, we give rigorous proofs of convergence in expectation. These include rates in both the convex and the non-convex case, as well as almost sure convergence to a stationary point in the non-convex case. The computational cost of the analyzed schemes is essentially the same as that of stochastic gradient descent.
title	Analysis of a Class of Stochastic Component-Wise Soft-Clipping Schemes
topic	Optimization and Control
url	https://arxiv.org/abs/2406.16640

Similar Items