Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Vernimmen, Pierre, Glineur, François
Format:	Preprint
Published:	2025
Subjects:	Optimization and Control Machine Learning
Online Access:	https://arxiv.org/abs/2506.09730
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916817741545472
author	Vernimmen, Pierre Glineur, François
author_facet	Vernimmen, Pierre Glineur, François
contents	This work assesses both empirically and theoretically, using the performance estimation methodology, how robust different first-order optimization methods are when subject to relative inexactness in their gradient computations. Relative inexactness occurs, for example, when compressing the gradient using fewer bits of information, which happens when dealing with large-scale problems on GPUs. Three major families of methods are analyzed: constant step gradient descent, long-step methods, and accelerated methods. The latter two are first shown to be theoretically not robust to inexactness. Then, a semi-heuristic shortening factor is introduced to improve their theoretical guarantees. All methods are subsequently tested on a concrete inexact problem, with two different types of relative inexactness, and it is observed that both accelerated methods are much more robust than expected, and that the shortening factor significantly helps the long-step methods. In the end, all shortened methods appear to be promising, even in this inexact setting.
format	Preprint
id	arxiv_https___arxiv_org_abs_2506_09730
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Empirical and computer-aided robustness analysis of long-step and accelerated methods in smooth convex optimization Vernimmen, Pierre Glineur, François Optimization and Control Machine Learning This work assesses both empirically and theoretically, using the performance estimation methodology, how robust different first-order optimization methods are when subject to relative inexactness in their gradient computations. Relative inexactness occurs, for example, when compressing the gradient using fewer bits of information, which happens when dealing with large-scale problems on GPUs. Three major families of methods are analyzed: constant step gradient descent, long-step methods, and accelerated methods. The latter two are first shown to be theoretically not robust to inexactness. Then, a semi-heuristic shortening factor is introduced to improve their theoretical guarantees. All methods are subsequently tested on a concrete inexact problem, with two different types of relative inexactness, and it is observed that both accelerated methods are much more robust than expected, and that the shortening factor significantly helps the long-step methods. In the end, all shortened methods appear to be promising, even in this inexact setting.
title	Empirical and computer-aided robustness analysis of long-step and accelerated methods in smooth convex optimization
topic	Optimization and Control Machine Learning
url	https://arxiv.org/abs/2506.09730

Similar Items