Saved in:
Bibliographic Details
Main Authors: Vernimmen, Pierre, Glineur, François
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2506.09730
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916817741545472
author Vernimmen, Pierre
Glineur, François
author_facet Vernimmen, Pierre
Glineur, François
contents This work assesses both empirically and theoretically, using the performance estimation methodology, how robust different first-order optimization methods are when subject to relative inexactness in their gradient computations. Relative inexactness occurs, for example, when compressing the gradient using fewer bits of information, which happens when dealing with large-scale problems on GPUs. Three major families of methods are analyzed: constant step gradient descent, long-step methods, and accelerated methods. The latter two are first shown to be theoretically not robust to inexactness. Then, a semi-heuristic shortening factor is introduced to improve their theoretical guarantees. All methods are subsequently tested on a concrete inexact problem, with two different types of relative inexactness, and it is observed that both accelerated methods are much more robust than expected, and that the shortening factor significantly helps the long-step methods. In the end, all shortened methods appear to be promising, even in this inexact setting.
format Preprint
id arxiv_https___arxiv_org_abs_2506_09730
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Empirical and computer-aided robustness analysis of long-step and accelerated methods in smooth convex optimization
Vernimmen, Pierre
Glineur, François
Optimization and Control
Machine Learning
This work assesses both empirically and theoretically, using the performance estimation methodology, how robust different first-order optimization methods are when subject to relative inexactness in their gradient computations. Relative inexactness occurs, for example, when compressing the gradient using fewer bits of information, which happens when dealing with large-scale problems on GPUs. Three major families of methods are analyzed: constant step gradient descent, long-step methods, and accelerated methods. The latter two are first shown to be theoretically not robust to inexactness. Then, a semi-heuristic shortening factor is introduced to improve their theoretical guarantees. All methods are subsequently tested on a concrete inexact problem, with two different types of relative inexactness, and it is observed that both accelerated methods are much more robust than expected, and that the shortening factor significantly helps the long-step methods. In the end, all shortened methods appear to be promising, even in this inexact setting.
title Empirical and computer-aided robustness analysis of long-step and accelerated methods in smooth convex optimization
topic Optimization and Control
Machine Learning
url https://arxiv.org/abs/2506.09730