Affichage MARC: :: Library Catalog

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Katende, Ronald, Kasumba, Henry
Format:	Preprint
Publié:	2024
Sujets:	Optimization and Control
Accès en ligne:	https://arxiv.org/abs/2409.12604
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

_version_	1866909767355596800
author	Katende, Ronald Kasumba, Henry
author_facet	Katende, Ronald Kasumba, Henry
contents	High-dimensional non-convex optimization problems in engineering design, control, and learning are often hindered by saddle points, flat plateaus, and strongly anisotropic curvature. This paper develops a unified, curvature-adaptive framework that combines stochastic perturbations, adaptive learning rates, and randomized subspace descent to enhance escape efficiency and scalability. We show theoretically that gradient flow almost surely avoids strict saddles, with escape probability increasing exponentially in dimension. For noise-perturbed gradient descent, we derive explicit escape-time bounds that depend on local curvature and noise magnitude. Adaptive step sizes further reduce escape times by responding to local gradient variability, while randomized subspace descent preserves descent directions in low-dimensional projections and ensures global convergence with logarithmic dependence on dimension. Numerical experiments on nonlinear and constrained benchmarks validate these results, demonstrating faster escape, improved robustness to ill-conditioning, and lower total runtime compared to standard first- and second-order methods. The proposed approach offers practical tools for large-scale engineering optimization tasks where curvature, noise, and dimensionality interplay critically.
format	Preprint
id	arxiv_https___arxiv_org_abs_2409_12604
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Curvature-Adaptive Perturbation and Subspace Descent for Robust Saddle Point Escape in High-Dimensional Optimization Katende, Ronald Kasumba, Henry Optimization and Control High-dimensional non-convex optimization problems in engineering design, control, and learning are often hindered by saddle points, flat plateaus, and strongly anisotropic curvature. This paper develops a unified, curvature-adaptive framework that combines stochastic perturbations, adaptive learning rates, and randomized subspace descent to enhance escape efficiency and scalability. We show theoretically that gradient flow almost surely avoids strict saddles, with escape probability increasing exponentially in dimension. For noise-perturbed gradient descent, we derive explicit escape-time bounds that depend on local curvature and noise magnitude. Adaptive step sizes further reduce escape times by responding to local gradient variability, while randomized subspace descent preserves descent directions in low-dimensional projections and ensures global convergence with logarithmic dependence on dimension. Numerical experiments on nonlinear and constrained benchmarks validate these results, demonstrating faster escape, improved robustness to ill-conditioning, and lower total runtime compared to standard first- and second-order methods. The proposed approach offers practical tools for large-scale engineering optimization tasks where curvature, noise, and dimensionality interplay critically.
title	Curvature-Adaptive Perturbation and Subspace Descent for Robust Saddle Point Escape in High-Dimensional Optimization
topic	Optimization and Control
url	https://arxiv.org/abs/2409.12604

Documents similaires