Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zheng, Tianqi, Loizou, Nicolas, You, Pengcheng, Mallada, Enrique
Format:	Preprint
Published:	2024
Subjects:	Optimization and Control Machine Learning
Online Access:	https://arxiv.org/abs/2403.09090
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929276195962880
author	Zheng, Tianqi Loizou, Nicolas You, Pengcheng Mallada, Enrique
author_facet	Zheng, Tianqi Loizou, Nicolas You, Pengcheng Mallada, Enrique
contents	Gradient Descent Ascent (GDA) methods for min-max optimization problems typically produce oscillatory behavior that can lead to instability, e.g., in bilinear settings. To address this problem, we introduce a dissipation term into the GDA updates to dampen these oscillations. The proposed Dissipative GDA (DGDA) method can be seen as performing standard GDA on a state-augmented and regularized saddle function that does not strictly introduce additional convexity/concavity. We theoretically show the linear convergence of DGDA in the bilinear and strongly convex-strongly concave settings and assess its performance by comparing DGDA with other methods such as GDA, Extra-Gradient (EG), and Optimistic GDA. Our findings demonstrate that DGDA surpasses these methods, achieving superior convergence rates. We support our claims with two numerical examples that showcase DGDA's effectiveness in solving saddle point problems.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_09090
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Dissipative Gradient Descent Ascent Method: A Control Theory Inspired Algorithm for Min-max Optimization Zheng, Tianqi Loizou, Nicolas You, Pengcheng Mallada, Enrique Optimization and Control Machine Learning Gradient Descent Ascent (GDA) methods for min-max optimization problems typically produce oscillatory behavior that can lead to instability, e.g., in bilinear settings. To address this problem, we introduce a dissipation term into the GDA updates to dampen these oscillations. The proposed Dissipative GDA (DGDA) method can be seen as performing standard GDA on a state-augmented and regularized saddle function that does not strictly introduce additional convexity/concavity. We theoretically show the linear convergence of DGDA in the bilinear and strongly convex-strongly concave settings and assess its performance by comparing DGDA with other methods such as GDA, Extra-Gradient (EG), and Optimistic GDA. Our findings demonstrate that DGDA surpasses these methods, achieving superior convergence rates. We support our claims with two numerical examples that showcase DGDA's effectiveness in solving saddle point problems.
title	Dissipative Gradient Descent Ascent Method: A Control Theory Inspired Algorithm for Min-max Optimization
topic	Optimization and Control Machine Learning
url	https://arxiv.org/abs/2403.09090

Similar Items