Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Almansoori, Abdulla Jasem, Ivanova, Maria, Veprikov, Andrey, Beznosikov, Aleksandr, Horváth, Samuel, Takáč, Martin
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2509.19977
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909804329435136
author	Almansoori, Abdulla Jasem Ivanova, Maria Veprikov, Andrey Beznosikov, Aleksandr Horváth, Samuel Takáč, Martin
author_facet	Almansoori, Abdulla Jasem Ivanova, Maria Veprikov, Andrey Beznosikov, Aleksandr Horváth, Samuel Takáč, Martin
contents	Low-Rank Adaptation (LoRA) fine-tunes large models by learning low-rank updates on top of frozen weights, dramatically reducing trainable parameters and memory. However, there is still a gap between full training with low-rank projections (SVDLoRA) and LoRA fine-tuning, indicating that LoRA steps can be further improved. In this study, we propose OPLoRA, a memory-efficient optimizer that closes this gap by casting LoRA optimization as an interpretable sub-problem and solving it efficiently with alternating least squares updates, where 1-2 alternating steps are empirically found to be sufficient to closely match truncated SVD without ever forming the full matrix. We also retrieve the recently proposed preconditioning methods for LoRA as a special case. OPLoRA supports momentum by maintaining a low-rank estimate using the same subroutine (LoRSum) for computing the step, with a memory budget of 3 times the number of LoRA parameters (i.e., same as Adam). We also propose an experimental scaled variant that uses the K-FAC metric, which could be of interest. Across a linear task, MNIST, CIFAR-100, and RoBERTa-base (MNLI), OPLoRA consistently approaches SVDLoRA's performance using significantly less memory.
format	Preprint
id	arxiv_https___arxiv_org_abs_2509_19977
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Faster Than SVD, Smarter Than SGD: The OPLoRA Alternating Update Almansoori, Abdulla Jasem Ivanova, Maria Veprikov, Andrey Beznosikov, Aleksandr Horváth, Samuel Takáč, Martin Machine Learning Low-Rank Adaptation (LoRA) fine-tunes large models by learning low-rank updates on top of frozen weights, dramatically reducing trainable parameters and memory. However, there is still a gap between full training with low-rank projections (SVDLoRA) and LoRA fine-tuning, indicating that LoRA steps can be further improved. In this study, we propose OPLoRA, a memory-efficient optimizer that closes this gap by casting LoRA optimization as an interpretable sub-problem and solving it efficiently with alternating least squares updates, where 1-2 alternating steps are empirically found to be sufficient to closely match truncated SVD without ever forming the full matrix. We also retrieve the recently proposed preconditioning methods for LoRA as a special case. OPLoRA supports momentum by maintaining a low-rank estimate using the same subroutine (LoRSum) for computing the step, with a memory budget of 3 times the number of LoRA parameters (i.e., same as Adam). We also propose an experimental scaled variant that uses the K-FAC metric, which could be of interest. Across a linear task, MNIST, CIFAR-100, and RoBERTa-base (MNLI), OPLoRA consistently approaches SVDLoRA's performance using significantly less memory.
title	Faster Than SVD, Smarter Than SGD: The OPLoRA Alternating Update
topic	Machine Learning
url	https://arxiv.org/abs/2509.19977

Similar Items