Saved in:
Bibliographic Details
Main Authors: Chen, Xiangyu, Liu, Jing, Wang, Ye, Wang, Pu Perry, Brand, Matthew, Wang, Guanghui, Koike-Akino, Toshiaki
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.11887
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Low-rank adaptation (LoRA) and its variants are widely employed in fine-tuning large models, including large language models for natural language processing and diffusion models for computer vision. This paper proposes a generalized framework called SuperLoRA that unifies and extends different LoRA variants, which can be realized under different hyper-parameter settings. Introducing grouping, folding, shuffling, projecting, and tensor factoring, SuperLoRA offers high flexibility compared with other LoRA variants and demonstrates superior performance for transfer learning tasks especially in the extremely few-parameter regimes.