Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhang, Suoxin, He, Run, Fang, Di, Tan, Xiang, Chen, Kaixuan, Zhuang, Huiping
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2605.06183
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917468109275136
author	Zhang, Suoxin He, Run Fang, Di Tan, Xiang Chen, Kaixuan Zhuang, Huiping
author_facet	Zhang, Suoxin He, Run Fang, Di Tan, Xiang Chen, Kaixuan Zhuang, Huiping
contents	Low-rank adaptation (LoRA) is a widely used parameter-efficient fine-tuning method that places trainable low-rank adapters into frozen pre-trained models. Recent studies show that using fewer LoRA adapters may still maintain or even improve performance, but existing methods still distribute adapters broadly, leaving where to place a limited number of adapters to maximize performance largely open. To investigate this, we introduce PAGE (Projected Adapter Gradient Energy), a gradient-based sensitivity probe that estimates the initial trainable gradient energy available to each candidate LoRA adapter. Surprisingly, we find that PAGE is highly concentrated on a single shallow FFN down-projection across two model families and four downstream tasks. We term this module the dominant adaptation module and show that its layer index is architecture-dependent but task-stable. Motivated by this finding, we propose DomLoRA, a placement method that places a single adapter at the dominant adaptation module. With only ~0.7% of vanilla LoRA's trainable parameters, DomLoRA outperforms it on average across various downstream tasks, including instruction following, mathematical reasoning, code generation, and multi-turn conversation. This method also improves other LoRA variants, supporting the dominant adaptation module perspective as a practical placement guideline.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_06183
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Rethinking Adapter Placement: A Dominant Adaptation Module Perspective Zhang, Suoxin He, Run Fang, Di Tan, Xiang Chen, Kaixuan Zhuang, Huiping Artificial Intelligence Computation and Language Machine Learning Low-rank adaptation (LoRA) is a widely used parameter-efficient fine-tuning method that places trainable low-rank adapters into frozen pre-trained models. Recent studies show that using fewer LoRA adapters may still maintain or even improve performance, but existing methods still distribute adapters broadly, leaving where to place a limited number of adapters to maximize performance largely open. To investigate this, we introduce PAGE (Projected Adapter Gradient Energy), a gradient-based sensitivity probe that estimates the initial trainable gradient energy available to each candidate LoRA adapter. Surprisingly, we find that PAGE is highly concentrated on a single shallow FFN down-projection across two model families and four downstream tasks. We term this module the dominant adaptation module and show that its layer index is architecture-dependent but task-stable. Motivated by this finding, we propose DomLoRA, a placement method that places a single adapter at the dominant adaptation module. With only ~0.7% of vanilla LoRA's trainable parameters, DomLoRA outperforms it on average across various downstream tasks, including instruction following, mathematical reasoning, code generation, and multi-turn conversation. This method also improves other LoRA variants, supporting the dominant adaptation module perspective as a practical placement guideline.
title	Rethinking Adapter Placement: A Dominant Adaptation Module Perspective
topic	Artificial Intelligence Computation and Language Machine Learning
url	https://arxiv.org/abs/2605.06183

Similar Items