Saved in:
Bibliographic Details
Main Authors: Zhang, Suoxin, He, Run, Fang, Di, Tan, Xiang, Chen, Kaixuan, Zhuang, Huiping
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.06183
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917468109275136
author Zhang, Suoxin
He, Run
Fang, Di
Tan, Xiang
Chen, Kaixuan
Zhuang, Huiping
author_facet Zhang, Suoxin
He, Run
Fang, Di
Tan, Xiang
Chen, Kaixuan
Zhuang, Huiping
contents Low-rank adaptation (LoRA) is a widely used parameter-efficient fine-tuning method that places trainable low-rank adapters into frozen pre-trained models. Recent studies show that using fewer LoRA adapters may still maintain or even improve performance, but existing methods still distribute adapters broadly, leaving where to place a limited number of adapters to maximize performance largely open. To investigate this, we introduce PAGE (Projected Adapter Gradient Energy), a gradient-based sensitivity probe that estimates the initial trainable gradient energy available to each candidate LoRA adapter. Surprisingly, we find that PAGE is highly concentrated on a single shallow FFN down-projection across two model families and four downstream tasks. We term this module the dominant adaptation module and show that its layer index is architecture-dependent but task-stable. Motivated by this finding, we propose DomLoRA, a placement method that places a single adapter at the dominant adaptation module. With only ~0.7% of vanilla LoRA's trainable parameters, DomLoRA outperforms it on average across various downstream tasks, including instruction following, mathematical reasoning, code generation, and multi-turn conversation. This method also improves other LoRA variants, supporting the dominant adaptation module perspective as a practical placement guideline.
format Preprint
id arxiv_https___arxiv_org_abs_2605_06183
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Rethinking Adapter Placement: A Dominant Adaptation Module Perspective
Zhang, Suoxin
He, Run
Fang, Di
Tan, Xiang
Chen, Kaixuan
Zhuang, Huiping
Artificial Intelligence
Computation and Language
Machine Learning
Low-rank adaptation (LoRA) is a widely used parameter-efficient fine-tuning method that places trainable low-rank adapters into frozen pre-trained models. Recent studies show that using fewer LoRA adapters may still maintain or even improve performance, but existing methods still distribute adapters broadly, leaving where to place a limited number of adapters to maximize performance largely open. To investigate this, we introduce PAGE (Projected Adapter Gradient Energy), a gradient-based sensitivity probe that estimates the initial trainable gradient energy available to each candidate LoRA adapter. Surprisingly, we find that PAGE is highly concentrated on a single shallow FFN down-projection across two model families and four downstream tasks. We term this module the dominant adaptation module and show that its layer index is architecture-dependent but task-stable. Motivated by this finding, we propose DomLoRA, a placement method that places a single adapter at the dominant adaptation module. With only ~0.7% of vanilla LoRA's trainable parameters, DomLoRA outperforms it on average across various downstream tasks, including instruction following, mathematical reasoning, code generation, and multi-turn conversation. This method also improves other LoRA variants, supporting the dominant adaptation module perspective as a practical placement guideline.
title Rethinking Adapter Placement: A Dominant Adaptation Module Perspective
topic Artificial Intelligence
Computation and Language
Machine Learning
url https://arxiv.org/abs/2605.06183