Saved in:
| Main Author: | Xu, Yongzhong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.18649 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SwitchLoRA: Switched Low-Rank Adaptation Can Learn Full-Rank Information
by: Zhou, Kaiye, et al.
Published: (2024)
by: Zhou, Kaiye, et al.
Published: (2024)
Low-Dimensional and Transversely Curved Optimization Dynamics in Grokking
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
Optimizer-Induced Low-Dimensional Drift and Transverse Dynamics in Transformer Training
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
Low-Dimensional Execution Manifolds in Transformer Learning Dynamics: Evidence from Modular Arithmetic Tasks
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
Spectral Edge Dynamics Reveal Functional Modes of Learning
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
Early-Warning Signals of Grokking via Loss-Landscape Geometry
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
Gradient-Direction Sensitivity Reveals Linear-Centroid Coupling Hidden by Optimizer Trajectories
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
The Geometry of Multi-Task Grokking: Transverse Instability, Superposition, and Weight Decay Phase Structure
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
Spectral Probe-Circuits: A Three-Step Recipe for Identifying Attention-Head Circuits in Pretrained Transformers
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
Spectral Edge Dynamics of Training Trajectories: Signal--Noise Geometry Across Scales
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
Spectral Edge Dynamics: An Analytical-Empirical Study of Phase Transitions in Neural Network Training
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
When Do Attention Circuits Form? Developmental Trajectories of Capability and Attention-Sink Emergence Across Three 1B-ClassArchitectures
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
Feature Repulsion and Spectral Lock-in: An Empirical Study of Two-Layer Network Grokking
by: Xu, Yongzhong
Published: (2026)
by: Xu, Yongzhong
Published: (2026)
DEL-Ranking: Ranking-Correction Denoising Framework for Elucidating Molecular Affinities in DNA-Encoded Libraries
by: Cao, Hanqun, et al.
Published: (2024)
by: Cao, Hanqun, et al.
Published: (2024)
Low-Rank Compression of Language Models via Differentiable Rank Selection
by: Sundrani, Sidhant, et al.
Published: (2025)
by: Sundrani, Sidhant, et al.
Published: (2025)
Riemannian Networks over Full-Rank Correlation Matrices
by: Chen, Ziheng, et al.
Published: (2026)
by: Chen, Ziheng, et al.
Published: (2026)
Hybrid-LoRA: Bridging Full Fine-Tuning and Low-Rank Adaptation for Post-Training
by: Zhang, Chengqian, et al.
Published: (2026)
by: Zhang, Chengqian, et al.
Published: (2026)
Ranking-aware Reinforcement Learning for Ordinal Ranking
by: Hao, Aiming, et al.
Published: (2026)
by: Hao, Aiming, et al.
Published: (2026)
Rank-1 LoRAs Encode Interpretable Reasoning Signals
by: Ward, Jake, et al.
Published: (2025)
by: Ward, Jake, et al.
Published: (2025)
Energy-Structured Low-Rank Adaptation for Continual Learning
by: Li, Longhua, et al.
Published: (2026)
by: Li, Longhua, et al.
Published: (2026)
PLAN: Proactive Low-Rank Allocation for Continual Learning
by: Wang, Xiequn, et al.
Published: (2025)
by: Wang, Xiequn, et al.
Published: (2025)
Toward Learning POMDPs Beyond Full-Rank Actions and State Observability
by: Shaw, Seiji, et al.
Published: (2026)
by: Shaw, Seiji, et al.
Published: (2026)
Mixture-of-Subspaces in Low-Rank Adaptation
by: Wu, Taiqiang, et al.
Published: (2024)
by: Wu, Taiqiang, et al.
Published: (2024)
Towards Symmetric Low-Rank Adapters
by: Panoutsos, Tales, et al.
Published: (2025)
by: Panoutsos, Tales, et al.
Published: (2025)
The Primacy of Magnitude in Low-Rank Adaptation
by: Zhang, Zicheng, et al.
Published: (2025)
by: Zhang, Zicheng, et al.
Published: (2025)
FRoD: Full-Rank Efficient Fine-Tuning with Rotational Degrees for Fast Convergence
by: Wan, Guoan, et al.
Published: (2025)
by: Wan, Guoan, et al.
Published: (2025)
Preference Learning Algorithms Do Not Learn Preference Rankings
by: Chen, Angelica, et al.
Published: (2024)
by: Chen, Angelica, et al.
Published: (2024)
MoR: Mixture of Ranks for Low-Rank Adaptation Tuning
by: Tang, Chuanyu, et al.
Published: (2024)
by: Tang, Chuanyu, et al.
Published: (2024)
Low-Rank Adaptation for Critic Learning in Off-Policy Reinforcement Learning
by: Zhuang, Yuan, et al.
Published: (2026)
by: Zhuang, Yuan, et al.
Published: (2026)
Communication-Efficient Federated Low-Rank Update Algorithm and its Connection to Implicit Regularization
by: Park, Haemin, et al.
Published: (2024)
by: Park, Haemin, et al.
Published: (2024)
Emergent Low-Rank Training Dynamics in MLPs with Smooth Activations
by: Xu, Alec S., et al.
Published: (2026)
by: Xu, Alec S., et al.
Published: (2026)
Federated Dynamical Low-Rank Training with Global Loss Convergence Guarantees
by: Schotthöfer, Steffen, et al.
Published: (2024)
by: Schotthöfer, Steffen, et al.
Published: (2024)
Polynomial Expansion Rank Adaptation: Enhancing Low-Rank Fine-Tuning with High-Order Interactions
by: Zhang, Wenhao, et al.
Published: (2026)
by: Zhang, Wenhao, et al.
Published: (2026)
GeoRA: Geometry-Aware Low-Rank Adaptation for RLVR
by: Zhang, Jiaying, et al.
Published: (2026)
by: Zhang, Jiaying, et al.
Published: (2026)
Probe-Free Low-Rank Activation Intervention
by: Jiang, Chonghe, et al.
Published: (2025)
by: Jiang, Chonghe, et al.
Published: (2025)
Low-Rank MDPs with Continuous Action Spaces
by: Bennett, Andrew, et al.
Published: (2023)
by: Bennett, Andrew, et al.
Published: (2023)
Low Rank Gradients and Where to Find Them
by: Sonthalia, Rishi, et al.
Published: (2025)
by: Sonthalia, Rishi, et al.
Published: (2025)
Stable-LoRA: Stabilizing Feature Learning of Low-Rank Adaptation
by: Wu, Yize, et al.
Published: (2026)
by: Wu, Yize, et al.
Published: (2026)
Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement Learning
by: Rozada, Sergio, et al.
Published: (2022)
by: Rozada, Sergio, et al.
Published: (2022)
Hessian Aware Low-Rank Perturbation for Order-Robust Continual Learning
by: Li, Jiaqi, et al.
Published: (2023)
by: Li, Jiaqi, et al.
Published: (2023)
Similar Items
-
SwitchLoRA: Switched Low-Rank Adaptation Can Learn Full-Rank Information
by: Zhou, Kaiye, et al.
Published: (2024) -
Low-Dimensional and Transversely Curved Optimization Dynamics in Grokking
by: Xu, Yongzhong
Published: (2026) -
Optimizer-Induced Low-Dimensional Drift and Transverse Dynamics in Transformer Training
by: Xu, Yongzhong
Published: (2026) -
Low-Dimensional Execution Manifolds in Transformer Learning Dynamics: Evidence from Modular Arithmetic Tasks
by: Xu, Yongzhong
Published: (2026) -
Spectral Edge Dynamics Reveal Functional Modes of Learning
by: Xu, Yongzhong
Published: (2026)