Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Liu, Meitong, Jung, Christopher, Li, Rui, Feng, Xue, Zhao, Han
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2603.28739
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908921883525120
author	Liu, Meitong Jung, Christopher Li, Rui Feng, Xue Zhao, Han
author_facet	Liu, Meitong Jung, Christopher Li, Rui Feng, Xue Zhao, Han
contents	In transfer learning, the learner leverages auxiliary data to improve generalization on a main task. However, the precise theoretical understanding of when and how auxiliary data help remains incomplete. We provide new insights on this issue in two canonical linear settings: ordinary least squares regression and under-parameterized linear neural networks. For linear regression, we derive exact closed-form expressions for the expected generalization error with bias-variance decomposition, yielding necessary and sufficient conditions for auxiliary tasks to improve generalization on the main task. We also derive globally optimal task weights as outputs of solvable optimization programs, with consistency guarantees for empirical estimates. For linear neural networks with shared representations of width $q \leq K$, where $K$ is the number of auxiliary tasks, we derive a non-asymptotic expectation bound on the generalization error, yielding the first non-vacuous sufficient condition for beneficial auxiliary learning in this setting, as well as principled directions for task weight curation. We achieve this by proving a new column-wise low-rank perturbation bound for random matrices, which improves upon existing bounds by preserving fine-grained column structures. Our results are verified on synthetic data simulated with controlled parameters.
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_28739
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Expectation Error Bounds for Transfer Learning in Linear Regression and Linear Neural Networks Liu, Meitong Jung, Christopher Li, Rui Feng, Xue Zhao, Han Machine Learning In transfer learning, the learner leverages auxiliary data to improve generalization on a main task. However, the precise theoretical understanding of when and how auxiliary data help remains incomplete. We provide new insights on this issue in two canonical linear settings: ordinary least squares regression and under-parameterized linear neural networks. For linear regression, we derive exact closed-form expressions for the expected generalization error with bias-variance decomposition, yielding necessary and sufficient conditions for auxiliary tasks to improve generalization on the main task. We also derive globally optimal task weights as outputs of solvable optimization programs, with consistency guarantees for empirical estimates. For linear neural networks with shared representations of width $q \leq K$, where $K$ is the number of auxiliary tasks, we derive a non-asymptotic expectation bound on the generalization error, yielding the first non-vacuous sufficient condition for beneficial auxiliary learning in this setting, as well as principled directions for task weight curation. We achieve this by proving a new column-wise low-rank perturbation bound for random matrices, which improves upon existing bounds by preserving fine-grained column structures. Our results are verified on synthetic data simulated with controlled parameters.
title	Expectation Error Bounds for Transfer Learning in Linear Regression and Linear Neural Networks
topic	Machine Learning
url	https://arxiv.org/abs/2603.28739

Similar Items