Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Dar, Yehuda, LeJeune, Daniel, Baraniuk, Richard G.
Format:	Preprint
Published:	2021
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2103.05621
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929367175659520
author	Dar, Yehuda LeJeune, Daniel Baraniuk, Richard G.
author_facet	Dar, Yehuda LeJeune, Daniel Baraniuk, Richard G.
contents	We study a fundamental transfer learning process from source to target linear regression tasks, including overparameterized settings where there are more learned parameters than data samples. The target task learning is addressed by using its training data together with the parameters previously computed for the source task. We define a transfer learning approach to the target task as a linear regression optimization with a regularization on the distance between the to-be-learned target parameters and the already-learned source parameters. We analytically characterize the generalization performance of our transfer learning approach and demonstrate its ability to resolve the peak in generalization errors in double descent phenomena of the minimum L2-norm solution to linear regression. Moreover, we show that for sufficiently related tasks, the optimally tuned transfer learning approach can outperform the optimally tuned ridge regression method, even when the true parameter vector conforms to an isotropic Gaussian prior distribution. Namely, we demonstrate that transfer learning can beat the minimum mean square error (MMSE) solution of the independent target task. Our results emphasize the ability of transfer learning to extend the solution space to the target task and, by that, to have an improved MMSE solution. We formulate the linear MMSE solution to our transfer learning setting and point out its key differences from the common design philosophy to transfer learning.
format	Preprint
id	arxiv_https___arxiv_org_abs_2103_05621
institution	arXiv
publishDate	2021
record_format	arxiv
spellingShingle	The Common Intuition to Transfer Learning Can Win or Lose: Case Studies for Linear Regression Dar, Yehuda LeJeune, Daniel Baraniuk, Richard G. Machine Learning We study a fundamental transfer learning process from source to target linear regression tasks, including overparameterized settings where there are more learned parameters than data samples. The target task learning is addressed by using its training data together with the parameters previously computed for the source task. We define a transfer learning approach to the target task as a linear regression optimization with a regularization on the distance between the to-be-learned target parameters and the already-learned source parameters. We analytically characterize the generalization performance of our transfer learning approach and demonstrate its ability to resolve the peak in generalization errors in double descent phenomena of the minimum L2-norm solution to linear regression. Moreover, we show that for sufficiently related tasks, the optimally tuned transfer learning approach can outperform the optimally tuned ridge regression method, even when the true parameter vector conforms to an isotropic Gaussian prior distribution. Namely, we demonstrate that transfer learning can beat the minimum mean square error (MMSE) solution of the independent target task. Our results emphasize the ability of transfer learning to extend the solution space to the target task and, by that, to have an improved MMSE solution. We formulate the linear MMSE solution to our transfer learning setting and point out its key differences from the common design philosophy to transfer learning.
title	The Common Intuition to Transfer Learning Can Win or Lose: Case Studies for Linear Regression
topic	Machine Learning
url	https://arxiv.org/abs/2103.05621

Similar Items