Saved in:
Bibliographic Details
Main Authors: Gao, Ziwen, He, Baihua, Yang, Yuhong
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.13421
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910216243642368
author Gao, Ziwen
He, Baihua
Yang, Yuhong
author_facet Gao, Ziwen
He, Baihua
Yang, Yuhong
contents Many pre-trained models (PTMs) are available in modern applications. Because different PTMs are often trained on different datasets, their performances can vary substantially for different new tasks, and the ranking of the candidates may depend heavily on the input. Motivated by this, we propose a localized model averaging method with weights modeled as functions of the covariates, making it substantially more versatile than existing model averaging methods. This formulation allows the model averaging procedure to adaptively capture the varying relative advantages of different PTMs across heterogeneous contexts. Specifically, we learn flexible local weights under a general loss framework that accommodates a broad class of prediction tasks. We further establish the asymptotic optimality of the proposed method for both in-sample and out-of-sample risks, as well as the consistency of the estimated weights. Extensive numerical experiments further demonstrate the effectiveness of the proposed method.
format Preprint
id arxiv_https___arxiv_org_abs_2605_13421
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Combining pre-trained models via localized model averaging
Gao, Ziwen
He, Baihua
Yang, Yuhong
Methodology
Many pre-trained models (PTMs) are available in modern applications. Because different PTMs are often trained on different datasets, their performances can vary substantially for different new tasks, and the ranking of the candidates may depend heavily on the input. Motivated by this, we propose a localized model averaging method with weights modeled as functions of the covariates, making it substantially more versatile than existing model averaging methods. This formulation allows the model averaging procedure to adaptively capture the varying relative advantages of different PTMs across heterogeneous contexts. Specifically, we learn flexible local weights under a general loss framework that accommodates a broad class of prediction tasks. We further establish the asymptotic optimality of the proposed method for both in-sample and out-of-sample risks, as well as the consistency of the estimated weights. Extensive numerical experiments further demonstrate the effectiveness of the proposed method.
title Combining pre-trained models via localized model averaging
topic Methodology
url https://arxiv.org/abs/2605.13421