Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Li, Zexi, Lin, Jie, Li, Zhiqi, Zhu, Didi, Shen, Tao, Lin, Tao, Wu, Chao, Lane, Nicholas D.
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2402.18949
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918032408838144
author	Li, Zexi Lin, Jie Li, Zhiqi Zhu, Didi Shen, Tao Lin, Tao Wu, Chao Lane, Nicholas D.
author_facet	Li, Zexi Lin, Jie Li, Zhiqi Zhu, Didi Shen, Tao Lin, Tao Wu, Chao Lane, Nicholas D.
contents	Federated learning (FL) involves multiple heterogeneous clients collaboratively training a global model via iterative local updates and model fusion. The generalization of FL's global model has a large gap compared with centralized training, which is its bottleneck for broader applications. In this paper, we study and improve FL's generalization through a fundamental ``connectivity'' perspective, which means how the local models are connected in the parameter region and fused into a generalized global model. The term ``connectivity'' is derived from linear mode connectivity (LMC), studying the interpolated loss landscape of two different solutions (e.g., modes) of neural networks. Bridging the gap between LMC and FL, in this paper, we leverage fixed anchor models to empirically and theoretically study the transitivity property of connectivity from two models (LMC) to a group of models (model fusion in FL). Based on the findings, we propose FedGuCci(+), improving group connectivity for better generalization. It is shown that our methods can boost the generalization of FL under client heterogeneity across various tasks (4 CV datasets and 6 NLP datasets) and model architectures (e.g., ViTs and PLMs). The code is available here: \href{https://github.com/ZexiLee/fedgucci}{\faGithub~FedGuCci Codebase}.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_18949
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	FedGuCci: Making Local Models More Connected in Landscape for Federated Learning Li, Zexi Lin, Jie Li, Zhiqi Zhu, Didi Shen, Tao Lin, Tao Wu, Chao Lane, Nicholas D. Machine Learning Federated learning (FL) involves multiple heterogeneous clients collaboratively training a global model via iterative local updates and model fusion. The generalization of FL's global model has a large gap compared with centralized training, which is its bottleneck for broader applications. In this paper, we study and improve FL's generalization through a fundamental ``connectivity'' perspective, which means how the local models are connected in the parameter region and fused into a generalized global model. The term ``connectivity'' is derived from linear mode connectivity (LMC), studying the interpolated loss landscape of two different solutions (e.g., modes) of neural networks. Bridging the gap between LMC and FL, in this paper, we leverage fixed anchor models to empirically and theoretically study the transitivity property of connectivity from two models (LMC) to a group of models (model fusion in FL). Based on the findings, we propose FedGuCci(+), improving group connectivity for better generalization. It is shown that our methods can boost the generalization of FL under client heterogeneity across various tasks (4 CV datasets and 6 NLP datasets) and model architectures (e.g., ViTs and PLMs). The code is available here: \href{https://github.com/ZexiLee/fedgucci}{\faGithub~FedGuCci Codebase}.
title	FedGuCci: Making Local Models More Connected in Landscape for Federated Learning
topic	Machine Learning
url	https://arxiv.org/abs/2402.18949

Similar Items