Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Qu, Zhaonan, Lin, Kaixiang, Li, Zhaojian, Zhou, Jiayu, Zhou, Zhengyuan
Format:	Preprint
Published:	2020
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2007.05690
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917556990771200
author	Qu, Zhaonan Lin, Kaixiang Li, Zhaojian Zhou, Jiayu Zhou, Zhengyuan
author_facet	Qu, Zhaonan Lin, Kaixiang Li, Zhaojian Zhou, Jiayu Zhou, Zhengyuan
contents	Federated learning (FL) learns a model jointly from a set of participating devices without sharing each other's privately held data. The characteristics of non-i.i.d. data across the network, low device participation, high communication costs, and the mandate that data remain private bring challenges in understanding the convergence of FL algorithms, particularly regarding how convergence scales with the number of participating devices. In this paper, we focus on Federated Averaging (FedAvg), one of the most popular and effective FL algorithms in use today, as well as its Nesterov accelerated variant, and conduct a systematic study of how their convergence scale with the number of participating devices under non-i.i.d. data and partial participation in convex settings. We provide a unified analysis that establishes convergence guarantees for FedAvg under strongly convex, convex, and overparameterized strongly convex problems. We show that FedAvg enjoys linear speedup in each case, although with different convergence rates and communication efficiencies. For strongly convex and convex problems, we also characterize the corresponding convergence rates for the Nesterov accelerated FedAvg algorithm, which are the first linear speedup guarantees for momentum variants of FedAvg in convex settings. Empirical studies of the algorithms in various settings have supported our theoretical results.
format	Preprint
id	arxiv_https___arxiv_org_abs_2007_05690
institution	arXiv
publishDate	2020
record_format	arxiv
spellingShingle	A Unified Linear Speedup Analysis of Federated Averaging and Nesterov FedAvg Qu, Zhaonan Lin, Kaixiang Li, Zhaojian Zhou, Jiayu Zhou, Zhengyuan Machine Learning Federated learning (FL) learns a model jointly from a set of participating devices without sharing each other's privately held data. The characteristics of non-i.i.d. data across the network, low device participation, high communication costs, and the mandate that data remain private bring challenges in understanding the convergence of FL algorithms, particularly regarding how convergence scales with the number of participating devices. In this paper, we focus on Federated Averaging (FedAvg), one of the most popular and effective FL algorithms in use today, as well as its Nesterov accelerated variant, and conduct a systematic study of how their convergence scale with the number of participating devices under non-i.i.d. data and partial participation in convex settings. We provide a unified analysis that establishes convergence guarantees for FedAvg under strongly convex, convex, and overparameterized strongly convex problems. We show that FedAvg enjoys linear speedup in each case, although with different convergence rates and communication efficiencies. For strongly convex and convex problems, we also characterize the corresponding convergence rates for the Nesterov accelerated FedAvg algorithm, which are the first linear speedup guarantees for momentum variants of FedAvg in convex settings. Empirical studies of the algorithms in various settings have supported our theoretical results.
title	A Unified Linear Speedup Analysis of Federated Averaging and Nesterov FedAvg
topic	Machine Learning
url	https://arxiv.org/abs/2007.05690

Similar Items