Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Gupta, Sunny, Jindal, Mohit, Kashyap, Pankhi, Jeevan, Pranav, Sethi, Amit
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Computer Vision and Pattern Recognition Distributed, Parallel, and Cluster Computing Optimization and Control I.2.6; C.1.4; D.1.3; I.5.1; H.3.4
Online Access:	https://arxiv.org/abs/2409.15216
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917791711363072
author	Gupta, Sunny Jindal, Mohit Kashyap, Pankhi Jeevan, Pranav Sethi, Amit
author_facet	Gupta, Sunny Jindal, Mohit Kashyap, Pankhi Jeevan, Pranav Sethi, Amit
contents	Federated learning faces a critical challenge in balancing communication efficiency with rapid convergence, especially for second-order methods. While Newton-type algorithms achieve linear convergence in communication rounds, transmitting full Hessian matrices is often impractical due to quadratic complexity. We introduce Federated Learning with Enhanced Nesterov-Newton Sketch (FLeNS), a novel method that harnesses both the acceleration capabilities of Nesterov's method and the dimensionality reduction benefits of Hessian sketching. FLeNS approximates the centralized Newton's method without relying on the exact Hessian, significantly reducing communication overhead. By combining Nesterov's acceleration with adaptive Hessian sketching, FLeNS preserves crucial second-order information while preserving the rapid convergence characteristics. Our theoretical analysis, grounded in statistical learning, demonstrates that FLeNS achieves super-linear convergence rates in communication rounds - a notable advancement in federated optimization. We provide rigorous convergence guarantees and characterize tradeoffs between acceleration, sketch size, and convergence speed. Extensive empirical evaluation validates our theoretical findings, showcasing FLeNS's state-of-the-art performance with reduced communication requirements, particularly in privacy-sensitive and edge-computing scenarios. The code is available at https://github.com/sunnyinAI/FLeNS
format	Preprint
id	arxiv_https___arxiv_org_abs_2409_15216
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	FLeNS: Federated Learning with Enhanced Nesterov-Newton Sketch Gupta, Sunny Jindal, Mohit Kashyap, Pankhi Jeevan, Pranav Sethi, Amit Machine Learning Computer Vision and Pattern Recognition Distributed, Parallel, and Cluster Computing Optimization and Control I.2.6; C.1.4; D.1.3; I.5.1; H.3.4 Federated learning faces a critical challenge in balancing communication efficiency with rapid convergence, especially for second-order methods. While Newton-type algorithms achieve linear convergence in communication rounds, transmitting full Hessian matrices is often impractical due to quadratic complexity. We introduce Federated Learning with Enhanced Nesterov-Newton Sketch (FLeNS), a novel method that harnesses both the acceleration capabilities of Nesterov's method and the dimensionality reduction benefits of Hessian sketching. FLeNS approximates the centralized Newton's method without relying on the exact Hessian, significantly reducing communication overhead. By combining Nesterov's acceleration with adaptive Hessian sketching, FLeNS preserves crucial second-order information while preserving the rapid convergence characteristics. Our theoretical analysis, grounded in statistical learning, demonstrates that FLeNS achieves super-linear convergence rates in communication rounds - a notable advancement in federated optimization. We provide rigorous convergence guarantees and characterize tradeoffs between acceleration, sketch size, and convergence speed. Extensive empirical evaluation validates our theoretical findings, showcasing FLeNS's state-of-the-art performance with reduced communication requirements, particularly in privacy-sensitive and edge-computing scenarios. The code is available at https://github.com/sunnyinAI/FLeNS
title	FLeNS: Federated Learning with Enhanced Nesterov-Newton Sketch
topic	Machine Learning Computer Vision and Pattern Recognition Distributed, Parallel, and Cluster Computing Optimization and Control I.2.6; C.1.4; D.1.3; I.5.1; H.3.4
url	https://arxiv.org/abs/2409.15216

Similar Items