Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Liu, Wei, Panda, Anweshit, Pandey, Ujwal, Cook, Haven, Slota, George M., Wang, Naigang, Chen, Jie, Xu, Yangyang
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Distributed, Parallel, and Cluster Computing Optimization and Control
Online Access:	https://arxiv.org/abs/2604.09970
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913022354653184
author	Liu, Wei Panda, Anweshit Pandey, Ujwal Cook, Haven Slota, George M. Wang, Naigang Chen, Jie Xu, Yangyang
author_facet	Liu, Wei Panda, Anweshit Pandey, Ujwal Cook, Haven Slota, George M. Wang, Naigang Chen, Jie Xu, Yangyang
contents	In the decentralized distributed learning, achieving fast convergence and low communication cost is essential for scalability and high efficiency. Adaptive gradient methods, such as Adam, have demonstrated strong practical performance in deep learning and centralized distributed settings. However, their convergence properties remain largely unexplored in decentralized settings involving multiple local training steps, such as federated learning. To address this limitation, we propose LoDAdaC, a unified multiple Local Training (MLT) Decentralized framework with Adam-type updates and Compressed communication (CC). LoDAdaC accommodates a broad class of optimizers for its local adaptive updates, including AMSGrad, Adam, and AdaGrad; it is compatible with standard (possibly biased) compressors such as low-bit quantization and sparsification. MLT and CC enable LoDAdaC to achieve multiplied reduction of communication cost, while the technique of adaptive updates enables fast convergence. We rigorously prove the combined advantage through complexity analysis. In addition, experiments on image classification and GPT-style language model training validate our theoretical findings and show that LoDAdaC significantly outperforms existing decentralized algorithms in terms of convergence speed and communication efficiency.
format	Preprint
id	arxiv_https___arxiv_org_abs_2604_09970
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	LoDAdaC: a unified local training-based decentralized framework with adaptive gradients and compressed communication Liu, Wei Panda, Anweshit Pandey, Ujwal Cook, Haven Slota, George M. Wang, Naigang Chen, Jie Xu, Yangyang Machine Learning Distributed, Parallel, and Cluster Computing Optimization and Control In the decentralized distributed learning, achieving fast convergence and low communication cost is essential for scalability and high efficiency. Adaptive gradient methods, such as Adam, have demonstrated strong practical performance in deep learning and centralized distributed settings. However, their convergence properties remain largely unexplored in decentralized settings involving multiple local training steps, such as federated learning. To address this limitation, we propose LoDAdaC, a unified multiple Local Training (MLT) Decentralized framework with Adam-type updates and Compressed communication (CC). LoDAdaC accommodates a broad class of optimizers for its local adaptive updates, including AMSGrad, Adam, and AdaGrad; it is compatible with standard (possibly biased) compressors such as low-bit quantization and sparsification. MLT and CC enable LoDAdaC to achieve multiplied reduction of communication cost, while the technique of adaptive updates enables fast convergence. We rigorously prove the combined advantage through complexity analysis. In addition, experiments on image classification and GPT-style language model training validate our theoretical findings and show that LoDAdaC significantly outperforms existing decentralized algorithms in terms of convergence speed and communication efficiency.
title	LoDAdaC: a unified local training-based decentralized framework with adaptive gradients and compressed communication
topic	Machine Learning Distributed, Parallel, and Cluster Computing Optimization and Control
url	https://arxiv.org/abs/2604.09970

Similar Items