Saved in:
Bibliographic Details
Main Authors: Liu, Wei, Panda, Anweshit, Pandey, Ujwal, Cook, Haven, Slota, George M., Wang, Naigang, Chen, Jie, Xu, Yangyang
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.09970
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913022354653184
author Liu, Wei
Panda, Anweshit
Pandey, Ujwal
Cook, Haven
Slota, George M.
Wang, Naigang
Chen, Jie
Xu, Yangyang
author_facet Liu, Wei
Panda, Anweshit
Pandey, Ujwal
Cook, Haven
Slota, George M.
Wang, Naigang
Chen, Jie
Xu, Yangyang
contents In the decentralized distributed learning, achieving fast convergence and low communication cost is essential for scalability and high efficiency. Adaptive gradient methods, such as Adam, have demonstrated strong practical performance in deep learning and centralized distributed settings. However, their convergence properties remain largely unexplored in decentralized settings involving multiple local training steps, such as federated learning. To address this limitation, we propose LoDAdaC, a unified multiple Local Training (MLT) Decentralized framework with Adam-type updates and Compressed communication (CC). LoDAdaC accommodates a broad class of optimizers for its local adaptive updates, including AMSGrad, Adam, and AdaGrad; it is compatible with standard (possibly biased) compressors such as low-bit quantization and sparsification. MLT and CC enable LoDAdaC to achieve multiplied reduction of communication cost, while the technique of adaptive updates enables fast convergence. We rigorously prove the combined advantage through complexity analysis. In addition, experiments on image classification and GPT-style language model training validate our theoretical findings and show that LoDAdaC significantly outperforms existing decentralized algorithms in terms of convergence speed and communication efficiency.
format Preprint
id arxiv_https___arxiv_org_abs_2604_09970
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle LoDAdaC: a unified local training-based decentralized framework with adaptive gradients and compressed communication
Liu, Wei
Panda, Anweshit
Pandey, Ujwal
Cook, Haven
Slota, George M.
Wang, Naigang
Chen, Jie
Xu, Yangyang
Machine Learning
Distributed, Parallel, and Cluster Computing
Optimization and Control
In the decentralized distributed learning, achieving fast convergence and low communication cost is essential for scalability and high efficiency. Adaptive gradient methods, such as Adam, have demonstrated strong practical performance in deep learning and centralized distributed settings. However, their convergence properties remain largely unexplored in decentralized settings involving multiple local training steps, such as federated learning. To address this limitation, we propose LoDAdaC, a unified multiple Local Training (MLT) Decentralized framework with Adam-type updates and Compressed communication (CC). LoDAdaC accommodates a broad class of optimizers for its local adaptive updates, including AMSGrad, Adam, and AdaGrad; it is compatible with standard (possibly biased) compressors such as low-bit quantization and sparsification. MLT and CC enable LoDAdaC to achieve multiplied reduction of communication cost, while the technique of adaptive updates enables fast convergence. We rigorously prove the combined advantage through complexity analysis. In addition, experiments on image classification and GPT-style language model training validate our theoretical findings and show that LoDAdaC significantly outperforms existing decentralized algorithms in terms of convergence speed and communication efficiency.
title LoDAdaC: a unified local training-based decentralized framework with adaptive gradients and compressed communication
topic Machine Learning
Distributed, Parallel, and Cluster Computing
Optimization and Control
url https://arxiv.org/abs/2604.09970