Saved in:
Bibliographic Details
Main Authors: Chu, Yun-Wei, Han, Dong-Jun, Brinton, Christopher G.
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2401.07456
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913798831472640
author Chu, Yun-Wei
Han, Dong-Jun
Brinton, Christopher G.
author_facet Chu, Yun-Wei
Han, Dong-Jun
Brinton, Christopher G.
contents Federated learning (FL) is a promising distributed machine learning paradigm that enables multiple clients to collaboratively train a global model. In this paper, we focus on a practical federated multilingual learning setup where clients with their own language-specific data aim to collaboratively construct a high-quality neural machine translation (NMT) model. However, communication constraints in practical network systems present challenges for exchanging large-scale NMT engines between FL parties. We propose a meta-learning-based adaptive parameter selection methodology, MetaSend, that improves the communication efficiency of model transmissions from clients during FL-based multilingual NMT training. Our approach learns a dynamic threshold for filtering parameters prior to transmission without compromising the NMT model quality, based on the tensor deviations of clients between different FL rounds. Through experiments on two NMT datasets with different language distributions, we demonstrate that MetaSend obtains substantial improvements over baselines in translation quality in the presence of a limited communication budget.
format Preprint
id arxiv_https___arxiv_org_abs_2401_07456
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Only Send What You Need: Learning to Communicate Efficiently in Federated Multilingual Machine Translation
Chu, Yun-Wei
Han, Dong-Jun
Brinton, Christopher G.
Computation and Language
Artificial Intelligence
Federated learning (FL) is a promising distributed machine learning paradigm that enables multiple clients to collaboratively train a global model. In this paper, we focus on a practical federated multilingual learning setup where clients with their own language-specific data aim to collaboratively construct a high-quality neural machine translation (NMT) model. However, communication constraints in practical network systems present challenges for exchanging large-scale NMT engines between FL parties. We propose a meta-learning-based adaptive parameter selection methodology, MetaSend, that improves the communication efficiency of model transmissions from clients during FL-based multilingual NMT training. Our approach learns a dynamic threshold for filtering parameters prior to transmission without compromising the NMT model quality, based on the tensor deviations of clients between different FL rounds. Through experiments on two NMT datasets with different language distributions, we demonstrate that MetaSend obtains substantial improvements over baselines in translation quality in the presence of a limited communication budget.
title Only Send What You Need: Learning to Communicate Efficiently in Federated Multilingual Machine Translation
topic Computation and Language
Artificial Intelligence
url https://arxiv.org/abs/2401.07456