Saved in:
Bibliographic Details
Main Authors: Hsu, Chih-Fan, Chang, Ming-Ching, Chen, Wei-Chao
Format: Preprint
Published: 2022
Subjects:
Online Access:https://arxiv.org/abs/2210.00737
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910671408463872
author Hsu, Chih-Fan
Chang, Ming-Ching
Chen, Wei-Chao
author_facet Hsu, Chih-Fan
Chang, Ming-Ching
Chen, Wei-Chao
contents The absence of training data and their distribution changes in federated learning (FL) can significantly undermine model performance, especially in cross-silo scenarios. To address this challenge, we introduce the Federated Learning with Data Digest (FedDig) framework. FedDig manages unexpected distribution changes using a novel privacy-controllable data digest representation. This framework allows FL users to adjust the protection levels of the digest by manipulating hyperparameters that control the mixing of multiple low-dimensional features and applying differential privacy perturbation to these mixed features. Evaluation of FedDig across four diverse public datasets shows that it consistently outperforms five baseline algorithms by substantial margins in various data absence scenarios. We also thoroughly explored FedDig's hyperparameters, demonstrating its adaptability. Notably, the FedDig plugin design is inherently extensible and compatible with existing FL algorithms.
format Preprint
id arxiv_https___arxiv_org_abs_2210_00737
institution arXiv
publishDate 2022
record_format arxiv
spellingShingle Mitigating Data Absence in Federated Learning Using Privacy-Controllable Data Digests
Hsu, Chih-Fan
Chang, Ming-Ching
Chen, Wei-Chao
Machine Learning
The absence of training data and their distribution changes in federated learning (FL) can significantly undermine model performance, especially in cross-silo scenarios. To address this challenge, we introduce the Federated Learning with Data Digest (FedDig) framework. FedDig manages unexpected distribution changes using a novel privacy-controllable data digest representation. This framework allows FL users to adjust the protection levels of the digest by manipulating hyperparameters that control the mixing of multiple low-dimensional features and applying differential privacy perturbation to these mixed features. Evaluation of FedDig across four diverse public datasets shows that it consistently outperforms five baseline algorithms by substantial margins in various data absence scenarios. We also thoroughly explored FedDig's hyperparameters, demonstrating its adaptability. Notably, the FedDig plugin design is inherently extensible and compatible with existing FL algorithms.
title Mitigating Data Absence in Federated Learning Using Privacy-Controllable Data Digests
topic Machine Learning
url https://arxiv.org/abs/2210.00737