Saved in:
Bibliographic Details
Main Authors: Sugawara, Sota, Kawamata, Yuji, Toyoda, Akihiro, Nakayama, Tomoru, Okada, Yukihiko
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.09304
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908764851929088
author Sugawara, Sota
Kawamata, Yuji
Toyoda, Akihiro
Nakayama, Tomoru
Okada, Yukihiko
author_facet Sugawara, Sota
Kawamata, Yuji
Toyoda, Akihiro
Nakayama, Tomoru
Okada, Yukihiko
contents Federated Learning (FL) enables distributed learning across multiple clients without sharing raw data. When statistical heterogeneity across clients is severe, Clustered Federated Learning (CFL) can improve performance by grouping similar clients and training cluster-wise models. However, most CFL approaches rely on multiple communication rounds for cluster estimation and model updates, which limits their practicality under tight constraints on communication rounds. We propose Data Collaboration-based Clustered Federated Learning (DC-CFL), a single-round framework that completes both client clustering and cluster-wise learning, using only the information shared in DC analysis. DC-CFL quantifies inter-client similarity via total variation distance between label distributions, estimates clusters using hierarchical clustering, and performs cluster-wise learning via DC analysis. Experiments on multiple open datasets under representative non-IID conditions show that DC-CFL achieves accuracy comparable to multi-round baselines while requiring only one communication round. These results indicate that DC-CFL is a practical alternative for collaborative AI model development when multiple communication rounds are impractical.
format Preprint
id arxiv_https___arxiv_org_abs_2601_09304
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Single-Round Clustered Federated Learning via Data Collaboration Analysis for Non-IID Data
Sugawara, Sota
Kawamata, Yuji
Toyoda, Akihiro
Nakayama, Tomoru
Okada, Yukihiko
Machine Learning
Federated Learning (FL) enables distributed learning across multiple clients without sharing raw data. When statistical heterogeneity across clients is severe, Clustered Federated Learning (CFL) can improve performance by grouping similar clients and training cluster-wise models. However, most CFL approaches rely on multiple communication rounds for cluster estimation and model updates, which limits their practicality under tight constraints on communication rounds. We propose Data Collaboration-based Clustered Federated Learning (DC-CFL), a single-round framework that completes both client clustering and cluster-wise learning, using only the information shared in DC analysis. DC-CFL quantifies inter-client similarity via total variation distance between label distributions, estimates clusters using hierarchical clustering, and performs cluster-wise learning via DC analysis. Experiments on multiple open datasets under representative non-IID conditions show that DC-CFL achieves accuracy comparable to multi-round baselines while requiring only one communication round. These results indicate that DC-CFL is a practical alternative for collaborative AI model development when multiple communication rounds are impractical.
title Single-Round Clustered Federated Learning via Data Collaboration Analysis for Non-IID Data
topic Machine Learning
url https://arxiv.org/abs/2601.09304