Saved in:
Bibliographic Details
Main Authors: Khan, Muhammad Irfan, Kontio, Elina, Khan, Suleiman A., Jafaritadi, Mojtaba
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2412.20253
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916545220837376
author Khan, Muhammad Irfan
Kontio, Elina
Khan, Suleiman A.
Jafaritadi, Mojtaba
author_facet Khan, Muhammad Irfan
Kontio, Elina
Khan, Suleiman A.
Jafaritadi, Mojtaba
contents Federated learning (FL) enables collaborative model training across decentralized datasets while preserving data privacy. However, optimally selecting participating collaborators in dynamic FL environments remains challenging. We present RL-HSimAgg, a novel reinforcement learning (RL) and similarity-weighted aggregation (simAgg) algorithm using harmonic mean to manage outlier data points. This paper proposes applying multi-armed bandit algorithms to improve collaborator selection and model generalization. By balancing exploration-exploitation trade-offs, these RL methods can promote resource-efficient training with diverse datasets. We demonstrate the effectiveness of Epsilon-greedy (EG) and upper confidence bound (UCB) algorithms for federated brain lesion segmentation. In simulation experiments on internal and external validation sets, RL-HSimAgg with UCB collaborator outperformed the EG method across all metrics, achieving higher Dice scores for Enhancing Tumor (0.7334 vs 0.6797), Tumor Core (0.7432 vs 0.6821), and Whole Tumor (0.8252 vs 0.7931) segmentation. Therefore, for the Federated Tumor Segmentation Challenge (FeTS 2024), we consider UCB as our primary client selection approach in federated Glioblastoma lesion segmentation of multi-modal MRIs. In conclusion, our research demonstrates that RL-based collaborator management, e.g. using UCB, can potentially improve model robustness and flexibility in distributed learning environments, particularly in domains like brain tumor segmentation.
format Preprint
id arxiv_https___arxiv_org_abs_2412_20253
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Election of Collaborators via Reinforcement Learning for Federated Brain Tumor Segmentation
Khan, Muhammad Irfan
Kontio, Elina
Khan, Suleiman A.
Jafaritadi, Mojtaba
Machine Learning
Computer Vision and Pattern Recognition
Federated learning (FL) enables collaborative model training across decentralized datasets while preserving data privacy. However, optimally selecting participating collaborators in dynamic FL environments remains challenging. We present RL-HSimAgg, a novel reinforcement learning (RL) and similarity-weighted aggregation (simAgg) algorithm using harmonic mean to manage outlier data points. This paper proposes applying multi-armed bandit algorithms to improve collaborator selection and model generalization. By balancing exploration-exploitation trade-offs, these RL methods can promote resource-efficient training with diverse datasets. We demonstrate the effectiveness of Epsilon-greedy (EG) and upper confidence bound (UCB) algorithms for federated brain lesion segmentation. In simulation experiments on internal and external validation sets, RL-HSimAgg with UCB collaborator outperformed the EG method across all metrics, achieving higher Dice scores for Enhancing Tumor (0.7334 vs 0.6797), Tumor Core (0.7432 vs 0.6821), and Whole Tumor (0.8252 vs 0.7931) segmentation. Therefore, for the Federated Tumor Segmentation Challenge (FeTS 2024), we consider UCB as our primary client selection approach in federated Glioblastoma lesion segmentation of multi-modal MRIs. In conclusion, our research demonstrates that RL-based collaborator management, e.g. using UCB, can potentially improve model robustness and flexibility in distributed learning environments, particularly in domains like brain tumor segmentation.
title Election of Collaborators via Reinforcement Learning for Federated Brain Tumor Segmentation
topic Machine Learning
Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2412.20253