Enregistré dans:
Détails bibliographiques
Auteurs principaux: Wagner, Nicolas, Fan, Dongyang, Jaggi, Martin
Format: Preprint
Publié: 2024
Sujets:
Accès en ligne:https://arxiv.org/abs/2404.09753
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866911979922259968
author Wagner, Nicolas
Fan, Dongyang
Jaggi, Martin
author_facet Wagner, Nicolas
Fan, Dongyang
Jaggi, Martin
contents We explore on-device self-supervised collaborative fine-tuning of large language models with limited local data availability. Taking inspiration from the collaborative learning community, we introduce three distinct trust-weighted gradient aggregation schemes: weight similarity-based, prediction similarity-based and validation performance-based. To minimize communication overhead, we integrate Low-Rank Adaptation (LoRA) and only exchange LoRA weight updates. Our protocols, driven by prediction and performance metrics, surpass both FedAvg and local fine-tuning methods, which is particularly evident in realistic scenarios with more diverse local data distributions. The results underscore the effectiveness of our approach in addressing heterogeneity and scarcity within local datasets.
format Preprint
id arxiv_https___arxiv_org_abs_2404_09753
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Personalized Collaborative Fine-Tuning for On-Device Large Language Models
Wagner, Nicolas
Fan, Dongyang
Jaggi, Martin
Computation and Language
Machine Learning
We explore on-device self-supervised collaborative fine-tuning of large language models with limited local data availability. Taking inspiration from the collaborative learning community, we introduce three distinct trust-weighted gradient aggregation schemes: weight similarity-based, prediction similarity-based and validation performance-based. To minimize communication overhead, we integrate Low-Rank Adaptation (LoRA) and only exchange LoRA weight updates. Our protocols, driven by prediction and performance metrics, surpass both FedAvg and local fine-tuning methods, which is particularly evident in realistic scenarios with more diverse local data distributions. The results underscore the effectiveness of our approach in addressing heterogeneity and scarcity within local datasets.
title Personalized Collaborative Fine-Tuning for On-Device Large Language Models
topic Computation and Language
Machine Learning
url https://arxiv.org/abs/2404.09753