Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Matsutani, Hiroki, Kondo, Masaaki, Sunaga, Kazuki, Marculescu, Radu
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2410.21073
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909368245551104
author	Matsutani, Hiroki Kondo, Masaaki Sunaga, Kazuki Marculescu, Radu
author_facet	Matsutani, Hiroki Kondo, Masaaki Sunaga, Kazuki Marculescu, Radu
contents	This paper proposes Skip2-LoRA as a lightweight fine-tuning method for deep neural networks to address the gap between pre-trained and deployed models. In our approach, trainable LoRA (low-rank adaptation) adapters are inserted between the last layer and every other layer to enhance the network expressive power while keeping the backward computation cost low. This architecture is well-suited to cache intermediate computation results of the forward pass and then can skip the forward computation of seen samples as training epochs progress. We implemented the combination of the proposed architecture and cache, denoted as Skip2-LoRA, and tested it on a $15 single board computer. Our results show that Skip2-LoRA reduces the fine-tuning time by 90.0% on average compared to the counterpart that has the same number of trainable parameters while preserving the accuracy, while taking only a few seconds on the microcontroller board.
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_21073
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Skip2-LoRA: A Lightweight On-device DNN Fine-tuning Method for Low-cost Edge Devices Matsutani, Hiroki Kondo, Masaaki Sunaga, Kazuki Marculescu, Radu Machine Learning Artificial Intelligence This paper proposes Skip2-LoRA as a lightweight fine-tuning method for deep neural networks to address the gap between pre-trained and deployed models. In our approach, trainable LoRA (low-rank adaptation) adapters are inserted between the last layer and every other layer to enhance the network expressive power while keeping the backward computation cost low. This architecture is well-suited to cache intermediate computation results of the forward pass and then can skip the forward computation of seen samples as training epochs progress. We implemented the combination of the proposed architecture and cache, denoted as Skip2-LoRA, and tested it on a $15 single board computer. Our results show that Skip2-LoRA reduces the fine-tuning time by 90.0% on average compared to the counterpart that has the same number of trainable parameters while preserving the accuracy, while taking only a few seconds on the microcontroller board.
title	Skip2-LoRA: A Lightweight On-device DNN Fine-tuning Method for Low-cost Edge Devices
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2410.21073

Similar Items