Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Kwon, Yongchan, Wu, Eric, Wu, Kevin, Zou, James
Format:	Preprint
Published:	2023
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2310.00902
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914712408555520
author	Kwon, Yongchan Wu, Eric Wu, Kevin Zou, James
author_facet	Kwon, Yongchan Wu, Eric Wu, Kevin Zou, James
contents	Quantifying the impact of training data points is crucial for understanding the outputs of machine learning models and for improving the transparency of the AI pipeline. The influence function is a principled and popular data attribution method, but its computational cost often makes it challenging to use. This issue becomes more pronounced in the setting of large language models and text-to-image models. In this work, we propose DataInf, an efficient influence approximation method that is practical for large-scale generative AI models. Leveraging an easy-to-compute closed-form expression, DataInf outperforms existing influence computation algorithms in terms of computational and memory efficiency. Our theoretical analysis shows that DataInf is particularly well-suited for parameter-efficient fine-tuning techniques such as LoRA. Through systematic empirical evaluations, we show that DataInf accurately approximates influence scores and is orders of magnitude faster than existing methods. In applications to RoBERTa-large, Llama-2-13B-chat, and stable-diffusion-v1.5 models, DataInf effectively identifies the most influential fine-tuning examples better than other approximate influence scores. Moreover, it can help to identify which data points are mislabeled.
format	Preprint
id	arxiv_https___arxiv_org_abs_2310_00902
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models Kwon, Yongchan Wu, Eric Wu, Kevin Zou, James Machine Learning Quantifying the impact of training data points is crucial for understanding the outputs of machine learning models and for improving the transparency of the AI pipeline. The influence function is a principled and popular data attribution method, but its computational cost often makes it challenging to use. This issue becomes more pronounced in the setting of large language models and text-to-image models. In this work, we propose DataInf, an efficient influence approximation method that is practical for large-scale generative AI models. Leveraging an easy-to-compute closed-form expression, DataInf outperforms existing influence computation algorithms in terms of computational and memory efficiency. Our theoretical analysis shows that DataInf is particularly well-suited for parameter-efficient fine-tuning techniques such as LoRA. Through systematic empirical evaluations, we show that DataInf accurately approximates influence scores and is orders of magnitude faster than existing methods. In applications to RoBERTa-large, Llama-2-13B-chat, and stable-diffusion-v1.5 models, DataInf effectively identifies the most influential fine-tuning examples better than other approximate influence scores. Moreover, it can help to identify which data points are mislabeled.
title	DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models
topic	Machine Learning
url	https://arxiv.org/abs/2310.00902

Similar Items