Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhang, Yong, Li, Hanzhang, Li, Zhitao, Cheng, Ning, Li, Ming, Xiao, Jing, Wang, Jianzong
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2401.09783
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917570124185600
author	Zhang, Yong Li, Hanzhang Li, Zhitao Cheng, Ning Li, Ming Xiao, Jing Wang, Jianzong
author_facet	Zhang, Yong Li, Hanzhang Li, Zhitao Cheng, Ning Li, Ming Xiao, Jing Wang, Jianzong
contents	Large Language Models (LLMs) have shown significant promise in various applications, including zero-shot and few-shot learning. However, their performance can be hampered by inherent biases. Instead of traditionally sought methods that aim to minimize or correct these biases, this study introduces a novel methodology named ``bias-kNN''. This approach capitalizes on the biased outputs, harnessing them as primary features for kNN and supplementing with gold labels. Our comprehensive evaluations, spanning diverse domain text classification datasets and different GPT-2 model sizes, indicate the adaptability and efficacy of the ``bias-kNN'' method. Remarkably, this approach not only outperforms conventional in-context learning in few-shot scenarios but also demonstrates robustness across a spectrum of samples, templates and verbalizers. This study, therefore, presents a unique perspective on harnessing biases, transforming them into assets for enhanced model performance.
format	Preprint
id	arxiv_https___arxiv_org_abs_2401_09783
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Leveraging Biases in Large Language Models: "bias-kNN'' for Effective Few-Shot Learning Zhang, Yong Li, Hanzhang Li, Zhitao Cheng, Ning Li, Ming Xiao, Jing Wang, Jianzong Computation and Language Large Language Models (LLMs) have shown significant promise in various applications, including zero-shot and few-shot learning. However, their performance can be hampered by inherent biases. Instead of traditionally sought methods that aim to minimize or correct these biases, this study introduces a novel methodology named ``bias-kNN''. This approach capitalizes on the biased outputs, harnessing them as primary features for kNN and supplementing with gold labels. Our comprehensive evaluations, spanning diverse domain text classification datasets and different GPT-2 model sizes, indicate the adaptability and efficacy of the ``bias-kNN'' method. Remarkably, this approach not only outperforms conventional in-context learning in few-shot scenarios but also demonstrates robustness across a spectrum of samples, templates and verbalizers. This study, therefore, presents a unique perspective on harnessing biases, transforming them into assets for enhanced model performance.
title	Leveraging Biases in Large Language Models: "bias-kNN'' for Effective Few-Shot Learning
topic	Computation and Language
url	https://arxiv.org/abs/2401.09783

Similar Items