Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Vandemoortele, Nathan, Steenwinckel, Bram, Ongenae, Femke, Van Hoecke, Sofie
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2502.08436
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912688435625984
author	Vandemoortele, Nathan Steenwinckel, Bram Ongenae, Femke Van Hoecke, Sofie
author_facet	Vandemoortele, Nathan Steenwinckel, Bram Ongenae, Femke Van Hoecke, Sofie
contents	We present Label Space Reduction (LSR), a novel method for improving zero-shot classification performance of Large Language Models (LLMs). LSR iteratively refines the classification label space by systematically ranking and reducing candidate classes, enabling the model to concentrate on the most relevant options. By leveraging unlabeled data with the statistical learning capabilities of data-driven models, LSR dynamically optimizes the label space representation at test time. Our experiments across seven benchmarks demonstrate that LSR improves macro-F1 scores by an average of 7.0% (up to 14.2%) with Llama-3.1-70B and 3.3% (up to 11.1%) with Claude-3.5-Sonnet compared to standard zero-shot classification baselines. To reduce the computational overhead of LSR, which requires an additional LLM call at each iteration, we propose distilling the model into a probabilistic classifier, allowing for efficient inference.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_08436
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	From Haystack to Needle: Label Space Reduction for Zero-shot Classification Vandemoortele, Nathan Steenwinckel, Bram Ongenae, Femke Van Hoecke, Sofie Computation and Language Artificial Intelligence Machine Learning We present Label Space Reduction (LSR), a novel method for improving zero-shot classification performance of Large Language Models (LLMs). LSR iteratively refines the classification label space by systematically ranking and reducing candidate classes, enabling the model to concentrate on the most relevant options. By leveraging unlabeled data with the statistical learning capabilities of data-driven models, LSR dynamically optimizes the label space representation at test time. Our experiments across seven benchmarks demonstrate that LSR improves macro-F1 scores by an average of 7.0% (up to 14.2%) with Llama-3.1-70B and 3.3% (up to 11.1%) with Claude-3.5-Sonnet compared to standard zero-shot classification baselines. To reduce the computational overhead of LSR, which requires an additional LLM call at each iteration, we propose distilling the model into a probabilistic classifier, allowing for efficient inference.
title	From Haystack to Needle: Label Space Reduction for Zero-shot Classification
topic	Computation and Language Artificial Intelligence Machine Learning
url	https://arxiv.org/abs/2502.08436

Similar Items