Saved in:
Bibliographic Details
Main Authors: Vandemoortele, Nathan, Steenwinckel, Bram, Ongenae, Femke, Van Hoecke, Sofie
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.08436
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912688435625984
author Vandemoortele, Nathan
Steenwinckel, Bram
Ongenae, Femke
Van Hoecke, Sofie
author_facet Vandemoortele, Nathan
Steenwinckel, Bram
Ongenae, Femke
Van Hoecke, Sofie
contents We present Label Space Reduction (LSR), a novel method for improving zero-shot classification performance of Large Language Models (LLMs). LSR iteratively refines the classification label space by systematically ranking and reducing candidate classes, enabling the model to concentrate on the most relevant options. By leveraging unlabeled data with the statistical learning capabilities of data-driven models, LSR dynamically optimizes the label space representation at test time. Our experiments across seven benchmarks demonstrate that LSR improves macro-F1 scores by an average of 7.0% (up to 14.2%) with Llama-3.1-70B and 3.3% (up to 11.1%) with Claude-3.5-Sonnet compared to standard zero-shot classification baselines. To reduce the computational overhead of LSR, which requires an additional LLM call at each iteration, we propose distilling the model into a probabilistic classifier, allowing for efficient inference.
format Preprint
id arxiv_https___arxiv_org_abs_2502_08436
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle From Haystack to Needle: Label Space Reduction for Zero-shot Classification
Vandemoortele, Nathan
Steenwinckel, Bram
Ongenae, Femke
Van Hoecke, Sofie
Computation and Language
Artificial Intelligence
Machine Learning
We present Label Space Reduction (LSR), a novel method for improving zero-shot classification performance of Large Language Models (LLMs). LSR iteratively refines the classification label space by systematically ranking and reducing candidate classes, enabling the model to concentrate on the most relevant options. By leveraging unlabeled data with the statistical learning capabilities of data-driven models, LSR dynamically optimizes the label space representation at test time. Our experiments across seven benchmarks demonstrate that LSR improves macro-F1 scores by an average of 7.0% (up to 14.2%) with Llama-3.1-70B and 3.3% (up to 11.1%) with Claude-3.5-Sonnet compared to standard zero-shot classification baselines. To reduce the computational overhead of LSR, which requires an additional LLM call at each iteration, we propose distilling the model into a probabilistic classifier, allowing for efficient inference.
title From Haystack to Needle: Label Space Reduction for Zero-shot Classification
topic Computation and Language
Artificial Intelligence
Machine Learning
url https://arxiv.org/abs/2502.08436