Saved in:
Bibliographic Details
Main Authors: Piotrowski, Bartosz, Drzewakowski, Witold, Staniszewski, Konrad, Miłoś, Piotr
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2504.16760
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916703836831744
author Piotrowski, Bartosz
Drzewakowski, Witold
Staniszewski, Konrad
Miłoś, Piotr
author_facet Piotrowski, Bartosz
Drzewakowski, Witold
Staniszewski, Konrad
Miłoś, Piotr
contents Verifiers are auxiliary models that assess the correctness of outputs generated by base large language models (LLMs). They play a crucial role in many strategies for solving reasoning-intensive problems with LLMs. Typically, verifiers are LLMs themselves, often as large (or larger) than the base model they support, making them computationally expensive. In this work, we introduce a novel lightweight verification approach, LiLaVe, which reliably extracts correctness signals from the hidden states of the base LLM. A key advantage of LiLaVe is its ability to operate with only a small fraction of the computational budget required by traditional LLM-based verifiers. To demonstrate its practicality, we couple LiLaVe with popular meta-generation strategies, like best-of-n or self-consistency. Moreover, we design novel LiLaVe-based approaches, like conditional self-correction or conditional majority voting, that significantly improve both accuracy and efficiency in generation tasks with smaller LLMs. Our work demonstrates the fruitfulness of extracting latent information from the hidden states of LLMs, and opens the door to scalable and resource-efficient solutions for reasoning-intensive applications.
format Preprint
id arxiv_https___arxiv_org_abs_2504_16760
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Lightweight Latent Verifiers for Efficient Meta-Generation Strategies
Piotrowski, Bartosz
Drzewakowski, Witold
Staniszewski, Konrad
Miłoś, Piotr
Artificial Intelligence
Verifiers are auxiliary models that assess the correctness of outputs generated by base large language models (LLMs). They play a crucial role in many strategies for solving reasoning-intensive problems with LLMs. Typically, verifiers are LLMs themselves, often as large (or larger) than the base model they support, making them computationally expensive. In this work, we introduce a novel lightweight verification approach, LiLaVe, which reliably extracts correctness signals from the hidden states of the base LLM. A key advantage of LiLaVe is its ability to operate with only a small fraction of the computational budget required by traditional LLM-based verifiers. To demonstrate its practicality, we couple LiLaVe with popular meta-generation strategies, like best-of-n or self-consistency. Moreover, we design novel LiLaVe-based approaches, like conditional self-correction or conditional majority voting, that significantly improve both accuracy and efficiency in generation tasks with smaller LLMs. Our work demonstrates the fruitfulness of extracting latent information from the hidden states of LLMs, and opens the door to scalable and resource-efficient solutions for reasoning-intensive applications.
title Lightweight Latent Verifiers for Efficient Meta-Generation Strategies
topic Artificial Intelligence
url https://arxiv.org/abs/2504.16760