Saved in:
Bibliographic Details
Main Authors: Troshin, Sergey, Saparina, Irina, Fokkens, Antske, Niculae, Vlad
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2509.17570
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Large language models increasingly rely on explicit reasoning chains and can produce multiple plausible responses for a given context. We study the candidate sampler that produces the set of plausible responses contrasting the ancestral (parallel) sampling against two alternatives: enumeration, which asks the model to produce $n$ candidates in one pass, and iterative sampling, which proposes candidates sequentially while conditioning on the currently generated response set. Under matched budgets, we compare these samplers on quality, lexical and computation flow diversity, and efficiency. Our empirical results demonstrate that enumeration and iterative strategies result in higher diversity at comparable quality. Our findings highlight the potential of simple non-independent sampling strategies to improve response diversity without sacrificing generation quality.