Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wong, Justin, Orlovskiy, Yury, Luo, Michael, Seshia, Sanjit A., Gonzalez, Joseph E.
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2410.09038
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917801790275584
author	Wong, Justin Orlovskiy, Yury Luo, Michael Seshia, Sanjit A. Gonzalez, Joseph E.
author_facet	Wong, Justin Orlovskiy, Yury Luo, Michael Seshia, Sanjit A. Gonzalez, Joseph E.
contents	Generating diverse responses from large language models (LLMs) is crucial for applications such as planning/search and synthetic data generation, where diversity provides distinct answers across generations. Prior approaches rely on increasing temperature to increase diversity. However, contrary to popular belief, we show not only does this approach produce lower quality individual generations as temperature increases, but it depends on model's next-token probabilities being similar to the true distribution of answers. We propose SimpleStrat, an alternative approach that uses the language model itself to partition the space into strata. At inference, a random stratum is selected and a sample drawn from within the strata. To measure diversity, we introduce CoverageQA, a dataset of underspecified questions with multiple equally plausible answers, and assess diversity by measuring KL Divergence between the output distribution and uniform distribution over valid ground truth answers. As computing probability per response/solution for proprietary models is infeasible, we measure recall on ground truth solutions. Our evaluation show using SimpleStrat achieves higher recall by 0.05 compared to GPT-4o and 0.36 average reduction in KL Divergence compared to Llama 3.
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_09038
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	SimpleStrat: Diversifying Language Model Generation with Stratification Wong, Justin Orlovskiy, Yury Luo, Michael Seshia, Sanjit A. Gonzalez, Joseph E. Computation and Language Artificial Intelligence Generating diverse responses from large language models (LLMs) is crucial for applications such as planning/search and synthetic data generation, where diversity provides distinct answers across generations. Prior approaches rely on increasing temperature to increase diversity. However, contrary to popular belief, we show not only does this approach produce lower quality individual generations as temperature increases, but it depends on model's next-token probabilities being similar to the true distribution of answers. We propose SimpleStrat, an alternative approach that uses the language model itself to partition the space into strata. At inference, a random stratum is selected and a sample drawn from within the strata. To measure diversity, we introduce CoverageQA, a dataset of underspecified questions with multiple equally plausible answers, and assess diversity by measuring KL Divergence between the output distribution and uniform distribution over valid ground truth answers. As computing probability per response/solution for proprietary models is infeasible, we measure recall on ground truth solutions. Our evaluation show using SimpleStrat achieves higher recall by 0.05 compared to GPT-4o and 0.36 average reduction in KL Divergence compared to Llama 3.
title	SimpleStrat: Diversifying Language Model Generation with Stratification
topic	Computation and Language Artificial Intelligence
url	https://arxiv.org/abs/2410.09038

Similar Items