Saved in:
Bibliographic Details
Main Authors: Zhang, Haoting, He, Jinghai, Righter, Rhonda, Zheng, Zeyu
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.08164
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910452381908992
author Zhang, Haoting
He, Jinghai
Righter, Rhonda
Zheng, Zeyu
author_facet Zhang, Haoting
He, Jinghai
Righter, Rhonda
Zheng, Zeyu
contents With the advancement in generative language models, the selection of prompts has gained significant attention in recent years. A prompt is an instruction or description provided by the user, serving as a guide for the generative language model in content generation. Despite existing methods for prompt selection that are based on human labor, we consider facilitating this selection through simulation optimization, aiming to maximize a pre-defined score for the selected prompt. Specifically, we propose a two-stage framework. In the first stage, we determine a feasible set of prompts in sufficient numbers, where each prompt is represented by a moderate-dimensional vector. In the subsequent stage for evaluation and selection, we construct a surrogate model of the score regarding the moderate-dimensional vectors that represent the prompts. We propose sequentially selecting the prompt for evaluation based on this constructed surrogate model. We prove the consistency of the sequential evaluation procedure in our framework. We also conduct numerical experiments to demonstrate the efficacy of our proposed framework, providing practical instructions for implementation.
format Preprint
id arxiv_https___arxiv_org_abs_2404_08164
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Language Model Prompt Selection via Simulation Optimization
Zhang, Haoting
He, Jinghai
Righter, Rhonda
Zheng, Zeyu
Machine Learning
Artificial Intelligence
Computation and Language
With the advancement in generative language models, the selection of prompts has gained significant attention in recent years. A prompt is an instruction or description provided by the user, serving as a guide for the generative language model in content generation. Despite existing methods for prompt selection that are based on human labor, we consider facilitating this selection through simulation optimization, aiming to maximize a pre-defined score for the selected prompt. Specifically, we propose a two-stage framework. In the first stage, we determine a feasible set of prompts in sufficient numbers, where each prompt is represented by a moderate-dimensional vector. In the subsequent stage for evaluation and selection, we construct a surrogate model of the score regarding the moderate-dimensional vectors that represent the prompts. We propose sequentially selecting the prompt for evaluation based on this constructed surrogate model. We prove the consistency of the sequential evaluation procedure in our framework. We also conduct numerical experiments to demonstrate the efficacy of our proposed framework, providing practical instructions for implementation.
title Language Model Prompt Selection via Simulation Optimization
topic Machine Learning
Artificial Intelligence
Computation and Language
url https://arxiv.org/abs/2404.08164