Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Sypherd, Chris, Petrov, Sergei, George, Sonny, Belle, Vaishak
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2505.14880
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918027806638080
author	Sypherd, Chris Petrov, Sergei George, Sonny Belle, Vaishak
author_facet	Sypherd, Chris Petrov, Sergei George, Sonny Belle, Vaishak
contents	In recent years, large language models have demonstrated remarkable performance across diverse tasks. However, their task effectiveness is heavily dependent on the prompting strategy used to elicit output, which can vary widely in both performance and token usage. While task performance is often used to determine prompting strategy success, we argue that efficiency--balancing performance and token usage--can be a more practical metric for real-world utility. To enable this, we propose Big-$O_{tok}$, a theoretical framework for describing the token usage growth of prompting strategies, and analyze Token Cost, an empirical measure of tokens per performance. We apply these to several common prompting strategies and find that increased token usage leads to drastically diminishing performance returns. Our results validate the Big-$O_{tok}$ analyses and reinforce the need for efficiency-aware evaluations.
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_14880
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Incorporating Token Usage into Prompting Strategy Evaluation Sypherd, Chris Petrov, Sergei George, Sonny Belle, Vaishak Computation and Language In recent years, large language models have demonstrated remarkable performance across diverse tasks. However, their task effectiveness is heavily dependent on the prompting strategy used to elicit output, which can vary widely in both performance and token usage. While task performance is often used to determine prompting strategy success, we argue that efficiency--balancing performance and token usage--can be a more practical metric for real-world utility. To enable this, we propose Big-$O_{tok}$, a theoretical framework for describing the token usage growth of prompting strategies, and analyze Token Cost, an empirical measure of tokens per performance. We apply these to several common prompting strategies and find that increased token usage leads to drastically diminishing performance returns. Our results validate the Big-$O_{tok}$ analyses and reinforce the need for efficiency-aware evaluations.
title	Incorporating Token Usage into Prompting Strategy Evaluation
topic	Computation and Language
url	https://arxiv.org/abs/2505.14880

Similar Items