Bibliographic Details
Main Author: Nowak, Robert
Format: Preprint
Published: 2025
Online Access: https://arxiv.org/abs/2509.19489
Table of Contents:
  • Systems often repeat the same prompt to large language models (LLMs) and aggregate responses to improve reliability. This short note analyzes an estimator of the self-consistency of LLMs and the tradeoffs it induces under a fixed compute budget $B=mn$, where $m$ is the number of prompts sampled from the task distribution and $n$ is the number of repeated LLM calls per prompt; the resulting analysis favors a rough split $m,n\propto\sqrt{B}$.
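The budget split described above can be illustrated with a small sketch. It is not taken from the preprint: the function name `split_budget`, the `ratio` parameter, and the default proportionality constant of 1 are assumptions made only for illustration, since the abstract states the scaling $m,n\propto\sqrt{B}$ but not the constants.

```python
import math


def split_budget(budget: int, ratio: float = 1.0) -> tuple[int, int]:
    """Split a call budget B into (m, n) with m * n <= B.

    m is the number of prompts and n the number of repeated calls per
    prompt. `ratio` is the target value of m / n; ratio = 1 gives
    m and n both close to sqrt(B). The value of `ratio` is an assumption
    of this sketch, not something stated in the abstract.
    """
    if budget < 1:
        raise ValueError("budget must be at least 1")
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    # With m = ratio * n and m * n = B, we get n = sqrt(B / ratio).
    n = round(math.sqrt(budget / ratio))
    # Clamp so that at least one call is made and the budget is not exceeded.
    n = max(1, min(budget, n))
    # Spend whatever budget remains on distinct prompts.
    m = budget // n
    return m, n


if __name__ == "__main__":
    for b in (100, 1000, 10_000):
        m, n = split_budget(b)
        print(f"B={b}: m={m} prompts, n={n} calls per prompt, used={m * n}")
```

Because $m$ and $n$ must be integers, the product $mn$ can fall slightly below $B$; the sketch leaves the remainder unspent rather than exceeding the budget.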