:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Schoenegger, Philipp, Jones, Cameron R., Tetlock, Philip E., Mellers, Barbara
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2506.01578
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Wisdom of the Silicon Crowd: LLM Ensemble Prediction Capabilities Rival Human Crowd Accuracy
by: Schoenegger, Philipp, et al.
Published: (2024)

AI-Augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy
by: Schoenegger, Philipp, et al.
Published: (2024)

ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities
by: Karger, Ezra, et al.
Published: (2024)

Compact Example-Based Explanations for Language Models
by: Schoenegger, Loris, et al.
Published: (2026)

LLMs Can Teach Themselves to Better Predict the Future
by: Turtel, Benjamin, et al.
Published: (2025)

Paraphrase Types Elicit Prompt Engineering Capabilities
by: Wahle, Jan Philip, et al.
Published: (2024)

Large Language Models Pass the Turing Test
by: Jones, Cameron R., et al.
Published: (2025)

Exploring the Capabilities of Prompted Large Language Models in Educational and Assessment Applications
by: Maity, Subhankar, et al.
Published: (2024)

Influential Training Data Retrieval for Explaining Verbalized Confidence of LLMs
by: Xia, Yuxi, et al.
Published: (2026)

Autonomous Prompt Engineering in Large Language Models
by: Kepel, Daan, et al.
Published: (2024)

Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models
by: Wang, Rui, et al.
Published: (2024)

Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models
by: Mondorf, Philipp, et al.
Published: (2024)

An Evaluation of Explanation Methods for Black-Box Detectors of Machine-Generated Text
by: Schoenegger, Loris, et al.
Published: (2024)

Forecasting Frontier Language Model Agent Capabilities
by: Pimpale, Govind, et al.
Published: (2025)

Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning
by: Mondorf, Philipp, et al.
Published: (2024)

Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models
by: Jones, Cameron R., et al.
Published: (2024)

Large Language Models Persuade Without Planning Theory of Mind
by: Moore, Jared, et al.
Published: (2026)

If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models
by: Orth, Jasmin, et al.
Published: (2025)

Select or Project? Evaluating Lower-dimensional Vectors for LLM Training Data Explanations
by: Hinterleitner, Lukas, et al.
Published: (2026)

Beyond Accuracy: Evaluating the Reasoning Behavior of Large Language Models -- A Survey
by: Mondorf, Philipp, et al.
Published: (2024)

Wordflow: Social Prompt Engineering for Large Language Models
by: Wang, Zijie J., et al.
Published: (2024)

Disinformation Capabilities of Large Language Models
by: Vykopal, Ivan, et al.
Published: (2023)

Unlocking Prompt Infilling Capability for Diffusion Language Models
by: Fujinuma, Yoshinari, et al.
Published: (2026)

Don't Listen To Me: Understanding and Exploring Jailbreak Prompts of Large Language Models
by: Yu, Zhiyuan, et al.
Published: (2024)

Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding
by: Belay, Tadesse Destaw, et al.
Published: (2024)

Controllable Abstraction in Summary Generation for Large Language Models via Prompt Engineering
by: Song, Xiangchen, et al.
Published: (2025)

Using Pretrained Large Language Model with Prompt Engineering to Answer Biomedical Questions
by: Zhou, Wenxin, et al.
Published: (2024)

RePrompt: Planning by Automatic Prompt Engineering for Large Language Models Agents
by: Chen, Weizhe, et al.
Published: (2024)

Integrating Chemistry Knowledge in Large Language Models via Prompt Engineering
by: Liu, Hongxuan, et al.
Published: (2024)

Cultural Alignment in Large Language Models Using Soft Prompt Tuning
by: Masoud, Reem I., et al.
Published: (2025)

Large Language Model Capabilities in Perioperative Risk Prediction and Prognostication
by: Chung, Philip, et al.
Published: (2024)

Reasoning Capabilities and Invariability of Large Language Models
by: Raganato, Alessandro, et al.
Published: (2025)

A Sequential Optimal Learning Approach to Automated Prompt Engineering in Large Language Models
by: Wang, Shuyang, et al.
Published: (2025)

Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering
by: Hu, Yan, et al.
Published: (2023)

Unlocking General Long Chain-of-Thought Reasoning Capabilities of Large Language Models via Representation Engineering
by: Tang, Xinyu, et al.
Published: (2025)

LogicSkills: A Structured Benchmark for Formal Reasoning in Large Language Models
by: Rabern, Brian, et al.
Published: (2026)

PromptExp: Multi-granularity Prompt Explanation of Large Language Models
by: Dong, Ximing, et al.
Published: (2024)

ORPP: Self-Optimizing Role-playing Prompts to Enhance Language Model Capabilities
by: Duan, Yifan, et al.
Published: (2025)

Improving Natural Language Capability of Code Large Language Model
by: Li, Wei, et al.
Published: (2024)

Towards Goal-oriented Prompt Engineering for Large Language Models: A Survey
by: Li, Haochen, et al.
Published: (2024)