| Main Authors: | Schoenegger, Philipp, Jones, Cameron R., Tetlock, Philip E., Mellers, Barbara |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.01578 |
Similar Items
Wisdom of the Silicon Crowd: LLM Ensemble Prediction Capabilities Rival Human Crowd Accuracy
by: Schoenegger, Philipp, et al.
Published: (2024)
AI-Augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy
by: Schoenegger, Philipp, et al.
Published: (2024)
ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities
by: Karger, Ezra, et al.
Published: (2024)
Compact Example-Based Explanations for Language Models
by: Schoenegger, Loris, et al.
Published: (2026)
LLMs Can Teach Themselves to Better Predict the Future
by: Turtel, Benjamin, et al.
Published: (2025)
Paraphrase Types Elicit Prompt Engineering Capabilities
by: Wahle, Jan Philip, et al.
Published: (2024)
Large Language Models Pass the Turing Test
by: Jones, Cameron R., et al.
Published: (2025)
Exploring the Capabilities of Prompted Large Language Models in Educational and Assessment Applications
by: Maity, Subhankar, et al.
Published: (2024)
Influential Training Data Retrieval for Explaining Verbalized Confidence of LLMs
by: Xia, Yuxi, et al.
Published: (2026)
Autonomous Prompt Engineering in Large Language Models
by: Kepel, Daan, et al.
Published: (2024)
Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models
by: Wang, Rui, et al.
Published: (2024)
Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models
by: Mondorf, Philipp, et al.
Published: (2024)
An Evaluation of Explanation Methods for Black-Box Detectors of Machine-Generated Text
by: Schoenegger, Loris, et al.
Published: (2024)
Forecasting Frontier Language Model Agent Capabilities
by: Pimpale, Govind, et al.
Published: (2025)
Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning
by: Mondorf, Philipp, et al.
Published: (2024)
Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models
by: Jones, Cameron R., et al.
Published: (2024)
Large Language Models Persuade Without Planning Theory of Mind
by: Moore, Jared, et al.
Published: (2026)
If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models
by: Orth, Jasmin, et al.
Published: (2025)
Select or Project? Evaluating Lower-dimensional Vectors for LLM Training Data Explanations
by: Hinterleitner, Lukas, et al.
Published: (2026)
Beyond Accuracy: Evaluating the Reasoning Behavior of Large Language Models -- A Survey
by: Mondorf, Philipp, et al.
Published: (2024)
Wordflow: Social Prompt Engineering for Large Language Models
by: Wang, Zijie J., et al.
Published: (2024)
Disinformation Capabilities of Large Language Models
by: Vykopal, Ivan, et al.
Published: (2023)
Unlocking Prompt Infilling Capability for Diffusion Language Models
by: Fujinuma, Yoshinari, et al.
Published: (2026)
Don't Listen To Me: Understanding and Exploring Jailbreak Prompts of Large Language Models
by: Yu, Zhiyuan, et al.
Published: (2024)
Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding
by: Belay, Tadesse Destaw, et al.
Published: (2024)
Controllable Abstraction in Summary Generation for Large Language Models via Prompt Engineering
by: Song, Xiangchen, et al.
Published: (2025)
Using Pretrained Large Language Model with Prompt Engineering to Answer Biomedical Questions
by: Zhou, Wenxin, et al.
Published: (2024)
RePrompt: Planning by Automatic Prompt Engineering for Large Language Models Agents
by: Chen, Weizhe, et al.
Published: (2024)
Integrating Chemistry Knowledge in Large Language Models via Prompt Engineering
by: Liu, Hongxuan, et al.
Published: (2024)
Cultural Alignment in Large Language Models Using Soft Prompt Tuning
by: Masoud, Reem I., et al.
Published: (2025)
Large Language Model Capabilities in Perioperative Risk Prediction and Prognostication
by: Chung, Philip, et al.
Published: (2024)
Reasoning Capabilities and Invariability of Large Language Models
by: Raganato, Alessandro, et al.
Published: (2025)
A Sequential Optimal Learning Approach to Automated Prompt Engineering in Large Language Models
by: Wang, Shuyang, et al.
Published: (2025)
Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering
by: Hu, Yan, et al.
Published: (2023)
Unlocking General Long Chain-of-Thought Reasoning Capabilities of Large Language Models via Representation Engineering
by: Tang, Xinyu, et al.
Published: (2025)
LogicSkills: A Structured Benchmark for Formal Reasoning in Large Language Models
by: Rabern, Brian, et al.
Published: (2026)
PromptExp: Multi-granularity Prompt Explanation of Large Language Models
by: Dong, Ximing, et al.
Published: (2024)
ORPP: Self-Optimizing Role-playing Prompts to Enhance Language Model Capabilities
by: Duan, Yifan, et al.
Published: (2025)
Improving Natural Language Capability of Code Large Language Model
by: Li, Wei, et al.
Published: (2024)
Towards Goal-oriented Prompt Engineering for Large Language Models: A Survey
by: Li, Haochen, et al.
Published: (2024)