Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Gelpí, Rebekah A., Xue, Eric, Cunningham, William A.
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2507.03682
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908434515886080
author	Gelpí, Rebekah A. Xue, Eric Cunningham, William A.
author_facet	Gelpí, Rebekah A. Xue, Eric Cunningham, William A.
contents	We propose a hybrid approach to machine Theory of Mind (ToM) that uses large language models (LLMs) as a mechanism for generating hypotheses and likelihood functions with a Bayesian inverse planning model that computes posterior probabilities for an agent's likely mental states given its actions. Bayesian inverse planning models can accurately predict human reasoning on a variety of ToM tasks, but these models are constrained in their ability to scale these predictions to scenarios with a large number of possible hypotheses and actions. Conversely, LLM-based approaches have recently demonstrated promise in solving ToM benchmarks, but can exhibit brittleness and failures on reasoning tasks even when they pass otherwise structurally identical versions. By combining these two methods, this approach leverages the strengths of each component, closely matching optimal results on a task inspired by prior inverse planning models and improving performance relative to models that utilize LLMs alone or with chain-of-thought prompting, even with smaller LLMs that typically perform poorly on ToM tasks. We also exhibit the model's potential to predict mental states on open-ended tasks, offering a promising direction for future development of ToM models and the creation of socially intelligent generative agents.
format	Preprint
id	arxiv_https___arxiv_org_abs_2507_03682
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Towards Machine Theory of Mind with Large Language Model-Augmented Inverse Planning Gelpí, Rebekah A. Xue, Eric Cunningham, William A. Artificial Intelligence Machine Learning We propose a hybrid approach to machine Theory of Mind (ToM) that uses large language models (LLMs) as a mechanism for generating hypotheses and likelihood functions with a Bayesian inverse planning model that computes posterior probabilities for an agent's likely mental states given its actions. Bayesian inverse planning models can accurately predict human reasoning on a variety of ToM tasks, but these models are constrained in their ability to scale these predictions to scenarios with a large number of possible hypotheses and actions. Conversely, LLM-based approaches have recently demonstrated promise in solving ToM benchmarks, but can exhibit brittleness and failures on reasoning tasks even when they pass otherwise structurally identical versions. By combining these two methods, this approach leverages the strengths of each component, closely matching optimal results on a task inspired by prior inverse planning models and improving performance relative to models that utilize LLMs alone or with chain-of-thought prompting, even with smaller LLMs that typically perform poorly on ToM tasks. We also exhibit the model's potential to predict mental states on open-ended tasks, offering a promising direction for future development of ToM models and the creation of socially intelligent generative agents.
title	Towards Machine Theory of Mind with Large Language Model-Augmented Inverse Planning
topic	Artificial Intelligence Machine Learning
url	https://arxiv.org/abs/2507.03682

Similar Items