Saved in:
Bibliographic Details
Main Authors: Rmus, Milena, Jagadish, Akshay K., Mathony, Marvin, Ludwig, Tobias, Schulz, Eric
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.00879
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918189449871360
author Rmus, Milena
Jagadish, Akshay K.
Mathony, Marvin
Ludwig, Tobias
Schulz, Eric
author_facet Rmus, Milena
Jagadish, Akshay K.
Mathony, Marvin
Ludwig, Tobias
Schulz, Eric
contents Computational cognitive models, which formalize theories of cognition, enable researchers to quantify cognitive processes and arbitrate between competing theories by fitting models to behavioral data. Traditionally, these models are handcrafted, which requires significant domain knowledge, coding expertise, and time investment. However, recent advances in machine learning offer solutions to these challenges. In particular, Large Language Models (LLMs) have demonstrated remarkable capabilities for in-context pattern recognition, leveraging knowledge from diverse domains to solve complex problems, and generating executable code that can be used to facilitate the generation of cognitive models. Building on this potential, we introduce a pipeline for Guided generation of Computational Cognitive Models (GeCCo). Given task instructions, participant data, and a template function, GeCCo prompts an LLM to propose candidate models, fits proposals to held-out data, and iteratively refines them based on feedback constructed from their predictive performance. We benchmark this approach across four different cognitive domains -- decision making, learning, planning, and memory -- using three open-source LLMs, spanning different model sizes, capacities, and families. On four human behavioral data sets, the LLM generated models that consistently matched or outperformed the best domain-specific models from the cognitive science literature. Taken together, our results suggest that LLMs can generate cognitive models with conceptually plausible theories that rival -- or even surpass -- the best models from the literature across diverse task domains.
format Preprint
id arxiv_https___arxiv_org_abs_2502_00879
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Generating Computational Cognitive Models using Large Language Models
Rmus, Milena
Jagadish, Akshay K.
Mathony, Marvin
Ludwig, Tobias
Schulz, Eric
Machine Learning
Computational cognitive models, which formalize theories of cognition, enable researchers to quantify cognitive processes and arbitrate between competing theories by fitting models to behavioral data. Traditionally, these models are handcrafted, which requires significant domain knowledge, coding expertise, and time investment. However, recent advances in machine learning offer solutions to these challenges. In particular, Large Language Models (LLMs) have demonstrated remarkable capabilities for in-context pattern recognition, leveraging knowledge from diverse domains to solve complex problems, and generating executable code that can be used to facilitate the generation of cognitive models. Building on this potential, we introduce a pipeline for Guided generation of Computational Cognitive Models (GeCCo). Given task instructions, participant data, and a template function, GeCCo prompts an LLM to propose candidate models, fits proposals to held-out data, and iteratively refines them based on feedback constructed from their predictive performance. We benchmark this approach across four different cognitive domains -- decision making, learning, planning, and memory -- using three open-source LLMs, spanning different model sizes, capacities, and families. On four human behavioral data sets, the LLM generated models that consistently matched or outperformed the best domain-specific models from the cognitive science literature. Taken together, our results suggest that LLMs can generate cognitive models with conceptually plausible theories that rival -- or even surpass -- the best models from the literature across diverse task domains.
title Generating Computational Cognitive Models using Large Language Models
topic Machine Learning
url https://arxiv.org/abs/2502.00879