Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Godara, Prakhar
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.08049
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915286080290816
author	Godara, Prakhar
author_facet	Godara, Prakhar
contents	Recent studies claim that human behavior in a two-armed Bernoulli bandit (TABB) task is described by positivity and confirmation biases, implying that humans do not integrate new information objectively. However, we find that even if the agent updates its belief via objective Bayesian inference, fitting the standard Q-learning model with asymmetric learning rates still recovers both biases. Bayesian inference cast as an effective Q-learning algorithm has symmetric, though decreasing, learning rates. We explain this by analyzing the stochastic dynamics of these learning systems using master equations. We find that both confirmation bias and unbiased but decreasing learning rates yield the same behavioral signatures. Finally, we propose experimental protocols to disentangle true cognitive biases from artifacts of decreasing learning rates.
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_08049
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Bias or Optimality? Disentangling Bayesian Inference and Learning Biases in Human Decision-Making Godara, Prakhar Artificial Intelligence Recent studies claim that human behavior in a two-armed Bernoulli bandit (TABB) task is described by positivity and confirmation biases, implying that humans do not integrate new information objectively. However, we find that even if the agent updates its belief via objective Bayesian inference, fitting the standard Q-learning model with asymmetric learning rates still recovers both biases. Bayesian inference cast as an effective Q-learning algorithm has symmetric, though decreasing, learning rates. We explain this by analyzing the stochastic dynamics of these learning systems using master equations. We find that both confirmation bias and unbiased but decreasing learning rates yield the same behavioral signatures. Finally, we propose experimental protocols to disentangle true cognitive biases from artifacts of decreasing learning rates.
title	Bias or Optimality? Disentangling Bayesian Inference and Learning Biases in Human Decision-Making
topic	Artificial Intelligence
url	https://arxiv.org/abs/2505.08049

Similar Items