Saved in:
Bibliographic Details
Main Author: Godara, Prakhar
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2505.08049
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915286080290816
author Godara, Prakhar
author_facet Godara, Prakhar
contents Recent studies claim that human behavior in a two-armed Bernoulli bandit (TABB) task is described by positivity and confirmation biases, implying that humans do not integrate new information objectively. However, we find that even if the agent updates its belief via objective Bayesian inference, fitting the standard Q-learning model with asymmetric learning rates still recovers both biases. Bayesian inference cast as an effective Q-learning algorithm has symmetric, though decreasing, learning rates. We explain this by analyzing the stochastic dynamics of these learning systems using master equations. We find that both confirmation bias and unbiased but decreasing learning rates yield the same behavioral signatures. Finally, we propose experimental protocols to disentangle true cognitive biases from artifacts of decreasing learning rates.
format Preprint
id arxiv_https___arxiv_org_abs_2505_08049
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Bias or Optimality? Disentangling Bayesian Inference and Learning Biases in Human Decision-Making
Godara, Prakhar
Artificial Intelligence
Recent studies claim that human behavior in a two-armed Bernoulli bandit (TABB) task is described by positivity and confirmation biases, implying that humans do not integrate new information objectively. However, we find that even if the agent updates its belief via objective Bayesian inference, fitting the standard Q-learning model with asymmetric learning rates still recovers both biases. Bayesian inference cast as an effective Q-learning algorithm has symmetric, though decreasing, learning rates. We explain this by analyzing the stochastic dynamics of these learning systems using master equations. We find that both confirmation bias and unbiased but decreasing learning rates yield the same behavioral signatures. Finally, we propose experimental protocols to disentangle true cognitive biases from artifacts of decreasing learning rates.
title Bias or Optimality? Disentangling Bayesian Inference and Learning Biases in Human Decision-Making
topic Artificial Intelligence
url https://arxiv.org/abs/2505.08049