Dynamic decision policy reconfiguration under outcome uncertainty

Abstract

In uncertain or unstable environments, sometimes the best decision is to change your mind. To shed light on this flexibility, we evaluated how the underlying decision policy adapts when the most rewarding action changes. Human participants performed a dynamic two-armed bandit task that manipulated the certainty in relative reward (conflict) and the reliability of action-outcomes (volatility). Continuous estimates of conflict and volatility contributed to shifts in exploratory states by changing both the rate of evidence accumulation (drift rate) and the amount of evidence needed to make a decision (boundary height), respectively. At the trialwise level, following a switch in the optimal choice, the drift rate plummets and the boundary height weakly spikes, leading to a slow exploratory state. We find that the drift rate drives most of this response, with an unreliable contribution of boundary height across experiments. Surprisingly, we find no evidence that pupillary responses associated with decision policy changes. We conclude that humans show a stereotypical shift in their decision policies in response to environmental changes.

Document Details

Document Type
Pub Defense Publication
Publication Date
Dec 24, 2021
Source ID
10.7554/elife.65540

Entities

People

  • Alexis Porter
  • Jonathan E. Rubin
  • Krista M Bond
  • Kyle Dunovan
  • Timothy Verstynen

Organizations

  • Air Force Research Laboratory
  • Carnegie Mellon University
  • Northwestern University
  • University of Pittsburgh

Tags

Readers

  • Adaptive Control and Estimation with Uncertainty in Dynamic Systems.
  • Educational Psychology
  • Systems Analysis and Design