Strategically managing learning during perceptual decision making

Abstract

Making optimal decisions in the face of noise requires balancing short-term speed and accuracy. But a theory of optimality should account for the fact that short-term speed can influence long-term accuracy through learning. Here, we demonstrate that long-term learning is an important dynamical dimension of the speed-accuracy trade-off. We study learning trajectories in rats and formally characterize these dynamics in a theory expressed as both a recurrent neural network and an analytical extension of the drift-diffusion model that learns over time. The model reveals that choosing suboptimal response times to learn faster sacrifices immediate reward, but can lead to greater total reward. We empirically verify predictions of the theory, including a relationship between stimulus exposure and learning speed, and a modulation of reaction time by future learning prospects. We find that rats’ strategies approximately maximize total reward over the full learning epoch, suggesting cognitive control over the learning process.

Document Details

Document Type
Pub Defense Publication
Publication Date
Feb 14, 2023
Source ID
10.7554/elife.64978

Entities

People

  • Andrew Saxe
  • David D. Cox
  • Javier Masís
  • Juliana Y Rhee
  • Travis Chapman

Organizations

  • Harvard University
  • Intelligence Advanced Research Projects Activity
  • Princeton University
  • Richard and Susan Smith Family Foundation
  • Royal Society
  • University of Oxford
  • Wellcome

Tags

Fields of Study

  • Biology

Readers

  • Adaptive Control and Estimation with Uncertainty in Dynamic Systems.
  • Agent-Based Social Robotics and Mobile-Assisted Learning in Virtual Environments.
  • Brain and Cognitive Science; Experimental Psychology; Cognitive Neuroscience

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms
  • AI & ML - Neural Networks