Reward-based training of recurrent neural networks for cognitive and value-based tasks

Abstract

Trained neural network models, which exhibit features of neural activity recorded from behaving animals, may provide insights into the circuit mechanisms of cognitive functions through systematic analysis of network activity and connectivity. However, in contrast to the graded error signals commonly used to train networks through supervised learning, animals learn from reward feedback on definite actions through reinforcement learning. Reward maximization is particularly relevant when optimal behavior depends on an animal’s internal judgment of confidence or subjective preferences. Here, we implement reward-based training of recurrent neural networks in which a value network guides learning by using the activity of the decision network to predict future reward. We show that such models capture behavioral and electrophysiological findings from well-known experimental paradigms. Our work provides a unified framework for investigating diverse cognitive and value-based computations, and predicts a role for value representation that is essential for learning, but not executing, a task.

Document Details

Document Type
Pub Defense Publication
Publication Date
Jan 13, 2017
Source ID
10.7554/elife.21492

Entities

People

  • Guangyu R Yang
  • Hao Song
  • Xiao-Jing Wang

Organizations

  • Google
  • New York University
  • Office of Naval Research

Tags

Fields of Study

  • Biology
  • Computer science
  • Psychology

Readers

  • Agent-Based Social Robotics and Mobile-Assisted Learning in Virtual Environments.
  • Neural Network Machine Learning.
  • Team-Based Human-Centered Cognitive Task Decision Making and Information Performance.

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks