Balancing Multiple Sources of Reward in Reinforcement Learning

Abstract

For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems include agents with multiple goals and agents with multiple users. Creating a single reward value by combining the multiple components can throw away vital information and can lead to incorrect solutions. We describe the multiple reward source problem and discuss the problems with applying traditional reinforcement learning. We then present an new algorithm for finding a solution and results on simulated environments.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2006
Accession Number
ADA454702

Entities

People

  • Christian R. Shelton

Organizations

  • Massachusetts Institute of Technology

Tags

DTIC Thesaurus Topics

  • Algorithms
  • Applied Computer Science
  • Artificial Intelligence
  • Cognitive Science
  • Computations
  • Computer Science
  • Electric Power
  • Equations
  • Estimators
  • Game Theory
  • Information Operations
  • Iterations
  • Learning
  • Observation
  • Reinforcement Learning
  • Training

Readers

  • Brain and Cognitive Science; Experimental Psychology; Cognitive Neuroscience
  • Linear Algebra
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Machine Learning Algorithms