Satisficing Q-Learning: Efficient Learning in Problems With Dichotomous Attributes

Abstract

In some environments, a learning agent must learn to balance competing objectives. For example, a Q-learner agent may meed to learn which choices expose the agent to risk and which choices lead to a goal. This paper presents a variant of Q learning that learns a pair of utilities die worlds with dicotomous attributes and showe that this algorithm prpperly balances the competing objectives and, as a result, efficiently identifies satisficing solutions. This occurs because exploration of the environment is restricted to those options which, according to current knowledge, are likely to avoid exposure to risk. We empirically validate the algorithm by (a) showing that the algorithm quickly comnverges to good policies in several simulated worlds of various complexities and (b) applying the algorithm to learning a force feedback profile for a gas pedal that helps drivers avoid risk situations.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2004
Accession Number
ADA451568

Entities

People

  • Michael A Goodrich
  • Morgan Quigley

Organizations

  • Brigham Young University

Tags

Communities of Interest

  • Human Systems
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Abstracts
  • Acceptability
  • Algorithms
  • Collisions
  • Computer Science
  • Convergence
  • Environment
  • Feedback
  • Learning
  • Observation
  • Probability
  • Simulations
  • Simulators
  • Standards
  • Steady State
  • Training
  • Workload

Fields of Study

  • Computer science

Readers

  • Economics
  • Neural Network Machine Learning.
  • Operations Research