Reinforcement Learning: A Tutorial.

Abstract

The purpose of this tutorial is to provide an introduction to reinforcement learning (RL) at a level easily understood by students and researchers in a wide range of disciplines. The intent is not to present a rigorous mathematical discussion that requires a great deal of effort on the part of the reader, but rather to present a conceptual framework that might serve as an introduction to a more rigorous study of RL. The fundamental principles and techniques used to solve RL problems are presented. The most popular RL algorithms are presented. Section (1) presents an overview of RL and provides a simple example to develop intuition of the underlying dynamic programming mechanism. In Section (2) the parts of a reinforcement learning problem are discussed. These include the environment, reinforcement function, and value function. Section (3) gives a description of the most widely used reinforcement learning algorithms. These include TD(lambda) and both the residual and direct forms of value iteration, Q-learning, and advantage learning. In Section (4) some of the ancillary issues of RL are briefly discussed, such as choosing an exploration strategy and a discount factor. The conclusion is given in Section (5). Finally, Section (6) is a glossary of commonly used terms followed by references and bibliography.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1997
Accession Number
ADA323194

Entities

People

  • Mance E. Harmon
  • Stephanie S. Harmon

Organizations

  • Wright Laboratory

Tags

Communities of Interest

  • Air Platforms
  • Autonomy
  • Weapons Technologies

DTIC Thesaurus Topics

  • Air Force
  • Aircrafts
  • Algorithms
  • Artificial Intelligence
  • Computer Programming
  • Computers
  • Control Systems
  • Dynamic Programming
  • Environment
  • Governments
  • Iterations
  • Machine Learning
  • Markov Chains
  • Neural Networks
  • Probability
  • Reinforcement Learning
  • Supervised Machine Learning

Readers

  • Agent-Based Social Robotics and Mobile-Assisted Learning in Virtual Environments.
  • Library and Information Science
  • Theoretical Analysis.

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms