Human Aided Reinforcement Learning in Complex Environments

Abstract

Reinforcement learning algorithms enable computer programs (agents) to learn to solve tasks through a trial-and-error process. As an agent takes actions inan environment, it receives positive and negative signals that shape its future behavior. To assist the process of learning, and to learn the task faster andmore accurately, a human expert can be added to the system to guide an agent in solving the task. This project seeks to expand on current systems thatcombine a human expert with a reinforcement learning agent. Current systems use human input to modify the signal the agent receives from theenvironment, which works particularly well for reactive tasks. In more complex tasks, these systems do not work as intended. The manipulation of theenvironment's signal structure results in undesired and unexpected results for the agent's behavior following human training. Our systems attempt toincorporate humans in ways that do not modify the environment, but rather modify the decisions the agent makes at critical times in training. One of oursolutions (Time Warp) allows the human expert to revert back several seconds in the training of the agent to provide an alternate sequence of actions forthe agent to take. Another solution (Curriculum Development) allows the human expert to set up critical training points for the agent to learn. The agentthen learns how to solve these necessary subskills prior to training in the entire world. Our systems seek to solve the planning requirement by employing ahuman expert during critical times of learning, as the expert sees fit. Our approaches to the planning requirement will allow the human expert-agent modelto be expanded to more complex environments than the previous human systems developed.

Open PDF

Document Details

Document Type: Technical Report
Publication Date: May 21, 2018
Accession Number: AD1054389

Entities

People

Carter B. Burn

Organizations

United States Naval Academy

Human Aided Reinforcement Learning in Complex Environments

Abstract

Document Details

Entities

People

Organizations

Tags

Communities of Interest

DTIC Thesaurus Topics

Fields of Study

Readers

Technology Areas