Analyzing Human-Induced Pathology in the Training of Reinforcement Learning Algorithms

Abstract

Modern artificial intelligence (AI) systems trained with reinforcement learning (RL) are increasingly more capable, but agents training to complete tasks in safety critical environments still require millions of trial-and-error training steps. Previous research with a Pong agent has shown that some human heuristics initially accelerate training but cause agent performance to regress to a state of performance collapse. This thesis utilizes the FlappyBird environment to evaluate if the pathology is generalizable. After initially confirming a similar pathology in an unaided agent, comprehensive experimentation was performed with optimizers, weight initialization methods, activation functions, and varied hyperparameters. The pathology persisted across all experiments and the results show the network architecture is likely the principal cause. At a high level, this work illustrates the importance of determining the inherent capacity of an architecture to learn and model complex environments and how more systematic methods to quantify capacity would greatly enhance RL.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 01, 2022
Accession Number
AD1200383

Entities

People

  • Brian R. Atkinson

Organizations

  • Naval Postgraduate School

Tags

Communities of Interest

  • Energy and Power Technologies
  • Weapons Technologies

DTIC Thesaurus Topics

  • Artificial Intelligence
  • Artificial Intelligence Software
  • Automata Theory
  • Bayesian Networks
  • California
  • Computer Languages
  • Computer Programs
  • Computer Science
  • Computers
  • Convolutional Neural Networks
  • Data Mining
  • Feature Extraction
  • Information Processing
  • Information Science
  • Information Systems
  • Machine Learning
  • Multiagent Systems
  • Network Architecture
  • Network Science
  • Neural Networks
  • Reinforcement Learning

Fields of Study

  • Computer science

Readers

  • Artificial Intelligence
  • Neural Network Machine Learning.
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms
  • AI & ML - Neural Networks