Team Air Combat Using Model-based Reinforcement Learning

Abstract

We formulate the xC;first generalized air combat maneuvering problem (ACMP), called the MvN ACMP, wherein M friendly AUCAVs engage against N enemy AUCAVs, developing a Markov decision process (MDP) model to control the team of M Blue AUCAVs. The MDP model leverages a 5-degree-of-freedom aircraft state transition model and formulates a directed energy weapon capability. Instead, a model-based reinforcement learning approach is adopted wherein an approximate policy iteration algorithmic strategy is implemented to attain high-quality approximate policies relative to a high performing benchmark policy. The ADP algorithm utilizes a multi-layer neural network for the value function approximation regression mechanism. One-versus-one and two-versus-one scenarios are constructed to test whether an AUCAV can outmaneuver and destroy a superior enemy AUCAV. The performance is evaluated across offensive, defensive, and neutral starts, leading to 6 problem instances. The ADP policies outperform the position-energy benchmark policy in 4 of 6 problem instances. Results show the ADP approach mimics certain basic xC;fighter maneuvers and section tactics.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 01, 2022
Accession Number
AD1172381

Entities

People

  • David A Mottice

Organizations

  • Air Force Institute of Technology

Tags

Communities of Interest

  • Air Platforms
  • Energy and Power Technologies
  • Human Systems
  • Weapons Technologies

DTIC Thesaurus Topics

  • Aerial Warfare
  • Air Force
  • Air Power
  • Aircrafts
  • Algorithms
  • Artificial Intelligence
  • Computer Programming
  • Department Of Defense
  • Directed Energy Weapons
  • Dynamic Programming
  • Game Theory
  • Military Organizations
  • National Security
  • Neural Networks
  • Operations Research
  • Reinforcement Learning
  • United States
  • Warfare
  • Weapon Systems

Readers

  • Neural Network Machine Learning.
  • Neurological Diseases/Conditions/Disorders
  • Operations Research

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms
  • Directed Energy