Utilizing Statistical Inference to Guide Expectations and Test Structuring During Operational Testing and Evaluation

Abstract

Comparative tests are commonly used during the operational testing phase to baseline the system under test (SUT) against the current status quo. Depending on the type of SUT, the comparative test may be costly and resource intensive. Thus any insights which may be gleaned about the potential results of the test beforehand may provide guidance on (1) the potential benefits of conducting the test and (2) the structuring of the test. This paper offers a statistical approach to understanding the type of results which may emerge during comparative testing of the SUT. Specifically, we utilize the concept of statistical inference to determine the needed performance difference between the SUT and the baseline system. If performance differences are statistically different, there may be useful information to be gained from conducting the test as is. Performance differences, which are not statistically different, may indicate that the test should be restructured or postponed. In either case, the relevant decision-maker is provided with information about the potential results of the test beforehand in order to make an informed decision. We illustrate the method of statistical inference on a system which improves situational awareness on the battlefield. We define a number of comparative metrics used to evaluate the operational effectiveness of the baseline system and the SUT. From the notional situational awareness system presented in this paper, we demonstrate the insights which may be gleaned and the implications for operational testing using statistical inference.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 30, 2011
Accession Number
ADA544009

Entities

People

  • Alton Wallace
  • Joy Brathwaite
  • Robert Holcomb

Organizations

  • Georgia Tech

Tags

Communities of Interest

  • Human Systems
  • Space
  • Weapons Technologies

DTIC Thesaurus Topics

  • Air Force
  • Business Administration
  • Data Science
  • Department Of Defense
  • Information Science
  • Operational Effectiveness
  • Operations Research
  • Public Policy
  • Situational Awareness
  • Social Sciences
  • Statistical Analysis
  • Statistical Inference
  • Statistics
  • Students
  • Systems Engineering
  • Test And Evaluation
  • Warfare

Readers

  • Aerospace Test and Evaluation
  • Computational Modeling and Simulation
  • Team-Based Human-Centered Cognitive Task Decision Making and Information Performance.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - DoD AI Strategy