Statistical Inference on Optimal Points to Evaluate Multi-State Classification Systems

Abstract

In decision making, an optimal point represents the settings for which a classification system should be operated to achieve maximum performance. Clearly, these optimal points are of great importance in classification theory. Not only is the selection of the optimal point of interest, but quantifying the uncertainty in the optimal point and its performance is also important. The Youden index is a metric currently employed for selection and performance quantification of optimal points for classification system families. The Youden index quantifies the correct classification rates of a classification system, and its confidence interval quantifies the uncertainty in this measurement. This metric currently focuses on two or three classes, and only allows for the utility of correct classifications and the cost of total misclassifications to be considered. An alternative to this metric for three or more classes is a cost function which considers the sum of incorrect classification rates. This new metric is preferable as it can include class prevalences and costs associated with every classification. In multi-class settings this informs better decisions and inferences on optimal points. The work in this dissertation develops theory and methods for confidence intervals on a metric based on misclassfication rates, Bayes Cost, and where possible, the thresholds found for an optimal point using Bayes Cost. Hypothesis tests for Bayes Cost are also developed to test a classification systems performance or compare systems with an emphasis on classification systems involving three or more classes. Performance of the newly proposed methods is demonstrated with simulation.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 18, 2014
Accession Number
ADA608717

Entities

People

  • Katherine A. Batterton

Organizations

  • Air Force Institute of Technology

Tags

Communities of Interest

  • Biomedical
  • C4I

DTIC Thesaurus Topics

  • Accuracy
  • Air Force
  • Algorithms
  • Data Science
  • Department Of Defense
  • Distribution Functions
  • Information Processing
  • Information Science
  • Knowledge Management
  • Machine Learning
  • Monte Carlo Method
  • Normal Distribution
  • Probability Distributions
  • Random Variables
  • Statistical Algorithms
  • Statistical Inference
  • Test Methods

Fields of Study

  • Computer science

Readers

  • Adaptive Control and Estimation with Uncertainty in Dynamic Systems.
  • Life Cycle Cost Analysis
  • Regression Analysis.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference