Multivariable and Multigroup Receiver Operating Characteristics Curve Analyses for Qualitative and Quantitative Analysis

Abstract

An algorithm was developed using univariate statistics to reduce and analyze multivariate and multiple group data sets. The algorithm features the quantitative and selectivity figures of merit of receiver operating characteristics (ROC) curve methodology. This "merging" of two separate statistical analysis techniques resulted in the ability to address more than one variable in more than two experimental groups in a systematic fashion. The classic Fisher iris flower data set is treated as one variable and two cases at a time following conventional ROC curve methodology. Redundant, noisy, and low information containing variables are removed. The remaining information-rich variables are systematically merged using ROC curve techniques. The new algorithm using ROC curve techniques produces a "master" vector of down selected variables. The ROC curve technique can be used to process any data distribution whether linear or nonlinear; the inherent trend and fundamental nature of the raw data arc not compromised. No data set normalization or scaling procedures are necessary. Combining qualitative and quantitative aspects of data analysis into a univariate statistical method provides advantages in terms of algorithm understanding for the layman as well as enhanced computer efficiency and information-rich analysis.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2012
Accession Number
ADA554489

Entities

People

  • A. P. Snyder
  • Waleed M. Maswadeh

Organizations

  • Edgewood Chemical Biological Center

Tags

Communities of Interest

  • Biomedical

DTIC Thesaurus Topics

  • Algorithms
  • Data Analysis
  • Data Mining
  • Data Reduction
  • Data Science
  • Data Sets
  • Databases
  • Distribution Curves
  • Experimental Data
  • Frequency
  • Information Processing
  • Information Science
  • Multivariate Analysis
  • Pattern Recognition
  • Statistical Algorithms
  • Statistical Analysis
  • Statistics

Readers

  • Approximation Theory.
  • Image Processing and Computer Vision.
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference