Data Reduction System for Improving Classifier Performance

Abstract

A data reduction method for a classification system using quantized feature vectors for each class with a plurality of features and levels. The reduction algorithm consisting of applying a Bayesian data reduction algorithm to the classification system for developing reduced feature vectors. Test data is then quantified into the reduced feature vectors. The reduced classification system is then tested using the quantized test data. A Bayesian data reduction algorithm is further provided having by computing an initial probability of error for the classification system. Adjacent levels are merged for each feature in the quantized feature vectors. Level based probabilities of error are then calculated for these merged levels among the plurality of features. The system then selects and applies the merged adjacent levels having the minimum level based probability of error to create an intermediate classification system. Steps of merging, selecting and applying are performed until either the probability of error stops improving or the features and levels are incapable of further reduction.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 18, 1999
Accession Number
ADD019448

Entities

People

  • Peter Willett
  • Robert S. Lynch Jr.

Organizations

  • United States Department of the Navy

Tags

Communities of Interest

  • Ground and Sea Platforms
  • Materials and Manufacturing Processes
  • Sensors

DTIC Thesaurus Topics

  • Algorithms
  • Artificial Intelligence
  • Artificial Intelligence Computing
  • Artificial Intelligence Software
  • Bayesian Networks
  • Computational Science
  • Computer Programs
  • Computers
  • Data Reduction
  • Data Science
  • Information Science
  • Inventions
  • Machine Learning
  • Neural Networks
  • Probability
  • Signal Processing
  • Statistics

Fields of Study

  • Computer science

Readers

  • Applied Combinatorial Optimization and Logic Circuit Design.
  • Computer Science/Computer Engineering/Data Science/Digital Signal Processing.
  • Regression Analysis.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference