An Automatic Cluster Analysis Algorithm.

Abstract

This technical report presents an algorithm that finds clusters in a set of data. Examples of applications of this algorithm are the separation of targets from clutter in reconnaissance imagery and the determination of prototypes from any set of data represented in vector form, such as hand-printed letters, electrocardiographic data and electroencephalographic data. The approach used is to start with the closet vector to the data mean as the trial prototype for the first cluster. A hypersphere that is centered on the trial prototype is then defined. The radius of the hypersphere is incremented, and the prototype updated, until one of several termination criteria is met. The furthest vector from the resultant cluster is then used to start the search for the next cluster and the process is repeated until no new clusters can be located. The algorithm also merges clusters which are their own closest neighbors, are sufficiently close to each other, and result in a sufficiently small variance. These criteria were empirically derived and are related to data statistics. (Author)

Document Details

Document Type
Technical Report
Publication Date
Feb 01, 1976
Accession Number
ADA022898

Entities

People

  • Roger A. Gagnon

Organizations

  • Air Force Research Laboratory

Tags

DTIC Thesaurus Topics

  • Algorithms
  • Automatic
  • Data Science
  • Information Science
  • Mathematics
  • Prototypes
  • Reconnaissance
  • Statistics

Readers

  • Approximation Theory.
  • Computer Vision.