Clustering by Scenario with Special Application to Two-Way Tables of Counts,

Abstract

The formation of a partition of objects, each with an associated random measurement X, is given operational meaning and a figure of merit. The information about X given each object is first reduced to information about X given the cluster in the partition to which that object belongs. The figure of merit for a partition is then the probability of a correct object identification, on the basis of a realization of X, after the information loss. This leads both to a method for evaluating partitions and a clustering algorithm. The methods are discussed in the context of a particular example--clustering states in a two-way table of counts (states by nationality) of U.S. residents in 1970 who were foreign-born or had at least one foreign-born parent.

Document Details

Document Type
Technical Report
Publication Date
Oct 01, 1973
Accession Number
ADA002201

Entities

People

  • Daniel A. Relles
  • William S. Cleveland

Organizations

  • RAND Corporation

Tags

DTIC Thesaurus Topics

  • Algorithms
  • Clustering
  • Figure Of Merit
  • Identification
  • Measurement
  • Probability

Fields of Study

  • Mathematics

Readers

  • Graph Algorithms and Convex Optimization.
  • Image Processing and Computer Vision.
  • Regression Analysis.