High Dimensional Clustering Using Parallel Coordinates and the Grand Tour.

Abstract

In this paper, we present some graphical techniques for cluster analysis of high-dimensional data. Parallel coordinate plots and parallel coordinate density plots are graphical techniques which map multivariate data into a two-dimensional display. The method has some elegant duality properties with ordinary Cartesian plots so that higher-dimensional mathematical structures can be analyzed. Our high interaction software allows for rapid editing of data to remove outliers and isolate clusters by brushing. Our brushing techniques allow not only for hue adjustment, but also for saturation adjustment. Saturation adjustment allows for the handling of comparatively massive data sets by using the alpha-channel of the Silicon Graphics workstation to compensate for heavy overplotting. The grand tour is a generalized rotation of coordinate axes in a high-dimensional space. Coupled with the full-dimensional plots allowed by the parallel coordinate display, these techniques allow the data analyst to explore data which is both high-dimensional and massive in size. In this paper we give a description of both techniques and illustrate their use to do inverse regression and clustering. We have used these techniques to analyze data on the order of 250,000 observations in 8 dimensions. Because the analysis requires the use of color graphics, in the present paper we illustrate the methods with a more modest data set of 3848 observations. Other illustrations are available on our web page.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 01, 1996
Accession Number
ADA313545

Entities

People

  • Edward Wegman
  • Qiang Luo

Organizations

  • George Mason University

Tags

Communities of Interest

  • Advanced Electronics
  • Air Platforms

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Cartesian Coordinates
  • Clustering
  • Computer Science
  • Coordinate Systems
  • Data Analysis
  • Data Sets
  • Graphics
  • Observation
  • Rotation
  • Saturation
  • Specific Volume
  • Statistics
  • Theses
  • Two Dimensional
  • Visualizations

Fields of Study

  • Computer science

Readers

  • Computer Science.
  • Finite Element Method (FEM) for solving Partial Differential Equations (PDEs)
  • Neural Network Machine Learning.

Technology Areas

  • Space