CLUTO - A Clustering Toolkit

Abstract

Clustering algorithms divide data into meaningful or useful groups, called clusters, such that the intra-cluster similarity is maximized and the inter-cluster similarity is minimized. These discovered clusters can be used to explain the characteristics of the underlying data distribution and thus serve as the foundation for various data mining and analysis techniques. The applications of clustering include characterization of different customer groups based upon purchasing patterns, categorization of documents on the World Wide Web, grouping of genes and proteins that have similar functionality, grouping of spatial locations prone to earth quakes from seismological data, etc. CLUTO is a software package for clustering low and high dimensional datasets and for analyzing the characteristics of the various clusters.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 23, 2002
Accession Number
ADA439508

Entities

People

  • George Karypis

Organizations

  • University of Minnesota

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Clustering
  • Computer Science
  • Copyrights
  • Debugging
  • Information Operations
  • Law
  • Military Research
  • Minnesota
  • Schools
  • Sparse Matrix
  • Statistics
  • Universities
  • Visualizations
  • World Wide Web

Fields of Study

  • Computer science

Readers

  • Database Systems and Applications
  • Neural Network Machine Learning.

Technology Areas

  • AI & ML