Dynamic Load Balancing Algorithms for Sequence Mining

Abstract

The discovery of sequential patterns is becoming increasingly useful and essential in many scientific and commercial domains. The enormous size of available datasets and the potentially large number of mined patterns demand efficient and scalable algorithms. In this paper we present a parallel formulation of a serial sequential pattern discovery algorithm based on tree projection. The formulation uses a novel dynamic load balancing algorithm that is well suited to distributed-memory parallel computers. Our experimental evaluation on a 32-processor IBM SP shows that these algorithms are capable of achieving good speedups, substantially reducing the amount of work required to find sequential patterns in large databases.

Document Details

Document Type
Technical Report
Publication Date
May 08, 2001
Accession Number
AD1020012

Entities

People

  • George Karypis
  • Valerie Guralnik

Organizations

  • University of Minnesota

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Algorithms
  • Computer Science
  • Computers
  • Data Sets
  • Databases
  • Dynamic Loads
  • Engineering
  • High Performance Computing
  • Parallel Computing
  • Parallel Processing
  • Sequences
  • Static Loads
  • Test And Evaluation

Fields of Study

  • Computer science

Readers

  • Distributed Systems and Data Platform Development
  • Parallel and Distributed Computing