Iterative Structure Discovery in Graph-Based Data

Abstract

Much of current data mining research is focused on discovering sets of attributes that discriminate data entities into classes, such as shopping trends for a particular demographic group. In contrast, we are working to develop data mining techniques to discover patterns consisting of complex relationships between entities. Our research is particularly applicable to domains in which the data is event-driven or relationally structured. In this paper we present approaches to address two related challenges; the need to assimilate incremental data updates and the need to mine monolithic datasets. Many realistic problems are continuous in nature and therefore require a data mining approach that can evolve discovered knowledge over time. Similarly, many problems present data sets that are too large to fit into dynamic memory on conventional computer systems. We address incremental data mining by introducing a mechanism for summarizing discoveries from previous data increments so that the globally-best patterns can be computed by mining only the new data increment. To address monolithic datasets we introduce a technique by which these datasets can be partitioned and mined serially with minimal impact on the result quality. We present applications of our work in both the counter-terrorism and bioinformatics domains.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2005
Accession Number
ADA459054

Entities

People

  • Diane Cook
  • Jeffrey A. Coble
  • Lawrence B. Holder
  • Runu Rathi

Organizations

  • University of Texas at Arlington

Tags

Communities of Interest

  • Autonomy

DTIC Thesaurus Topics

  • Air Force Research Laboratories
  • Algorithms
  • Automated Text Summarization
  • Batch Processing
  • Bayesian Networks
  • Boundaries
  • Chemical Compounds
  • Compression
  • Compression Ratio
  • Computer Science
  • Computers
  • Counterterrorism
  • Data Mining
  • Data Sets
  • Machine Learning
  • Probability
  • Terrorism

Fields of Study

  • Computer science

Readers

  • Computational Modeling and Simulation
  • Database Systems and Applications
  • Neural Network Machine Learning.

Technology Areas

  • AI & ML