Fast Algorithms for Mining Co-evolving Time Series

Abstract

Time series data arise in many applications, from motion capture, environmental monitoring temperatures in data centers, to physiological signals in health care. In the thesis, I will focus on the theme of learning and mining large collections of co-evolving sequences, with the goal of developing fast algorithms for finding patterns, summarization, and anomalies. In particular, this thesis will answer the following recurring challenges for time series 1. Forecasting and imputation: How to do forecasting and to recover missing values in time series data? 2. Pattern discovery and summarization: How to identify the patterns in the time sequences that would facilitate further mining tasks such as compression, segmentation and anomaly detection? 3. Similarity and feature extraction: How to extract compact and meaningful features from multiple co-evolving sequences that will enable better clustering and similarity queries of time series? 4. Scale up: How to handle large data sets on modern computing hardware? We develop models to mine time series with missing values, to extract compact representation from time sequences, to segment the sequences, and to do forecasting. For large scale data, we propose algorithms for learning time series models, in particular, including Linear Dynamical Systems (LDS) and Hidden Markov Models (HMM). We also develop a distributed algorithm for finding patterns in large web-click streams. Our thesis will present special models and algorithms that incorporate domain knowledge. For motion capture, we will describe the natural motion stitching and occlusion filling for human motion. In particular we provide a metric for evaluating the naturalness of motion stitching, based which we choose the best stitching.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 01, 2011
Accession Number
ADA557360

Entities

People

  • Lei Li

Organizations

  • Carnegie Institute of Technology

Tags

Communities of Interest

  • Autonomy
  • C4I
  • Cyber
  • Energy and Power Technologies
  • Sensors
  • Weapons Technologies

DTIC Thesaurus Topics

  • Artificial Intelligence
  • Bayesian Networks
  • Change Detection
  • Computational Fluid Dynamics
  • Computational Science
  • Computer Programming
  • Computers
  • Data Centers
  • Data Mining
  • Databases
  • Dimensionality Reduction
  • Information Processing
  • Information Science
  • Machine Learning
  • Mathematical Filters
  • Network Science
  • Neural Networks

Fields of Study

  • Computer science

Readers

  • Adaptive Control and Estimation with Uncertainty in Dynamic Systems.
  • Atmospheric Science/Meteorology
  • Computational Linguistics

Technology Areas

  • AI & ML