Ensemble Learning Method for Hidden Markov Models

Abstract

This dissertation introduces an ensemble learning method for temporal data that uses a mixture of Hidden Markov Model (HMM) classfiers. We hypothesize that the data is generated by K models, each of which reflects a particular trend in the data. Model identfication could be achieved through clustering in the feature space or in the parameters space. However, this approach is inappropriate in the context of sequential data. The proposed approach is based on clustering in the log-likelihood space, and has two main steps. First, one HMM is fit to each of the N individual sequences. For each fitted model, we evaluate the log-likelihood of each sequence. This will result in an N-by-N log-likelihood distance matrix that will be partitioned into K groups using a relational clustering algorithm. In the second step, we pool the sequences belonging to the same cluster into K groups. Then, we learn the parameters of one HMM per group. We propose using and optimizing various training approaches for the different K groups depending on their size and homogeneity. In particular, we investigate the maximum likelihood (ML), the minimum classification error (MCE) based discriminative, and the Variational Bayesian (VB) training approaches. Finally, to test a new sequence, its likelihood is computed in all the models and a final confidence value is assigned by combining the multiple models outputs using a decision level fusion method such as an artficial neural network or a hierarchical mixture of experts. Our approach was evaluated on two real-world applications: (1) identification of Cardio-Pulmonary Resuscitation (CPR) scenes in video simulating medical crises; and (2) landmine detection using Ground Penetrating Radar (GPR). Results on both applications show that the proposed method can identify meaningful and coherent HMM mixture components that describe different properties of the data. Each HMM mixture component models a group of data that share common attributes.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Dec 01, 2014
Accession Number
AD1010447

Entities

People

  • Anis Hamdi

Organizations

  • University of Louisville

Tags

Communities of Interest

  • Counter IED
  • Energy and Power Technologies
  • Human Systems

DTIC Thesaurus Topics

  • Bayesian Networks
  • Computational Science
  • Data Mining
  • Detection
  • Feature Extraction
  • Hidden Markov Models
  • Information Processing
  • Information Science
  • Machine Learning
  • Monte Carlo Method
  • Network Science
  • Neural Networks
  • Probabilistic Models
  • Probability
  • Supervised Machine Learning
  • Three Dimensional
  • Two Dimensional

Fields of Study

  • Computer science

Readers

  • Neural Network Machine Learning.
  • Regression Analysis.
  • Sensor Fusion and Tracking Systems.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • Space