A Segmental Hidden Markov Model for Speech Pattern Processing

Abstract

A simple statistical segmental approach to speech pattern modelling, based on segmental hidden Markov models, is proposed which addresses some of the limitations of conventional hidden Markov model based methods. The most important features of the new approach are the use of an underlying semi-Markov process to model speech at the segment level, rather than time-synchronous frame level, and to enable improved segment duration modelling, and the development of a segment model in which separate statistical processes are used to characterise extra-state and intrastate variability, thus making the temporal independence assumption more acceptable within a segment. A basic mathematical analysis of gaussian segmental hidden Markov models is presented and model parameter re- estimation equations are derived. The relationship between the new type of model and variable frame rate analysis and conventional gaussian mixture based hidden Markov models is exposed.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jul 29, 1992
Accession Number
ADA258220

Entities

People

  • Martin J. Russell

Tags

DTIC Thesaurus Topics

  • Algorithms
  • Artificial Intelligence
  • Automated Speech Recognition
  • Dynamic Programming
  • Gaussian Processes
  • Hidden Markov Models
  • Language
  • Markov Models
  • Markov Processes
  • Mathematical Analysis
  • Probability
  • Probability Density Functions
  • Signal Processing
  • Statistical Analysis
  • Statistical Processes
  • Statistics
  • Stochastic Processes

Fields of Study

  • Engineering

Readers

  • Auditory Neuroscience/Auditory Physiology.
  • Computational Modeling and Simulation
  • Statistical inference.