Comparing and Combining Generative and Posterior Probability Models: Some Advances in Sentence Boundary Detection in Speech

Abstract

We compare and contrast two different models for detecting sentence-like units in continuous speech. The first approach uses hidden Markov sequence models based on N-grams and maximum likelihood estimation, and employs model interpolation to combine different representations of the data. The second approach models the posterior probabilities of the target classes; it is discriminative and integrates multiple knowledge sources in the maximum entropy (maxent) framework. Both models combine lexical, syntactic, and prosodic information. We develop a technique for integrating pretrained probability models into the maxent framework, and show that this approach can improve on an HMM-based state-of-the-art system for the sentence-boundary detection task. An even more substantial improvement is obtained by combining the posterior probabilities of the two systems.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jul 26, 2004
Accession Number
AD1002426

Entities

People

  • Andreas Stolcke
  • Elizabeth Shriberg
  • Mary Harper
  • Yang Liu

Organizations

  • SRI International

Tags

Communities of Interest

  • Autonomy
  • Energy and Power Technologies
  • Human Systems

DTIC Thesaurus Topics

  • Algorithms
  • Artificial Intelligence Software
  • Automated Speech Recognition
  • Computational Science
  • Gaussian Distributions
  • Hidden Markov Models
  • Language
  • Machine Learning
  • Markov Models
  • Natural Language Processing
  • Natural Languages
  • Probabilistic Models
  • Probability
  • Recognition
  • Standards
  • Stochastic Processes
  • Word Recognition

Readers

  • Computational Linguistics
  • Speech Processing/Speech Recognition.
  • Statistical inference.