Improved HMM Models for High Performance Speech Recognition

Abstract

In this paper we report on the various techniques that we implemented in order to improve the basic speech recognition performance of the BYBLOS system. Some of these methods are new, while others are not. We present methods that improved performance as well as those that did not. The methods include Linear Discriminant Analysis, Supervised Vector Quantization, Shared Mixture VQ. Deleted Estimation of Context Weights, MMI Estimation Using "N-Best" Alternatives, Cross-Word Triphone Models. While we have not yet combined all of the methods in one system, the overall word recognition error rate on the May 1988 test set using the Word-Pair grammar has decreased from 3.4% to 1.7%.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1989
Accession Number
ADA460743

Entities

People

  • Alan Derr
  • Chris Barry
  • Francis Kubala
  • George Yu
  • John Makhoul
  • Owen Kimball
  • Paul Placeway
  • Richard Schwartz
  • Steve Austin
  • William Russell
  • Yen-lu Chow

Organizations

  • BBN Technologies

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Accuracy
  • Algorithms
  • Automated Speech Recognition
  • Clustering
  • Decoders
  • Dictionaries
  • Discriminant Analysis
  • Errors
  • Grammars
  • Language
  • Numbers
  • Power Spectra
  • Probability
  • Recognition
  • Test Sets
  • Training
  • Word Recognition

Readers

  • Adaptive Control and Estimation with Uncertainty in Dynamic Systems.
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation