Improving State-of-the-Art Continuous Speech Recognition System Using the N-Best Paradigm with Neural Networks

Abstract

In an effort to advance the state of the art in continuous speech recognition employing hidden Markov models (HMM), Segmental Neural Nets (SNN) were introduced recently to ameliorate the well- known limitations of HMMs, namely, the conditional-independence limitation and the relative difficulty with which HMMs can handle segmental features. We describe a hybrid SNN/I-IMM system that combines the speed and performance of our HMM system with the segmental modeling capabilities of SNNs. The integration of the two acoustic modeling techniques is achieved successfully via the N-best rescoring paradigm. The N-best lists are used not only for recognition, but also during training. This discriminative training using N-best is demonstrated to improve performance. When tested on the DARPA Resource Management speaker-independent corpus, the hybrid SNN/HMM system decreases the error by about 20% compared to the state-of-the-art HMM system.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1992
Accession Number
ADA460339

Entities

People

  • G. Zavaliagkost
  • J. Makhoul
  • Robert E. Schwartz
  • S. Austin

Organizations

  • BBN Technologies

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Algorithms
  • Artificial Intelligence Computing
  • Artificial Intelligence Software
  • Automated Speech Recognition
  • Computations
  • Hidden Markov Models
  • Hybrid Systems
  • Language
  • Markov Models
  • Models
  • Natural Language Processing
  • Natural Languages
  • Neural Networks
  • Probability
  • Recognition
  • Resource Management
  • Test Sets

Fields of Study

  • Computer science

Readers

  • Neural Network Machine Learning.
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks