Applying SPHINX-II to the DARPA Wall Street Journal CSR Task

Abstract

This paper reports recent efforts to apply the speaker-independent SPHINX-II system to the DARPA Wall Street Journal continuous speech recognition (CSR) task. In SPHINX-II, we incorporated additional dynamic and speaker-normalized features, replaced discrete models with sex-dependent semi-continuous hidden Markov models, augmented within-word triphones with between-word triphones, and extended generalized triphone models to shared-distribution models. The configuration of SPHINX-II used for this task includes sex-dependent, semi-continuous, shared-distribution hidden Markov models and left-context-dependent between-word triphones. In applying our technology to this task, we addressed issues that were not previously of concern owing to the relatively small size of the Resource Management task.

Document Details

Document Type: Technical Report
Publication Date: Jan 01, 1992
Accession Number: ADA460229

Entities

People

  • F. Alleva
  • H. Hon
  • M. Hwang
  • R. Rosenfeld
  • R. Weide
  • X. Huang

Organizations

  • Carnegie Mellon University

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Computational Science
  • Computations
  • Computer Science
  • Data Sets
  • Decoding
  • Dictionaries
  • Hidden Markov Models
  • Language
  • Linguistics
  • Markov Models
  • Probability
  • Probability Distributions
  • Recognition
  • Standards
  • Test Sets

Readers

  • Mathematical Modeling and Probability Theory
  • Speech Processing/Speech Recognition

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval