Speech Recognition: Acoustic-Phonetic Knowledge Acquisition and Representation

Abstract

The long-term research goal is to develop and implement speaker- independent continuous speech recognition systems. It is believed that the proper utilization of speech-specific knowledge is essential for such advanced systems. This research is thus directed toward the acquisition, quantification, and representation, of acoustic-phonetic and lexical knowledge, and the application of this knowledge to speech recognition algorithms. In addition, we are exploring new speech recognition alternatives based on artificial intelligence and connectionist techniques. We developed a statistical model for predicting the acoustic realization of stop consonants in various positions in the syllable template. A unification-based grammatical formalism was developed for incorporating this model into the lexical access algorithm. We provided an information-theoretic justification for the hierarchical structure of the syllable template. We analyzed segmented duration for vowels and fricatives in continuous speech. Based on contextual information, we developed durational models for vowels and fricatives that account for over 70% of the variance, using data from multiple, unknown speakers. We rigorously evaluated the ability of human spectrogram readers to identify stop consonants spoken by many talkers and in a variety of phonetic contexts. Incorporating the declarative knowledge used by the readers, we developed a knowledge-based system for stop identification. We achieved comparable system performance to that of the readers. We developed a technique for phonetic classification using artificial neural nets (ANN). Vowel classification accuracy was achieved, ranging from 66 to 100% under varying experimental conditions.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 30, 1988
Accession Number
ADA203081

Entities

People

  • Victor W. Zue

Organizations

  • Massachusetts Institute of Technology

Tags

DTIC Thesaurus Topics

  • Acquisition
  • Algorithms
  • Artificial Intelligence
  • Artificial Intelligence Computing
  • Artificial Intelligence Software
  • Automated Speech Recognition
  • Computer Science
  • Consonants
  • Contracts
  • Identification
  • Knowledge Based Systems
  • Massachusetts
  • Military Research
  • Neural Networks
  • Recognition
  • Signal Processing
  • Syllables

Readers

  • Computational Linguistics
  • Speech Processing/Speech Recognition.
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks