Finding Acoustic Regularities in Speech: Applications to Phonetic Recognition

Abstract

Phonetic recognition can be viewed as a process through which the acoustic signal is mapped to a set of phonological units used to represent a lexicon. Traditionally, researchers often prescribe an intermediate, phonetic description to account for coarticulation. This thesis presents an alternative approach whereby this phonetic-level description is bypassed in favor of directly relating the acoustic realization to the underlying phonemic forms. In this approach, the speech signal is transformed into a set of segments which are described completely in a acoustic terms. Next, these acoustic segments are related to the phonemes by a grammar which is determined using automated procedures operating on a set of training data. Thus important acoustic regularities that describes contextual variations are discovered without the need to specify a set of preconceived units such as allophones. The viability of this approach depends critically on the ability to detect important acoustic landmarks in the speech signal, and to describe these events in terms of an inventory of labels that captures the regularity of phonetic variations. Keywords: Acoustic segmentation; Acoustic classification.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Dec 01, 1988
Accession Number
ADA207072

Entities

People

  • James R. Glass

Organizations

  • Massachusetts Institute of Technology

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Acoustic Signals
  • Algorithms
  • Automated Speech Recognition
  • Computer Programming
  • Data Science
  • Databases
  • Decoding
  • Electrical Engineering
  • Factor Analysis
  • Frequency
  • Information Science
  • Language
  • Probability
  • Recognition
  • Signal Processing
  • Speech Analysis
  • Statistical Analysis

Readers

  • Speech Processing/Speech Recognition.
  • Theoretical Analysis.