TEMPORAL EFFECTS IN SPEECH ANALYSIS AND SYNTHESIS,

Abstract

Two major methods of adjusting the time dimention of speech were investigated. One was linear time normalization and the other was arc-length time normalization. In the experiments conducted the distribution of spectral energy was derived by scanning the output of a bank of vocoder analyzer filters. A hybrid computing facility was employed to accomplish the various methods of speech time adjustment. The perceptual effects of time-adjusted synthesized utterances were investigated in an intelligibility test. A dichotic method of time-compression of speech was studied in which information from alternate time samples was supplied to alternate ears. A comparison of the linear and non-linear methods of time normalization suggests that the arc-length method may be superior for use in automatic speech recognition. A number of potential applications of time adjusted speech and possibilities for future research are suggested. (Author)

Document Details

Document Type
Technical Report
Publication Date
Dec 01, 1965
Accession Number
AD0626659

Entities

People

  • Robert James Scott

Tags

DTIC Thesaurus Topics

  • Analyzers
  • Automated Speech Recognition
  • Automatic
  • Compression
  • Intelligibility
  • Recognition
  • Scanning
  • Speech
  • Speech Analysis
  • Time Compression

Readers

  • Brain and Cognitive Science; Experimental Psychology; Cognitive Neuroscience
  • Computational Modeling and Simulation
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference