STUDY AND ANALYSIS OF SIGNAL DISPLAY AND BANDWIDTH COMPRESSION TECHNIQUES.

Abstract

Parameter signals were generated representing (1) short-time running-average rate of occurrence of threshold crossings for an input speech signal (Z sub 1) or for the derivative of input speech signal (Z sub 2); and (2) short-time running-average amplitude envelope of the input speech (A). Pairs of these parameter signals were plotted as x- and y-deflections on the display of a storage oscilloscope (the 'vector' display), for speech elements of syllabic length, and the resulting trace patterns ('vectorgrams') were studied. Gross structure of the vectorgram patterns shows a significant tendency to be phoneme-correlated and talker-invariant, is little influenced by accents or by male-female voice distinctions, but is quite sensitive to clarify of enunciation. Fine structure is also seen, some of which is also seen, some of which is also phoneme-correlated and voice-invariant, and which is often repeated with great exactitude when a given speaker repeats a syllable. The relation of these patterns to other work on formant changes is discussed, as is their application to automatic speech recognition and bandwidth compression. Experiments are described in which the A and Z sub 1 or Z sub 2 parameter signals were passed through narrowband channels and used to reconstruct synthetic speech in a simulated receiver. Questions of instrumentation are discussed, and a technique of system evaluation by direct comparison of input and output vectorgrams is described. Possibilities for improving performance are considered. (Author)

Document Details

Document Type
Technical Report
Publication Date
Jun 14, 1965
Accession Number
AD0616644

Entities

People

  • Berry O. Pyron
  • Frank R. Williamson Jr.

Organizations

  • Georgia Tech

Tags

DTIC Thesaurus Topics

  • Amplitude
  • Automated Speech Recognition
  • Automatic
  • Bandwidth
  • Compression
  • Contracts
  • Crossings
  • Deflection
  • Instrumentation
  • Measuring Instruments
  • Narrowband
  • Oscilloscopes
  • Recognition
  • Syllables
  • Test And Evaluation

Readers

  • Computer Science/Computer Engineering/Data Science/Digital Signal Processing.
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference