Rule Based Sinusoidal Encoding of Speech

Abstract

A system was developed to investigate the data rate necessary to transmit speech using a rule based sinusoidal model. The system consists of a speech analyzer and a synthesizer. The analyzer outputs discrete frequencies and quantized amplitudes and phases of selected speech spectral components. The synthesizer reconstructs speech from these components based on a sinusoidal model. The selection of spectral components for voiced speech regions is based on the detection of harmonics of the fundamental frequency. To obtain a specific number of spectral components, a variable amplitude threshold is applied to the detected harmonics and their nearest neighbors. For unvoiced regions only the variable amplitude step is applied. The lowest data rate obtained for toll quality speech was about 18 Kbps. This system was implemented in Fortran 77 on a VAX 11/780 computer. Visual analysis of speech was provided by the software package SPIRE (Speech and Phonetics Interactive Research Environment). Keywords: Theses.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 01, 1990
Accession Number
ADA220107

Entities

People

  • Luis M. Alenquer

Organizations

  • Air Force Institute of Technology

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Abstracts
  • Air Force
  • Analyzers
  • Coding
  • Computer Programs
  • Computers
  • Data Rate
  • Data Reduction
  • Discrete Fourier Transforms
  • Electrical Engineering
  • Engineering
  • Frequency
  • Frequency Domain
  • Operating Systems
  • Software Development
  • Speech Transmission
  • Time Domain

Fields of Study

  • Engineering

Readers

  • Computer Vision.
  • Database Systems and Applications
  • Speech Processing/Speech Recognition.