Speech Compression and Synthesis

Abstract

This report describes our work for the past year on speech compression and synthesis. We implemented a real-time variable-frame-rate LPC vocoder operating at an average rate of 2000 bits/s. We also tested our mixed- source model as part of the vocoder. To improve the reliability of the extraction of LPC parameters, we implemented and tested a range of adaptive lattice and autocorrelation algorithms. For data rates above 5000 bits/s, we developed and tested a new high-frequency regeneration technique, spectral duplication, which reduces the roughness in the synthetic speech. As part of an overall very-low-rate speech compression system, we designed and implemented a program for phonetic synthesis from diphone templates. The lengthy process of extracting the 3000 needed diphone templates has been half completed.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Aug 01, 1979
Accession Number
ADA074497

Entities

People

  • John Klovstad
  • John Makhoul
  • Lynn Cosell
  • Michael Berouti
  • Richard Schwartz

Organizations

  • BBN Technologies

Tags

Communities of Interest

  • Energy and Power Technologies
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Algorithms
  • Coding
  • Computer Programming
  • Data Rate
  • Databases
  • Debugging
  • Frequency
  • Frequency Bands
  • Information Science
  • Ions
  • Larynx
  • Operating Systems
  • Signal Processing
  • Speech Compression
  • Speech Quality
  • Speech Transmission
  • Waveforms

Fields of Study

  • Computer science

Readers

  • Defense Acquisition Program Management
  • Speech Processing/Speech Recognition.