Maximum Likelihood Spectral Estimation and Its Application to Narrowband Speech Coding.

Abstract

Using the maximum likelihood (ML) method the Itakura-Saito 1 spectral matching criterion is generalized to aperiodic and periodic processes having arbitrary model spectra. For the all-pole model, Kay's 2 covariance domain solution to the exact ML problem is cast into the spectral domain and used to obtain the exact solution for periodic processes. It is shown that if the number of independent power measurements greatly exceeds the model order, then the ML algorithm reduces to a pitch-directed, frequency domain version of Linear Predictive (LP) spectral analysis. Using a real-time vocoder based on the exact ML analysis revealed that, in contrast to standard LPC, the synthetic speech has the quality of being heavily smoothed. This suggests that it is generally incorrect to interpret LPC spectral matching in terms of the Itakura-Saito criterion. (Author)

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 05, 1982
Accession Number
ADA114070

Entities

People

  • Robert J. Mcaulay

Organizations

  • Massachusetts Institute of Technology

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Algorithms
  • Bandwidth
  • Computational Complexity
  • Computational Science
  • Contrast
  • Estimators
  • Frequency
  • Frequency Domain
  • High Resolution
  • Mathematical Analysis
  • Measurement
  • Narrowband
  • Power Measurement
  • Probabilistic Models
  • Random Variables
  • Speech Compression
  • Standards

Readers

  • Calculus or Mathematical Analysis
  • Speech Processing/Speech Recognition.
  • Statistical inference.