A Post-Processing Algorithm for Time Domain Pitch Trackers.

Abstract

This paper describes a powerful post-processing algorithm for time-domain pitch trackers. On two successive passes, the post-processing algorithm eliminates errors produced during a first pass by a time-domain pitch tracker. During the second pass, incorrect pitch values are detected as outliers by computing the distribution of values over a sliding 80 msec window. During the third pass (based on artificial intelligence techniques), remaining pitch pulses are used as anchor points to reconstruct the pitch train from the original waveform. The algorithm produced a decrease in the error rate from 21% obtained with the original time domain pitch tracker to 2% for isolated words and sentences produced in an office environment by 3 male and 3 female talkers. In a noisy computer room errors decreased from 52% to 2.9% for the same stimuli produced by 2 male talkers. The algorithm is efficient, accurate, and resistant to noise. The fundamental frequency micro-structure is tracked sufficiently well to be used in extracting phonetic features in a feature-based recognition system. (Author)

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 21, 1983
Accession Number
ADA127671

Entities

People

  • Philippe Specker

Organizations

  • Carnegie Mellon University

Tags

Communities of Interest

  • Energy and Power Technologies
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Accuracy
  • Algorithms
  • Artificial Intelligence
  • Automated Speech Recognition
  • Computations
  • Computer Science
  • Computers
  • Environment
  • Errors
  • False Alarms
  • Feature Extraction
  • Frequency
  • Intervals
  • Recognition
  • Signal Processing
  • Time Domain
  • Warning Systems

Readers

  • Approximation Theory.
  • Sensor Fusion and Tracking Systems.
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference