An Automatic Speech Recognition System Using a Vocoder Input.

Abstract

Speech recognition is accomplished by off-line machine processes based on visual pattern recognition techniques. The fundamental system uses digitized data output from a KY-585 Vocoder, and two-dimensional discrete Fourier transforms with spatial frequency filters.. Two male speakers generated data for the computer processes which include a speaker adaptation routine. A relation to the human physiology is maintained through an elementary model. For a 39 word vocabulary, recognition rates reached 92% for the single speaker process, and 79% for an either-of-two-speaker process. Departures from visual pattern recognition techniques are introduced and proven effective. (Author)

Document Details

Document Type
Technical Report
Publication Date
Mar 01, 1972
Accession Number
AD0742441

Entities

People

  • Frankie S. Sutton
  • Keith G. Dailey

Organizations

  • Air Force Institute of Technology

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Automated Speech Recognition
  • Automatic
  • Computers
  • Discrete Fourier Transforms
  • Frequency
  • Pattern Recognition
  • Physiology
  • Recognition
  • Two Dimensional
  • Vocabulary

Readers

  • Approximation Theory.
  • Speech Processing/Speech Recognition.
  • Vision Science/Vision Psychology/Cognitive Neuroscience.

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation