An Automatic Speech Recognition System Using a Vocoder Input.
Abstract
Speech recognition is accomplished by off-line machine processes based on visual pattern recognition techniques. The fundamental system uses digitized data output from a KY-585 Vocoder, and two-dimensional discrete Fourier transforms with spatial frequency filters.. Two male speakers generated data for the computer processes which include a speaker adaptation routine. A relation to the human physiology is maintained through an elementary model. For a 39 word vocabulary, recognition rates reached 92% for the single speaker process, and 79% for an either-of-two-speaker process. Departures from visual pattern recognition techniques are introduced and proven effective. (Author)
Document Details
- Document Type
- Technical Report
- Publication Date
- Mar 01, 1972
- Accession Number
- AD0742441
Entities
People
- Frankie S. Sutton
- Keith G. Dailey
Organizations
- Air Force Institute of Technology