Speech Recognition Using Multiple Features and Multiple Recognizers

Abstract

The purpose of this thesis is to demonstrate the feasibility of using multiple features and multiple recognizers to perform isolated word recognition. This is accomplished by running multiple independent recognition tests and fusing their results into a single recognition result. The speech data is recorded, and each word is extracted into a separate file. Eight features are calculated for each word. The features are calculated on 512-sample time slices and produce 16-component output vectors. Each of the three recognizers uses all eight features, producing a total of 24 error distance lists. These lists are fused by adding the error values corresponding to each word, and the word with the smallest fused error value is declared the recognition winner. Talker-dependent and talker-independent tests were run on a word set consisting of the digits zero through nine and the letters A through Z. The talker-dependent tests achieved accuracies between 87% and 100%, depending on the talker. The talker-independent tests achieved accuracies between 81% and 97%.
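
The fusion step described in the abstract (add the error values for each word across all lists, then pick the word with the smallest total) can be illustrated with a minimal sketch. This is not the thesis code: the function name `fuse_error_lists`, the dictionary representation of an error distance list, and the toy error values are all assumptions made for illustration. The sketch also assumes that every list covers the same vocabulary and that the error values are already on comparable scales, which the abstract does not state.

```python
from collections import defaultdict


def fuse_error_lists(error_lists):
    """Sum the error distance of each word across all lists.

    error_lists: iterable of dicts mapping word -> error distance
                 (hypothetical representation; one dict per
                 recognizer/feature pair, 24 in the thesis setup).
    Returns (winner, fused), where fused maps word -> summed error
    and winner is the word with the smallest summed error.
    """
    fused = defaultdict(float)
    for errors in error_lists:
        for word, err in errors.items():
            fused[word] += err
    winner = min(fused, key=fused.get)
    return winner, dict(fused)


# Toy data (invented for illustration): 3 lists over a 3-word vocabulary.
lists = [
    {"zero": 0.9, "one": 0.2, "two": 0.7},
    {"zero": 0.4, "one": 0.5, "two": 0.8},
    {"zero": 0.6, "one": 0.1, "two": 0.3},
]

winner, fused = fuse_error_lists(lists)
print(winner)
```

In this toy case the second list alone would pick "zero", but the summed errors favor "one", which is the kind of correction that fusing several independent results is meant to provide.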


Document Details

Document Type: Technical Report
Publication Date: Dec 03, 1991
Accession Number: ADA243791

Entities

People

Thomas F. Rathbun

Organizations

Air Force Institute of Technology
