Isolated-Word Speech Recognition Using Multi-Section Vector Quantization Code Books.

Abstract

A new approach to isolated-word speech recognition using vector quantization (VQ) is examined. in this approach, words are recognized by means of sequences of VQ code books called multi-section code books. A separate multi-section code book is designed for each word in the recognition vocabulary by dividing the word into equal-length sections and designing a standard VQ code book for each section. Unknown words are classified by dividing them into corresponding sections, encoding them with the multi-section code books, and finding the multi-section code book that yields the smallest average distortion. For speaker-independent recognition of a 20-word vocabulary containing the digits, this approach achieves 95% recognition accuracy for the full vocabulary and 99% for the digits, in both causes with approximately 90% fewer distortion computations than typical dynamic-time-warping approaches. In addition, the approach achieves greater than 99% accuracy for speaker-dependent recognition of the digits with only 1 distortion computation per input frame per vocabulary word. The approach is described, detailed experimental results are presented and discussed, and computational requirements are analyzed. (Author)

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jul 13, 1984
Accession Number
ADA144433

Entities

People

  • D. K. Burton
  • J. E. Shore
  • J. T. Buck

Organizations

  • United States Naval Research Laboratory

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Accuracy
  • Algorithms
  • Automated Speech Recognition
  • Classification
  • Coding
  • Computations
  • Computer Programming
  • Computer Science
  • Data Compression
  • Databases
  • Distortion
  • Probability
  • Probability Distributions
  • Recognition
  • Security
  • Standards
  • Word Recognition

Fields of Study

  • Engineering

Readers

  • Computational Fluid Dynamics (CFD)
  • Library and Information Science
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation