Recognition Using Classification and Segmentation Scoring

Abstract

Traditional statistical speech recognition systems typically make strong assumptions about the independence of observation frames and generally do not make use of segmental information. In contrast, when the segmentation is known, existing classifiers can readily accommodate segmental information in the decision process. We describe an approach to connected word recognition that allows the use of segmental information through an explicit decomposition of the recognition criterion into classification and segmentation scoring. Preliminary experiments are presented, demonstrating that the proposed framework, using fixed length sequences of cepstral feature vectors for classification of individual phonemes, performs comparably to more traditional recognition approaches that use the entire observation sequence. We expect that performance gain can be obtained using this structure with additional, more general features.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1992
Accession Number
ADA457477

Entities

People

  • Mari Ostendorf
  • Owen Kimball
  • Robin Rohlicek

Organizations

  • Boston University

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Algorithms
  • Artificial Intelligence
  • Automated Speech Recognition
  • Computer Vision
  • Databases
  • Hidden Markov Models
  • Language
  • Machine Learning
  • Markov Models
  • Models
  • Natural Languages
  • Neural Networks
  • Probability
  • Recognition
  • Signal Processing
  • Test Sets
  • Word Recognition

Readers

  • Speech Processing/Speech Recognition.
  • Statistical inference.
  • Systems Analysis and Design

Technology Areas

  • AI & ML