A Phone-dependent Confidence Measure for Utterance Rejection

Abstract

An acoustic confidence measure for acceptance/rejection of recognition hypotheses for continuous speech utterances is proposed. This measure is useful for rejecting utterances that are out of domain, or contain out-of-vocabulary words or speech disfluencies. A phone-based approach is implemented so that a single global threshold can be applied to hypothesis rejection for any word sequence. Phone confidence is computed for each frame of speech as the posterior phone probability given the acoustic observation. Word sequence confidence is evaluated as the average phone confidence, either by weighting all frames equally or by normalizing by phone duration. The confidence measure is tested on a database of spoken company names. When normalized by phone duration, it achieves, in some cases with less computational expense, rejection performance comparable to a baseline system implementing a common filler model approach. When all frames are equally weighted, performance is substantially poorer.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
May 09, 1996
Accession Number
AD1002396

Entities

People

  • Michael Cohen
  • Thomas Chung
  • Victor Abrash
  • Ze'ev Rivlin

Organizations

  • SRI International

Tags

DTIC Thesaurus Topics

  • Automated Speech Recognition
  • Databases
  • Equations
  • Hidden Markov Models
  • Hypotheses
  • Language
  • Markov Models
  • Mathematics
  • Models
  • Observation
  • Ocean Surveillance
  • Probability
  • Recognition
  • Rejection
  • Sequences
  • Test Sets
  • Vocabulary

Readers

  • Phased Array Antenna Design.
  • Speech Processing/Speech Recognition.
  • Statistical inference.