Lattice-based Lexical Cues for Word Fragment Detection in Conversational Speech

Abstract

Previous approaches to the problem of word fragment detection in speech have focussed primarily on acoustic prosodic features [1], [2]. This paper proposes that the output of a continuous Automatic Speech Recognition (ASR) system can also be used to derive robust lexical features for the task. We hypothesize that the confusion in the word lattice generated by the ASR system can be exploited for detecting word fragments. Two sets of lexical features are proposed -one which is based on the word confusion, and the other based on the pronunciation confusion between the word hypotheses in the lattice. Classification experiments with a Support Vector Machine (SVM) classifier show that these lexical features perform better than the previously proposed acoustic-prosodic features by around 5.20% (relative) on a corpus chosen from the DARPA Transtac Iraqi-English (San Diego) corpus [3]. A combination of both these feature sets improves the word fragment detection accuracy by 11.50% relative to using just the acoustic-prosodic features.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2009
Accession Number
AD1171165

Entities

People

  • Kartik Audhkhasi
  • Panayiotis Georgiou
  • Shrikanth Narayanan

Organizations

  • University of Southern California

Tags

Communities of Interest

  • Autonomy
  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Automated Speech Recognition
  • Boundaries
  • Classification
  • Computational Linguistics
  • Computational Science
  • Computer Languages
  • Data Acquisition
  • Data Mining
  • Detection
  • Dimensionality Reduction
  • Electrical Engineering
  • Feature Selection
  • Hypotheses
  • Information Science
  • Language
  • Linguistics
  • Machine Learning
  • Machine Translation
  • Recognition
  • Speech
  • Supervised Machine Learning
  • Test Sets

Readers

  • Computational Linguistics
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation