Robust Word Boundary Detection in Spontaneous Speech Using Acoustic and Lexical Cues

Abstract

We consider the problem of word boundary detection in spontaneous speech utterances. Acoustic features have been well explored in the literature in the context of word boundary detection; however, in spontaneous speech of Switchboard-I corpus, we found that the accuracy of word boundary detection using acoustic features is poor (F-score approx. 0.63). We propose a new feature - that captures lexical cues in the context of the word boundary detection problem. We show that including proposed lexical feature along with the usual acoustic features, the accuracy of the word boundary detection improves considerably (F-score approx. 0.81). We also demonstrate the robustness of our proposed feature in presence of different noise levels for additive white and pink noise.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
May 26, 2009
Accession Number
AD1159558

Entities

People

  • Andreas Tsiartas
  • Panayiotis Georgiou
  • Prasanta K. Ghosh
  • Shrikanth Narayanan

Organizations

  • University of Southern California

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Accuracy
  • Acquisition
  • Additives (Chemicals)
  • Algorithms
  • Automated Speech Recognition
  • Automatic
  • Boundaries
  • California
  • Crossings
  • Data Acquisition
  • Detection
  • Electrical Engineering
  • Hypotheses
  • Language
  • Noise
  • Observation
  • Recognition
  • Switchboards
  • Test Sets
  • Universities
  • White Noise

Readers

  • Acoustical Oceanography.
  • Computational Linguistics
  • Spectroscopy.